Digital Biology: A survey of topics in bioinformatics and functional genomics 
BIOL-L/MLS-M 388 Spring 2023 
Course description
BIOL L388 DB
Class times and locations
Tues, Thur 11:30a - 12:45p (Credits: 3.0); Swain East (SE) 010
Tentative schedule (click the icon for a 
 version)
| DAY | DATE | LECTURE | TOPIC | LECTURER | 
| Tues | Jan 10 | 0.1 |  OrientationWhat is Digital Biology? Topics in bioinformatics, scope of class, and resources Motivation New virus outbreak: BBC Jan. 19, 2020 New virus outbreak: BBC Jan. 20, 2020 2023: XBB.1.5 NCBI SARS-CoV-2 Precedence: The Fastest Outbreak - connection to bioinformatics work Bioinformatics Databases and Computing Resources National Center for Biotechnology Information (NCBI), a great starting point for "anything" bioinformatics Mapping biological question onto computational problems: The modeling spiral Sequence gazing: the famous TATA-box AlphaFold  | Volker Brendel | 
| Thur | Jan 12 | 1.1 |  Module 1: Bioinformatics resources and workspacesUbuntu on Windows: ... a quick way to get to a command line terminal Virtual Machines: VirtualBox Linux Basics Basic UNIX shell tutorial Command-line bootcamp The UNIX Shell The UNIX Shell: Summary of Basic Commands vi(m) editor tutorial  | Volker Brendel | 
| Tues | Jan 17 | 1.2 |   Customizing your Linux work space Getting code with git Working with NCBI data  | Volker Brendel | 
| Thur | Jan 19 | 1.3 |   Command line access to NCBI data: EDirect Entrez Direct NCBI Sequence search fields MEDLINE/PubMed search fields  | Volker Brendel | 
| Tues | Jan 24 | 1.4 | Review: Command-line bootcamp | Volker Brendel | 
| Thur | Jan 26 | 1.5 |   Review: Statistical evaluation of sequence features Using generic Linux commands to process sequence data (at least for previews or consistency checks ...) Home Work Assignment 1 posted. Due: Feb. 2, 6:00pm  | Volker Brendel | 
| Tues | Jan 31 | 2.1 |   Module 2: Pairwise Sequence AlignmentMotivation How Do We Compare Biological Sequences? from Bioinformatics Algorithms: An Active Learning Approach  | Volker Brendel | 
| Thur | Feb 2 | 2.2 |   PWSA: Definition and representation of "alignments" Home Work Assignment 1 due at 6:00pm  | Volker Brendel | 
| Tues | Feb 7 | 2.3 |   Global alignment (Needleman-Wunsch) How to calculate the number of NW alignments  | Volker Brendel | 
| Thur | Feb 9 | 2.4 |   Scoring alignments How to calculate the optimal alignment score (and find an optimal alignment) PWSA: allowing "double-gaps" PWSA: local alignment (Smith-Waterman) Home Work Assignment 2 posted. Due: Feb. 17, 6:59pm  | Volker Brendel | 
| Tues | Feb 14 | 2.5 |   Sequence analysis with scores: Concepts and statistical foundations | Volker Brendel | 
| Thur | Feb 16 | 2.6 |   Sequence analysis with scores: Practice BLAST and Substitution scoring matrices Biological Sequence Analysis I (Lecturer: Dr. Andy Baxevanis) Slides for Sequence Analysis I presentation Handout for Sequence Analysis I presentation  | Volker Brendel | 
| Tues | Feb 21 | 3.1 |  Module 3: Basic Concepts in Molecular PhylogeneticsMolecular Phylogeny: Models The powers and pitfalls of parsimony  | Volker Brendel | 
| Thur | Feb 23 | 3.2 |  Lectures on molecular phylogeny (from Bioinformatics: An Active Learning Approach)  | Volker Brendel | 
| Tues | Feb 28 | 3.3 |   Methods I: Parsimony Home Work Assignment 3 posted. Due: March 7, 11:00pm  | Volker Brendel | 
| Thur | March 2 | 3.4 |   Methods II: Distance matrix methods  | Volker Brendel | 
| Tues | March 7 | 4.1 |   Module 4: Hidden Markov ModelsHidden Markov Models: Concepts and Algorithms Hidden Markov Models (from Bioinformatics: An Active Learning Approach)  | Volker Brendel | 
| Thur | March 9 | 4.2 |  Hidden Markov Models: Applications Profile Hidden Markov Models TagDust Tagdust2 on github GeneMark.hmm prokaryotic GENSCAN; paper see here Background: GeneMark article Review: Conditional Probability  | Volker Brendel | 
| Tues | March 14 | Spring Break | ||
| Thur | March 16 | Spring Break | ||
| Tues | March 21 | 5.1 |   Module 5: Genome Assembly and AnnotationEukaryotic gene finding: GeneMark AUGUSTUS  | Volker Brendel | 
| Thur | March 23 | 5.2 |   Genome Annotation: Evaluation Sensitivity, specificity, and all that sample paper How to evaluate gene structure prediction accuracy  | Volker Brendel | 
| Tues | March 28 | 5.3 |   Genome Sequencing: Illumina sequencing nanopore sequencing Assembly basics NCBI Assembly Help  | Volker Brendel | 
| Thur | March 30 | 5.4 |   Genome assembly: How do we assemble genomes? from Bioinformatics Algorithms: An Active Learning Approach  | Volker Brendel | 
| Tues | April 4 | 6.1 |   Module 6: Genetic VariationHome Work Assignment 5 posted. Due: April 11, 7:00pm Home Work Assignment 6 posted. Due: April 14, 11:00pm  | Volker Brendel | 
| Thur | April 6 | 6.2 |   Sequence Alignment Map format (SAM) SAM flags explained Pileup format (used by samtools) Relevant code: samtools bwa  | Volker Brendel | 
| Tues | April 11 | 6.3 |   Relevant file format specifications: Variant Call Format (VCF) Relevant code: NCBI SRA Toolkit freebayes  | Volker Brendel | 
| Thur | April 13 | 6.4 |   Gene Expression Analyses from Bioinformatics Algorithms: An Active Learning Approach MIT Lecture: Gene Regulatory Networks MIT Data Science: Clustering  | Volker Brendel | 
| Tues | April 18 | 7.1 |   Module 7: Protein StructurePDB: What is a protein? PDB: How enzymes work Guide to PDB PDB Molecule of the Month NCBI Protein  | Volker Brendel | 
| Thur | April 20 | 7.2 |   peptide bond Ramachandran plot ... very nice visualization thereof (thanks to Prof. Eric Martz) Secondary Structure 2StruCompare server Jpred - secondary structure prediction SPIDER3 HMMer PFAM foldit AlphaFold  | Volker Brendel | 
| Tues | April 25 | 8.1 |   Review: Managing workflows loops in bash  | Volker Brendel | 
| Thur | April 27 | 8.2 | Review | Volker Brendel | 
| Thur | May 4 | 12:20pm | Final Project submission due | Students | 
