| |
Abstract/Syllabus:
|
Church, George, HST.508 Genomics and Computational Biology, Fall 2002. (Massachusetts Institute of Technology: MIT OpenCourseWare), http://ocw.mit.edu (Accessed 09 Jul, 2010). License: Creative Commons BY-NC-SA
Next-generation DNA sequencing technology from University of California, Berkeley. (Image courtesy of the U.S. Department of Energy Genomes to Life Program.)
Course Highlights
Audio of each lecture session, in addition to downloadable problem sets and lecture notes, are available for this course.
Course Description
This course will assess the relationships among sequence, structure, and function in complex biological networks as well as progress in realistic modeling of quantitative, comprehensive, functional genomics analyses. Exercises will include algorithmic, statistical, database, and simulation approaches and practical applications to medicine, biotechnology, drug discovery, and genetic engineering. Future opportunities and current limitations will be critically addressed. In addition to the regular lecture sessions, supplementary sections are scheduled to address issues related to Perl, Mathematica and biology.
Special Features
Technical Requirements
Microsoft® Excel software is recommended for viewing the .xls files found on this course site. Free Microsoft® Excel viewer software can also be used to view the .xls files.
Mathematica® software is required to run the .nb files found on this course site.
Media player software, such as Quicktime® Player, RealOne™ Player, or Windows Media® Player, is required to run the .mp3 files found on this course site.
RealOne™ is a trademark or a registered trademark of RealNetworks, Inc.
Microsoft® is a registered trademark or trademark of Microsoft Corporation in the U.S. and/or other countries.
Mathematica® is a registered trademark of Wolfram Research, Inc.
QuickTime® is a trademark of Apple Computer, Inc., registered in the U.S. and other countries.
Windows Media® is a registered trademark or trademark of Microsoft Corporation in the U.S. and/or other countries.
Syllabus
Introductory courses in biology, computer science, and statistics. If you have any doubt about whether you have the equivalent experience, you should attend the appropriate sections which will focus on catching up with extra sections supplementing catch-up topics in greatest demand.
In addition to the lectures there will be section discussion meetings at the days and times above. Students will participate in at least one of those sections and form problem-solving and project teams consisting of at least one biology expert and one math/computer/engineering expert (or two to four people knowledgeable in both disciplines). Separate section on the basics of programming and molecular biology will be available in the first few weeks to even out the expected wide variation in backgrounds. Grades will be based on six problem sets, one final project, and participation in discussion sections covering the problems and about one scientific article per week. Your time commitment will be about 4 hours for classes and 6 to 12 additional hours per week, depending on background.
Calendar
|
|
|
|
LEC # |
|
|
|
TOPICS |
|
|
|
|
|
|
|
|
1 |
|
|
|
Intro 1: Course overview and introduction to the computational side of computational biology. Why use Perl & Mathematica? Write and run simple scripts. We will also assign sections addressing Biology, computing, and advanced topics. Questionaires due. |
|
|
|
|
|
|
|
|
2 |
|
|
|
Intro 2: Biological Side of Computational Biology; Central Dogma; Comparative Genomics; Models & Real World Applications.
Note: Please take your initial observations about Problem Set#1 to you first section meetings (i.e. check that you actually have access to Perl & Mathematica). |
|
|
|
|
|
|
|
|
3 |
|
|
|
DNA 1: Genome Sequencing, Polymorphisms, Populations, Statistics, Pharmacogenomics; Databases.
Note: Problem Set #1 is due at the start of class. (Answers will be posted 48 hrs later.) |
|
|
|
|
|
|
|
|
4 |
|
|
|
DNA 2: Dynamic Programming, Blast, Multi-alignment, HiddenMarkovModels. |
|
|
|
|
|
|
|
|
5 |
|
|
|
RNA 1: Microarrays, Library Sequencing and Quantitation Concepts.
Note: Problem Set #2 is due. |
|
|
|
|
|
|
|
|
6 |
|
|
|
RNA 2: Clustering by Gene or Condition and Other Regulon Data Sources Nucleic Acid Motifs; The Nature of Biological "proofs." |
|
|
|
|
|
|
|
|
7 |
|
|
|
Proteins 1: 3D Structural Genomics, Homology, Catalytic and Regulatory Dynamics, Function & Drug Design. |
|
|
|
|
|
|
|
|
8 |
|
|
|
Proteins 2: Mass Spectrometry, Post-synthetic Modifications, Quantitation of Proteins, Metabolites, & Interactions. |
|
|
|
|
|
|
|
|
9 |
|
|
|
Networks 1: Systems Biology, Metabolic Kinetic & Flux Balance Optimization Methods.
Note: Problem Set #4 is due. (#5 will be available but not due until lecture 14.) |
|
|
|
|
|
|
|
|
10 |
|
|
|
Networks 2: Molecular Computing, Self-assembly, Genetic Algorithms, Neural Networks. |
|
|
|
|
|
|
|
|
11 |
|
|
|
Networks 3: The Future of Computational Biology: Cellular, Developmental, Social, Ecological & Commercial Models. |
|
|
|
|
|
|
|
|
12 |
|
|
|
Project Presentations; All written project reports and overhead slides (for presentations) due. |
|
|
|
|
|
|
|
|
13 |
|
|
|
Project Presentations. |
|
|
|
|
|
|
|
|
14 |
|
|
|
Project Presentations; Problem Set #5 due. |
|
|
|
|
|
|
|
|
Further Reading:
|
Readings
|
|
|
|
|
|
LEC # |
|
|
|
TOPICS |
|
|
|
READINGS |
|
|
|
|
|
|
|
|
|
|
|
|
1 |
|
|
|
Intro 1: Course overview and introduction to the computational side of computational biology. Why use Perl & Mathematica? Write and run simple scripts. We will also assign sections addressing Biology, computing, and advanced topics. Questionaires due. |
|
|
|
Mount Chapter 1. Gibas & Jambeck (G&J) Chapters 1, 2, 12 (Those running Unix may find G&J Chaps. 3-5 useful). Students familiar with the above topics should look ahead to next week's readings. |
|
|
|
|
|
|
|
|
|
|
|
|
2 |
|
|
|
Intro 2: Biological Side of Computational Biology; Central Dogma; Comparative Genomics; Models & Real World Applications
Note: Please take your initial observations about Problem Set#1 to you first section meetings (i.e. check that you actually have access to Perl & Mathematica). |
|
|
|
For non-biologists: Primer on molecular biology and Molecular Biology for Computer Scientists. |
|
|
|
|
|
|
|
|
|
|
|
|
3 |
|
|
|
DNA 1: Genome Sequencing, Polymorphisms, Populations, Statistics, Pharmacogenomics; Databases.
Note: Problem Set #1 is due at the start of class. (Answers will be posted 48 hrs later.) |
|
|
|
Mount Chap. 2 & 7. G&J Chap. 6, 11, pp. 294-303.
Advanced: Pritchard, JK. Are Rare Variants Responsible for Susceptibility to Complex Diseases? Am. J. Hum Gen. 69:124. |
|
|
|
|
|
|
|
|
|
|
|
|
4 |
|
|
|
DNA 2: Dynamic Programming, Blast, Multi-alignment, HiddenMarkovModels. |
|
|
|
Mount Chap. 3. Durbin Chap. 3-5, G&J Chap. 7, 8 (pp. 191-9). Smith T. F., Waterman M. S. "Identification of Common Molecular Subsequences. J Mol Biol 1981 147:195-7. |
|
|
|
|
|
|
|
|
|
|
|
|
5 |
|
|
|
RNA 1: Microarrays, Library Sequencing and Quantitation Concepts.
Note: Problem Set #2 is due. |
|
|
|
Lockhart and Winzeler. Genomics, Gene Expression and DNA Arrays. Mount Chap. 10. G&J pp. 311-317. Nature 2000 405:827-36. |
|
|
|
|
|
|
|
|
|
|
|
|
6 |
|
|
|
RNA 2: Clustering by Gene or Condition and Other Regulon Data Sources Nucleic Acid Motifs; The Nature of Biological "proofs". |
|
|
|
Tavazoie et al. Systematic Determination of Genetic Network Architecture. G&J 1999, pp. 205-214. Nature Genetics 22:281-5. |
|
|
|
|
|
|
|
|
|
|
|
|
7 |
|
|
|
Proteins 1: 3D Structural Genomics, Homology, Catalytic and Regulatory Dynamics, Function & Drug Design.
Note: Problem Set #3 is due. |
|
|
|
G&J Chap. 9 pp. 215-29 and Chap. 10. Advanced: Thompson J., et al. "Analysis of Mutations at Residues A2451 and G2447 of 23S rRNA in the Peptidyltransferase Active Site of the 50S Ribosomal Subunit." PNAS 98, 16 (Jul 31 2001): 9002-9007. |
|
|
|
|
|
|
|
|
|
|
|
|
8 |
|
|
|
Proteins 2: Mass Spectrometry, Post-synthetic Modifications, Quantitation of Proteins, Metabolites, & Interactions. |
|
|
|
Ideker, et al. Integrated Genomic and Proteomic Analyses of a Systematically Perturbed Metabolic Network. Chap. 11 pp. 321-328. G&J. Science 292:929 (2001). |
|
|
|
|
|
|
|
|
|
|
|
|
9 |
|
|
|
Networks 1: Systems Biology, Metabolic Kinetic & Flux Balance Optimization Methods.
Note: Problem Set #4 is due. (#5 will be available but not due until after lecture 13.) |
|
|
|
Edwards and Palsson. Metabolic Flux Balance Analysis and the in Silico Analysis of Escherichia Coli K-12 Gene Deletions. BMC Bioinformatics 1, 1 (2000): 1.
Jamshidi, N., et al. Dynamic Simulation of the Human Red Blood Cell Metabolic Network. Bioinformatics 2001 Mar;17(3):286-287. |
|
|
|
|
|
|
|
|
|
|
|
|
10 |
|
|
|
Networks 2: Molecular Computing, Self-assembly, Genetic Algorithms, Neural Networks. |
|
|
|
Hopfield, J. J. Odor Space and Olfactory Processing: Collective Algorithms and Neural Implementation. PNAS 96, 22 (1999): 12506-12511. |
|
|
|
|
|
|
|
|
|
|
|
|
11 |
|
|
|
Networks 3: The Future of Computational Biology: Cellular, Developmental, Social, Ecological & Commercial Models. |
|
|
|
Bagowski, C. P., and J. E. Ferrell. Bistability in the JNK Cascade. Curr Biol 2001 Aug 7;11(15):1176-1182. |
|
|
|
|
|
|
|
|
|
|
|
|
12 |
|
|
|
Project Presentations; All written project reports and overhead slides (for presentations) due. |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
13 |
|
|
|
Project Presentations. |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
14 |
|
|
|
Project Presentations; Problem Set #5 due. |
|
|
|
|
|
|
|
|
|
|
|
Recommended Readings
Please note: These "recommended texts" etc. are not required. They are rough order of decreasing relevance. They do not perfectly cover key topics of interest including quantitative functional-genomics, linkage/population genomics, and biosystem modeling. Beginning after the first problem set we will assign readings from primary bioinformatics articles. These will focus on those subjects and help teach the important skills of reading cutting-edge research, but will cover only part of the lecture material.
Tisdall, James. Beginning Perl for Bioinformatics. O'Reilly & Assoc. 2001.
Mount, David. Bioinformatics: Sequence and Genome Analysis. CSHL 2001.
Hunter, L. (ed.) Molecular Biology for Computer Scientists. Recommended for all non-biologists.
Brown, T. A. Genomes. A good biology reference.
Durbin, Richard, S. Eddy, A. Krogh, and G. Mitchison. Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids. This book covers basic and advanced models for sequence analysis.
Weiss, Neil A. Introductory Statistics. 5th ed. Addison-Wesley, 1999.
Mathematica 4.2.
Gibas, Cynthia, and Per Jambeck. Developing Bioinformatics Computer Skills. O'Reilly, 2001.
Walsh, Linda. The Perl CD Bookshelf. O'Reilly. This CD contains 6 "books" that might supplement numerous free resources on the web: General Perl documents & download.
|
|
|
Rating:
0 user(s) have rated this courseware
Views:
16807
|
|
|
|
|