Cbio course, spring 2005, Hebrew University Computational Methods In Molecular Biology CS-67693, Spring 2005 School of Computer Science & Engineering Hebrew.

Slides:



Advertisements
Similar presentations
Basic Molecular Biology for CS374 Scientific Method: The widely held philosophy that a theory can never be proved, only disproved, and that all attempts.
Advertisements

RNA and Protein Synthesis
Review: The flow of genetic information in the cell is DNA  RNA  protein  The sequence of codons in DNA spells out the primary structure of a polypeptide.
. Class 1: Introduction. The Tree of Life Source: Alberts et al.
Introduction to Bioinformatics Spring 2008 Yana Kortsarts, Computer Science Department Bob Morris, Biology Department.
Basic Biology for CS262 OMKAR DESHPANDE (TA) Overview Structures of biomolecules How does DNA function? What is a gene? How are genes regulated?
1 Algorithms in Computational Biology (236522) Spring 2006 Lecturer: Golan Yona Office hours: Wednesday or Thursday 2-3pm (Taub 632, Tel 4356) TA: Itai.
1 Algorithms in Computational Biology (236522) Fall Lecture #1 Lecturer: Shlomo Moran, Taub 639, tel 4363 Office hours: Wednesday 12:30-13:30 (or.
Prepared with lots of help from friends... Metsada Pasmanik-Chor, Zohar Yakhini and NUMEROUS WEB RESOURCES. BioInformatics / Computational Biology Introduction.
DNA and RNA. I. DNA Structure Double Helix In the early 1950s, American James Watson and Britain Francis Crick determined that DNA is in the shape of.
. Introduction to Algorithms in Computational Biology Lecture 1 This class has been edited from Nir Friedman’s lecture which is available at
13.3: RNA and Gene Expression
RNA Transcription.
. Algorithms in Computational Biology (236522) Spring 2002 Lecturer: Shlomo Moran, Taub 639, tel 4363 Office hours Wednesday TA: Ydo Wexler,
12-3: RNA AND PROTEIN SYNTHESIS Biology 2. DNA double helix structure explains how DNA can be copied, but not how genes work GENES: sequence of DNA that.
Transcription Transcription is the synthesis of mRNA from a section of DNA. Transcription of a gene starts from a region of DNA known as the promoter.
Protein Synthesis Pages Part 3. Warm-Up: DNA DNA is a double stranded sequence of ___________ (smallest unit of DNA). 2.Short segments of.
. Algorithms in Computational Biology (236522) Spring 2002 Lecturer: Prof. Shlomo Moran TA: Ydo Wexler Lecture: Tuesday12:30-14:30, Taub 6 Tutorial: Tuesday11:30-12:30,
FROM DNA TO PROTEIN Transcription – Translation We will use:
Gene Expression and Gene Regulation. The Link between Genes and Proteins At the beginning of the 20 th century, Garrod proposed: – Genetic disorders such.
Chapter 13: RNA and Protein Synthesis
Chapter 13.1 and 13.2 RNA, Ribosomes, and Protein Synthesis
FROM DNA TO PROTEIN Transcription – Translation. I. Overview Although DNA and the genes on it are responsible for inheritance, the day to day operations.
1 TRANSCRIPTION AND TRANSLATION. 2 Central Dogma of Gene Expression.
From Gene to Protein A.P. Biology. Regulatory sites Promoter (RNA polymerase binding site) Start transcription DNA strand Stop transcription Typical Gene.
From Gene To Protein Chapter 17. From Gene to Protein The “Central Dogma of Molecular Biology” is DNA  RNA  protein Meaning that our DNA codes our RNA.
Lecture #3 Transcription Unit 4: Molecular Genetics.
Protein Synthesis Transcription. DNA vs. RNA Single stranded Ribose sugar Uracil Anywhere Double stranded Deoxyribose sugar Thymine Nucleus.
12.3 DNA, RNA, and Protein Objective: 6(C) Explain the purpose and process of transcription and translation using models of DNA and RNA.
Chapter 9 From DNA to Protein.
What is central dogma? From DNA to Protein
Transcription & TranslationNovember , 2012 W ARM U P … What are the differences between DNA & RNA?
Transcription and Translation How genes are expressed (a.k.a. How proteins are made) Biology.
Core Transcription and Translation
CHAPTER 13 RNA and Protein Synthesis. Differences between DNA and RNA  Sugar = Deoxyribose  Double stranded  Bases  Cytosine  Guanine  Adenine 
Protein Synthesis Transcription. DNA vs. RNA Single stranded Ribose sugar Uracil Anywhere Double stranded Deoxyribose sugar Thymine Nucleus.
Cell Signaling to Transcription and Translation
RNA Makin’ Proteins DNAMutations Show off those Genes!
Protein Synthesis. DNA mRNA DNA Cannot the nucleus Sends to the cytoplasm via Its base sequence (called a codon) determines the amino acid in proteins.
Introduction to Molecular Biology and Genomics BMI/CS 776 Mark Craven January 2002.
The Central Dogma of Molecular Biology DNA  RNA  Protein  Trait.
Microbial Genetics Structure and Function of Genetic Material The Regulation of Bacterial Gene Expression Mutation: Change in Genetic Material Genetic.
Higher Human Biology Unit 1 Human Cells KEY AREA 3: Gene Expression.
Gene Expression DNA, RNA, and Protein Synthesis. Gene Expression Genes contain messages that determine traits. The process of expressing those genes includes.
8.3 DNA Replication KEY CONCEPT DNA replication copies the genetic information of a cell.
1 Genes and Proteins The genetic information contained in the nucleotide sequence of DNA specifies a particular type of protein Enzymes = proteins that.
8.2 KEY CONCEPT DNA structure is the same in all organisms.
DNA to RNA to Protein. RNA Made up of 1. Phosphate 2. Ribose (a sugar) 3. Four bases RNA bases are: Adenine Guanine Cytosine Uracil (instead of thymine)
Section 20.2 Gene Expression
FROM DNA TO PROTEIN Transcription – Translation
Chapter 13.1: RNA Essential Questions
Review What monomers make up protein polymers?
Gene Expression: From Gene to Protein
Transcription and Translation
Protein Synthesis in Detail
Transcription and Translation
Gene Expression: From Gene to Protein
UNIT 5 Protein Synthesis.
What is RNA? Do Now: What is RNA made of?
How Proteins are Made Biology I: Chapter 10.
Central Dogma Central Dogma categorized by: DNA Replication Transcription Translation From that, we find the flow of.
Gene Expression: From Gene to Protein
12-3 RNA and Protein Synthesis
4/6 Objective: Explain the steps and key players in transcription.
From DNA to Protein Genotype to Phenotype.
An Overview of Gene Expression
Genes and Protein Synthesis Review
4/2 Objective: Explain the steps and key players in transcription.
12-3 RNA & Protein Synthesis
Segment 5 Molecular Biology Part 1b
Presentation transcript:

cbio course, spring 2005, Hebrew University Computational Methods In Molecular Biology CS-67693, Spring 2005 School of Computer Science & Engineering Hebrew University, Jerusalem

cbio course, spring 2005, Hebrew University Class 1: Introduction

cbio course, spring 2005, Hebrew University Introduction u What is Comp. Bio.? Why is it great? u What are the aims and basic concepts of this course u High level biological review: give basic bio background and motivation for tasks handled in the course u Administration…

cbio course, spring 2005, Hebrew University The Cell

cbio course, spring 2005, Hebrew University Example: Tissues in Stomach

cbio course, spring 2005, Hebrew University DNA Components Four nucleotide types: u Adenine u Guanine u Cytosine u Thymine Hydrogen bonds: u A-T u C-G

cbio course, spring 2005, Hebrew University The Double Helix Source: Alberts et al

cbio course, spring 2005, Hebrew University DNA Organization Source: Alberts et al

cbio course, spring 2005, Hebrew University Genome Sizes u E.Coli (bacteria)4.6 x 10 6 bases u Yeast (simple fungi) 15 x 10 6 bases u Smallest human chromosome 50 x 10 6 bases u Entire human genome 3 x 10 9 bases

cbio course, spring 2005, Hebrew University Related Computational Tasks u Need a way to reconstruct DNA sequence from fragments – major contribution of comp. bio. ! u Related: sequence comparison, sequence alignment

cbio course, spring 2005, Hebrew University DNA Duplication Source: Mathews & van Holde

cbio course, spring 2005, Hebrew University Genes The DNA strings include: u Coding regions (“genes”)  E. coli has ~4,000 genes  Yeast has ~6,000 genes  C. Elegans has ~13,000 genes  Humans have ~32,000 genes u Control regions  These typically are adjacent to the genes  They determine when a gene should be expressed u “Junk” DNA (unknown function)

cbio course, spring 2005, Hebrew University The Tree of Life Source: Alberts et al

cbio course, spring 2005, Hebrew University Evolution u Related organisms have similar DNA  Similarity in sequences of proteins  Similarity in organization of genes along the chromosomes u Evolution plays a major role in biology  Many mechanisms are shared across a wide range of organisms (e.g. orthologes)  During the course of evolution existing components are adapted for new functions (e.g paraloges)

cbio course, spring 2005, Hebrew University Evolution Evolution of new organisms is driven by u Diversity  Different individuals carry different variants of the same basic blue print u Mutations  The DNA sequence can be changed due to single base changes, deletion/insertion of DNA segments, etc. u Selection bias

cbio course, spring 2005, Hebrew University Related Computational Tasks u Phylogeny – not just theory!:  Rebuild the tree of life…  Infer relations between genes/pathways etc. across species  Learn models for changes and development  Major benefit: exploit the information we do have/observe to infer about the systems on which we have very little knowledge and observations….

cbio course, spring 2005, Hebrew University How Do Genes Code for Proteins? Transcription RNA Translation Protein DNA

cbio course, spring 2005, Hebrew University Transcription u Coding sequences can be transcribed to RNA u RNA nucleotides:  Similar to DNA, slightly different backbone  Uracil (U) instead of Thymine (T) Source: Mathews & van Holde

cbio course, spring 2005, Hebrew University RNA Editing

cbio course, spring 2005, Hebrew University Translation

cbio course, spring 2005, Hebrew University Translation u The ribosome attaches to the mRNA at a translation initiation site u Then ribosome moves along the mRNA sequence and in the process constructs a poly-peptide u When the ribosome encounters a stop signal, it releases the mRNA. The construct poly-peptide is released, and folds into a protein. u Translation is mediated by the ribosome u Ribosome is a complex of protein & rRNA molecules

cbio course, spring 2005, Hebrew University Translation Source: Alberts et al

cbio course, spring 2005, Hebrew University Translation Source: Alberts et al

cbio course, spring 2005, Hebrew University Translation Source: Alberts et al

cbio course, spring 2005, Hebrew University Translation Source: Alberts et al

cbio course, spring 2005, Hebrew University Translation Source: Alberts et al

cbio course, spring 2005, Hebrew University Genetic Code

cbio course, spring 2005, Hebrew University Transcription RNA Translation Protein DNA The Central Dogma Genes Experiments

cbio course, spring 2005, Hebrew University TF TFs Basal Promoter mRNA Gene 5’5’ 3’3’ Transcription start site 3’3’ 5’5’ RNA polymerase II 5’5’ Eukaryotic Transcription Regulation “Classical Model” u Composition of promoter region determines rate of transcription initiation u Combinations of TFs control the transcription of gene sets under specific conditions Genes TF

cbio course, spring 2005, Hebrew University From Data to Model >YKL112W Chr 11 ATGGACAAATTAGTCGTGAATTATTATGAATACAAGCACCCTAT AATTAATAAAGACCTGGCCATTGGAGCCCATGGAGGCAAAAA ATTTCCCACCTTGGGTGCTTGGTATGATGTAATTAATGAGTAC GAATTTCAGACGCGTTGCCCTATTATTTTAAAGAATTCGCATA GGAACAAACATTTTACATTTGCCTGTCATTTGAAAAACTGTCCA TTTAAAGTCTTGCTAAGCTATGCTGGCAATGCTGCATCCTCAG AAACCTCATCTCCTTCTGCAAATAATAATACCAACCCTCCGGG TACTCCTGATCATATTCATCATCATAGCAACAACATGAACAACG AGGACAATGATAATAACAATGGCAGTAATAATAAGGTTAGCAA TGACAGTAAACTTGACTTCGTTACTGATGATCTTGAATACCATC TGGCGAACACTCATCCGGACGACACCAATGACAAAGTGGAGT CGAGAAGCAATGAGGTGAATGGGAACAATGACGATGATGCTG ATGCCAACAACATTTTTAAACAGCAAGGTGTTACTATCAAGAA CGACACTGAAGATGATTCGATAAATAAGGCCTCTAT

cbio course, spring 2005, Hebrew University Many Related Computational Tasks… u Information is in the code book →:  How alternative splicing is determined and where?  Build models for regulation of genes at different levels of complexity  Relate genotype and phenotype: What are the expression patterns of some disease? How do they relate to sequence? What model can explain the observations? Can we predict phenomenon based on our models?

cbio course, spring 2005, Hebrew University Who came first? u Chicken or egg?  Egg u DNA or Protein?  RNA… u Thomas Cech & Sidney Altman ( 80’s !):  RNA as an “independent” molecule  Probably more close to the ancient “source”

cbio course, spring 2005, Hebrew University RNA roles u Messenger RNA (mRNA)  Encodes protein sequences u Transfer RNA (tRNA)  Adaptor between mRNA molecules and amino- acids (protein building blocks) u Ribosomal RNA (rRNA)  Part of the ribosome, a machine for translating mRNA to proteins u...

cbio course, spring 2005, Hebrew University Transfer RNA Anticodon: u matches a codon (triplet of mRNA nucleotides) Attachment site: u matches a specific amino-acid

cbio course, spring 2005, Hebrew University Related Computational Tasks u RNA secondary structure prediction:  based on CFG and CM u RNA coding area prediction u …

cbio course, spring 2005, Hebrew University RNA Editing Source: Mathews & van Holde

cbio course, spring 2005, Hebrew University Translation

cbio course, spring 2005, Hebrew University How do Proteins Perform their Rules? u Protein interact in various ways u Change conformations, conformations → function u Major Issues:  Their “active”/functional areas which interact  Their 3D structure

cbio course, spring 2005, Hebrew University Protein Structure u Proteins are poly- peptides of amino-acids u This structure is (mostly) determined by the sequence of amino-acids that make up the protein

cbio course, spring 2005, Hebrew University Protein Structure

cbio course, spring 2005, Hebrew University Related Computational Tasks u Protein 2D, 3D structure prediction u Identify sequence motifs/domains in proteins  Sequence similarity vs. functional similarity

cbio course, spring 2005, Hebrew University Course Goals u Review current tasks posed by modern molecular biology u Review and experiment with some of the tools/solutions currently found (e.g. BLAST, clustalw) u Gain some tools to handle such problems:  Dynamic programming  Probabilistic graphical models:  MM,HMM,CM,Trees  Representation, what principles justify them, Learning, Inference  Statistic tools: how to measure our confidence in our results?

cbio course, spring 2005, Hebrew University Course Goals u Computational tools in molecular biology: u We will cover computational tasks that are posed by modern molecular biology u We will discuss the biological motivation and setup for these tasks u We will understand the the kinds of solutions exist and what principles justify them

cbio course, spring 2005, Hebrew University Course’s Main Point

cbio course, spring 2005, Hebrew University Course’s Main Point Learn to do: Define the problem → Find comp. solution Four Aspects: Biological  What is the task? Algorithmic  How to perform the task at hand efficiently? Learning  How to adapt parameters of the task form examples Statistics  How to differentiate true phenomena from artifacts

cbio course, spring 2005, Hebrew University Example: Sequence Comparison Biological  Evolution preserves sequences, thus similar genes might have similar function Algorithmic  Consider all ways to “align” one sequence against another Learning  How do we define “similar” sequences? Use examples to define similarity Statistics  When we compare to ~10 6 sequences, what is a random match and what is true one

cbio course, spring 2005, Hebrew University Topics I Dealing with DNA/Protein sequences: u Genome projects and how sequences are found u Finding similar sequences u Models of sequences: Hidden Markov Models u Transcription regulation u Protein Families u Gene finding

cbio course, spring 2005, Hebrew University Topics II Gene Expression: u Genome-wide expression patterns u Data organization: clustering u Reconstructing transcription regulation u Recognizing and classifying cancers

cbio course, spring 2005, Hebrew University Topics III Models of genetic change: u Long term: evolutionary changes among species u Reconstructing evolutionary trees from current day sequences u Short term: genetic variations in a population u Finding genes by linkage and association

cbio course, spring 2005, Hebrew University Topics IV Protein World: u How proteins fold - secondary & tertiary structure u How to predict protein folds from sequences data alone u How to analyze proteins changes from raw experimental measurements (MassSpec) u 2D gels

cbio course, spring 2005, Hebrew University Class Structure u 2 weekly meeting  Mondays (Levin 8), Wednesdays (Kaplan) Grade: u Homework assignments: ~50% of the final grade. There will be up to seven homework assignments. These assignments will include theoretical problems, using bioinformatics tools and programming. u Final home assignment: ~20% of the final grade. u Final test: ~30% of the grade. u Class participation: A 5% bonus grade for students who actively participate in discussions during classes u Possible: oral presentation of any exercise to define grade!

cbio course, spring 2005, Hebrew University Exercises & Handouts u Check regularly