"Nothing in biology makes sense except in the light of evolution" Theodosius Dobzhansky.

Slides:



Advertisements
Similar presentations
Motivation “Nothing in biology makes sense except in the light of evolution” Christian Theodosius Dobzhansky.
Advertisements

Nothing in (computational) biology makes sense except in the light of evolution after Theodosius Dobzhansky (1970)
Bioinformatics Phylogenetic analysis and sequence alignment The concept of evolutionary tree Types of phylogenetic trees Measurements of genetic distances.
Protein Structure and Physics. What I will talk about today… -Outline protein synthesis and explain the basic steps involved. -Go over the Chemistry of.
PROTEOMICS 3D Structure Prediction. Contents Protein 3D structure. –Basics –PDB –Prediction approaches Protein classification.
No similarity vs no homology If two (complex) sequences show significant similarity in their primary sequence, they have shared ancestry, and probably.
1. Fossils 2. Geographic Distribution 3. Homologies.
Evidences of Evolution
Molecular Evolution Revised 29/12/06
Protein Tertiary Structure Prediction
MCB Class 2. TA: Amanda Dick Office: BioPhysics 402B.
MCB Class 1. Protein structure: Angles in the protein backbone.
Lecture 1 BNFO 135 Usman Roshan. Course overview Perl progamming language (and some Unix basics) –Unix basics –Intro Perl exercises –Programs for comparing.
"Nothing in biology makes sense except in the light of evolution" Theodosius Dobzhansky.
"Nothing in biology makes sense except in the light of evolution" Theodosius Dobzhansky.
Identifying functional residues of proteins from sequence info Using MSA (multiple sequence alignment) - search for remote homologs using HMMs or profiles.
Protein Tertiary Structure. Primary: amino acid linear sequence. Secondary:  -helices, β-sheets and loops. Tertiary: the 3D shape of the fully folded.
"Nothing in biology makes sense except in the light of evolution" Theodosius Dobzhansky.
Sequence similarity.
Protein structure Friday, 10 February 2006 Introduction to Bioinformatics Brigham Young University DA McClellan
"Nothing in biology makes sense except in the light of evolution" Theodosius Dobzhansky.
Protein structure:. Angles in the protein backbone.
Protein Tertiary Structure Prediction Structural Bioinformatics.
Protein Sequence Analysis - Overview Raja Mazumder Senior Protein Scientist, PIR Assistant Professor, Department of Biochemistry and Molecular Biology.
Computational Structure Prediction Kevin Drew BCH364C/391L Systems Biology/Bioinformatics 2/12/15.
Protein Tertiary Structure Prediction
An Introduction to Bioinformatics
Proteins  Organic compounds made of C, O, H, N and S  Building blocks – 20 different amino acids  Peptide and Polypeptides: Dehydration (Condensation)/
Insulin: Weight = 5733, 51 amino acids Glutamine Synthetase: Weight = 600,000, 468 amino acids.
CISC667, S07, Lec5, Liao CISC 667 Intro to Bioinformatics (Spring 2007) Pairwise sequence alignment Needleman-Wunsch (global alignment)
Fast Search Protein Structure Prediction Algorithm for Almost Perfect Matches1 By Jayakumar Rudhrasenan S Primary Supervisor: Prof. Heiko Schroder.
Bioinformatics 2 -- Lecture 8 More TOPS diagrams Comparative modeling tutorial and strategies.
Multiple Alignment and Phylogenetic Trees Csc 487/687 Computing for Bioinformatics.
Bioinformatics 2011 Molecular Evolution Revised 29/12/06.
Simplify the display  Show only alpha carbons  Turn off show backbone oxygen  Colour secondary structure  Turn 3 D display on.
Sequence Analysis CSC 487/687 Introduction to computing for Bioinformatics.
Question 1: Name, PDB codes Eric Martz (no research lab) Macromolecular visualization I chose*: 3onz HUMAN TETRAMERIC HEMOGLOBIN:
What is this and what does it show?. Objectives Know what a clade is and the evidence that makes an organism part of a clade. Interpret cladograms. Outcomes.
Molecular visualization
Protein Classification II CISC889: Bioinformatics Gang Situ 04/11/2002 Parts of this lecture borrowed from lecture given by Dr. Altman.
Protein Structure & Modeling Biology 224 Instructor: Tom Peavy Nov 18 & 23, 2009
Protein Evolution: Introduction to Protein Structure and Function protEvolEllsEmblSept2009 Please open the.
Evidence of Evolution "Nothing in biology makes sense except in the light of evolution" Theodosius Dobzhansky.
Protein Tertiary Structure. Protein Data Bank (PDB) Contains all known 3D structural data of large biological molecules, mostly proteins and nucleic acids:
Protein Sequence Analysis - Overview - NIH Proteomics Workshop 2007 Raja Mazumder Scientific Coordinator, PIR Research Assistant Professor, Department.
Protein Structure Prediction: Homology Modeling & Threading/Fold Recognition D. Mohanty NII, New Delhi.
X-ray crystallography – an overview (based on Bernie Brown’s talk, Dept. of Chemistry, WFU) Protein is crystallized (sometimes low-gravity atmosphere is.
Evidence of Evolution Nothing in biology makes sense except in the light of evolution. – Theodosius Dobzhansky.
DNA makes RNA  Transcription RNA makes Proteins  Translation Information flows from genes  proteins – But not the other way! (usually)
Protein Tertiary Structure Prediction Structural Bioinformatics.
Levels of Protein Structure. Why is the structure of proteins (and the other organic nutrients) important to learn?
Gene Expression DNA, RNA, and Protein Synthesis. Gene Expression Genes contain messages that determine traits. The process of expressing those genes includes.
Substitution Matrices and Alignment Statistics BMI/CS 776 Mark Craven February 2002.
7.3 Translation.
Simplify the display Show only alpha carbons
Crystal structure of the chemotaxis receptor methyltransferase CheR suggests a conserved structural motif for binding S-adenosylmethionine  Snezana Djordjevic,
MCB Class 1.
D.5: Phylogeny and Systematics
MCB Class 1.
Volume 112, Issue 7, Pages (April 2017)
MCB Class 1.
Volume 16, Issue 11, Pages (November 2008)
Volume 6, Issue 11, Pages (November 1998)
The 1.9 Å Structure of α-N-Acetylgalactosaminidase
Evidence of evolution Ch.12 and Ch.13.
Introduction to bioinformatics Lecture 5 Pair-wise sequence alignment
Enzymes Learning Objectives.
Presentation transcript:

"Nothing in biology makes sense except in the light of evolution" Theodosius Dobzhansky

Homology bird wing bat wing human arm by Bob Friedman

homology vs analogy Homology (shared ancestry) versus Analogy (convergent evolution) A priori sequences could be similar due to convergent evolution bird wing butterfly wing bat wingfly wing

What does Bioinformatics have to do with Molecular Evolution? Problem: Application of first principles does not (yet) work: Most scientists believe in the principle of reductionism (plus new laws and relations emerging on each level), e.g.: DNA sequence -> transcription -> translation -> protein folding -> protein function (catalytic and other properties) -> properties of the organism(s) -> ecology At several steps along the way from DNA to function our understanding of the chemical and physical processes involved is incomplete and computational simulations are so time consuming that prediction of protein function based on only a single DNA sequence is at present impossible (at least for a protein of reasonable size).

Related Proteins Present day proteins evolved through substitution and selection from ancestral proteins. Related proteins have similar sequence AND similar structure AND similar function. In the above mantra "similar function" can refer to: identical function, similar function, e.g.: identical reactions catalyzed in different organisms; or same catalytic mechanism but different substrate (malic and lactic acid dehydrogenases); similar subunits and domains that are brought together through a (hypothetical) process called domain shuffling, e.g. nucleotide binding domains in hexokinse, myosin, HSP70, and ATPsynthases.

homology Two sequences are homologous, if there existed an ancestral molecule in the past that is ancestral to both of the sequences Homology is a "yes" or "no" character (don't know is also possible). Either sequences (or characters share ancestry or they don't (like pregnancy). Molecular biologist often use homology as synonymous with similarity of percent identity. One often reads: sequence A and B are 70% homologous. To an evolutionary biologist this sounds as wrong as 70% pregnant.

Sequence Similarity vs Homology The following is based on observation and not on an a priori truth: If two (complex) sequences show significant similarity in their primary sequence, they have shared ancestry, and probably similar function. (although some proteins acquired radically new functional assignments, lysozyme -> lactalbumin).

The Size of Protein Sequence Space (back of the envelope calculation) For comparison the universe contains only about protons and has an age of about 5*10 17 seconds or 5*10 29 picoseconds. If every proton in the universe were a super computer that explored one possible protein sequence per picosecond, we only would have explored 5* sequences, i.e. a negligible fraction of the possible sequences with length 600 (one in about ). Consider a protein of 600 amino acids. Assume that for every position there could be any of the twenty possible amino acid. Then the total number of possibilities is 20 choices for the first position times 20 for the second position times 20 to the third.... = 20 to the 600 = 4* different proteins possible with lengths of 600 amino acids.

no similarity vs no homology If two (complex) sequences show significant similarity in their primary sequence, they have shared ancestry, and probably similar function. THE REVERSE IS NOT TRUE: PROTEINS WITH THE SAME OR SIMILAR FUNCTION DO NOT ALWAYS SHOW SIGNIFICANT SEQUENCE SIMILARITY for one of two reasons: a) they evolved independently (e.g. different types of nucleotide binding sites); or b) they underwent so many substitution events that there is no readily detectable similarity remaining. Corollary: PROTEINS WITH SHARED ANCESTRY DO NOT ALWAYS SHOW SIGNIFICANT SIMILARITY.

Protein structure:

Simplify the display  Show only alpha carbons  Turn off show backbone oxygen  Color secondary structure  Turn 3 D display on

1HEW  Backbone as ribbon  Sidechains only in binding pocket  Substrate in yellow  Non-polar residues of binding pocket in turquoise

1HEW  Backbone as alpha carbons  Sidechains only Tyr62  Substrate with VDW surface and CPK colors

1bmf

 Colored by chain  (discuss similarity to helicase and T3SS)

1bmf  Open pdb – check chain id  Select Chain E (by clicking in column designation in Control Panel)  Safe selected residues in current layer as betaTP  Repeat for betaDP, betaE, alpha TP, alphaDP, and alpha E.

The save menu

1bmf  Open the three beta SU  Color in in secondary structure (shift makes it act on the 3 layers simultaneously)

1bmf 3 beta subunits  Use Magic fit and interative magic fit to align the three subunits (use betDB as reference layer)  tab allows you to move through the three layers For an animated GIF see

Layer Info Window  Checkmarks in the vis and mov columns provide a fast way to change settings for the different layers

betaTB and betaE with RMS coloring compared to betaDP  Magic fit -> fit molecules -> RMS coloring  RED: Long wavelength = long distance between structures  BLUE: Short wavelength = short distance between structures  If you need to switch the reference layer, you can do so in the SwissModel menu

The 3 point alignment tool  If you want to compare the structure of very dissimilar proteins that use a similar substrate, sometimes it helps to align the substrates.  This can be done through the 3 point alignment tool.

Demos  Catalytic cycle of beta subunits  Three point alignment of beta and alpha subunits