Enzyme Evolution John Mitchell, February 2010. Theories of Enzyme Evolution.

Slides:



Advertisements
Similar presentations
TEMPLATE DESIGN © Statistical Coupling Analysis of the Photosystem II D1 Protein Janan Zhu 1 ; Nicholas Polizzi 2 ; 1.
Advertisements

Using phylogenetic profiles to predict protein function and localization As discussed by Catherine Grasso.
Shiri Freilich Janet Thornton’s group, EBI Cambridge University Relating the evolution of gene content to tissue specialization.
Pfam(Protein families )
Basics of Comparative Genomics Dr G. P. S. Raghava.
Biochemistry Enzymes.
Molecular Evolution Revised 29/12/06
Systems Biology Existing and future genome sequencing projects and the follow-on structural and functional analysis of complete genomes will produce an.
Alpha/Beta structures Barrels, sheets and horseshoes.
Enzymes. What is an enzyme? globular protein which functions as a biological catalyst, speeding up reaction rate by lowering activation energy without.
Protein Sectors: Evolutionary Units of Three-Dimensional Structure Najeeb Halabi, Olivier Rivoire, Stanislas Leibler, and Rama Ranganthan Cell 138, ,
Readings for this week Gogarten et al Horizontal gene transfer….. Francke et al. Reconstructing metabolic networks….. Sign up for meeting next week for.
Adaptive evolution of bacterial metabolic networks by horizontal gene transfer Chao Wang Dec 14, 2005.
General Microbiology (Micr300) Lecture 10 Microbial Genetics (Text Chapter: ; )
MCSG Site Visit, Argonne, January 30, 2003 Genome Analysis to Select Targets which Probe Fold and Function Space  How many protein superfamilies and families.
"Nothing in biology makes sense except in the light of evolution" Theodosius Dobzhansky.
Genetic Algorithms and Their Applications John Paxton Montana State University August 14, 2003.
BACKGROUND E. coli is a free living, gram negative bacterium which colonizes the lower gut of animals. Since it is a model organism, a lot of experimental.
Protein Modules An Introduction to Bioinformatics.
The Evidence for Evolution. Problem: How did the great diversity of life originate? Alternative Solutions: A. All living things were created at the same.
Evolution of Multidomain Proteins CS 374 – Lecture 10 Wissam Kazan.
Protein Classification A comparison of function inference techniques.
Chemical mechanism is dominant Nature selects the protein for divergent evolution from a pool of enzymes whose mechanism provide a partial mechanism, or.
The evolution and structural anatomy of small molecule metabolism pathways in Escherichia coli. Of Pathways and Proteins Stuart Rison and Sarah Teichmann.
Adventures in Computational Enzymology John Mitchell.
The Science of Life Biology unifies much of natural science
Cédric Notredame (30/08/2015) Chemoinformatics And Bioinformatics Cédric Notredame Molecular Biology Bioinformatics Chemoinformatics Chemistry.
Adventures in Computational Enzymology John Mitchell University of St Andrews.
Using reaction mechanism to measure enzyme similarity Noel M. O'Boyle, Gemma L. Holliday, Daniel E. Almonacid and John B.O. Mitchell Unilever Centre for.
Sequence analysis: what is a sequence? Linear arrangement of chemical subunits Contains information: 3-D arrangement determined by the sequence; 3-D defines.
Cell Biology for Clinical Pharmacy Students MD102 Module II: Cell Functions (Lecture # 8 ) Dr. Ahmed Sherif Attia
CRB Journal Club February 13, 2006 Jenny Gu. Selected for a Reason Residues selected by evolution for a reason, but conservation is not distinguished.
1 Orthology and paralogy A practical approach Searching the primaries Searching the secondaries Significance of database matches DB Web addresses Software.
The Chemistry of Protein Catalysis
MACiE – a Database of Enzyme Reaction Mechanisms Janet Thornton EMBL-EBI July 2006.
20.1 Structural Genomics Determines the DNA Sequences of Entire Genomes The ultimate goal of genomic research: determining the ordered nucleotide sequences.
John Mitchell Bioinformatics Chemoinformatics Computational Chemistry Theoretical Chemistry.
Calculating branch lengths from distances. ABC A B C----- a b c.
NIGMS Protein Structure Initiative: Target Selection Workshop ADDA and remote homologue detection Liisa Holm Institute of Biotechnology University of Helsinki.
Intel Confidential – Internal Only Co-clustering of biological networks and gene expression data Hanisch et al. This paper appears in: bioinformatics 2002.
Chapter 24: Molecular and Genomic Evolution CHAPTER 24 Molecular and Genomic Evolution.
ENZYMES. What are enzymes? Biological catalysts Most are proteins Some RNA Regulate metabolism Respond to changing needs of cell.
The metabolism problem: ingredients of an emerging theory Eörs Szathmáry & Chrisantha Fernando Collegium BudapestEötvös University Budapest.
Evolution of new protein topologies through multistep gene rearrangements Sergio G Peisajovich, Liat Rockah & Dan S Tawfik.
PREDICTION OF CATALYTIC RESIDUES IN PROTEINS USING MACHINE-LEARNING TECHNIQUES Natalia V. Petrova (Ph.D. Student, Georgetown University, Biochemistry Department),
Using blast to study gene evolution – an example.
Comparing and Classifying Domain Structures
Genome Biology and Biotechnology The next frontier: Systems biology Prof. M. Zabeau Department of Plant Systems Biology Flanders Interuniversity Institute.
Lesson 12 – Enzymes and Reactions
Daniel E. Almonacid, Gemma L. Holliday, Gail J. Bartlett, Noel M. O’Boyle, Peter Murray-Rust, Janet M. Thornton and John B. O. Mitchell Enzyme Mechanism.
WALT To understand the structure of enzymes and the mechanisms of how they function WILF To be able to answer questions on enzyme structure and function.
Daniel E. Almonacid and Patricia C. Babbitt
Statistical Tests We propose a novel test that takes into account both the genes conserved in all three regions ( x 123 ) and in only pairs of regions.
Chapter 26 Phylogeny and the Tree of Life
Taxonomy & Phylogeny. B-5.6 Summarize ways that scientists use data from a variety of sources to investigate and critically analyze aspects of evolutionary.
The Chemistry of Protein Catalysis John Mitchell University of St Andrews.
Basics of Comparative Genomics
Enzymes Biological Catalysts.
Enzymes Lesson 1.
Predicting Active Site Residue Annotations in the Pfam Database
There are four levels of structure in proteins
Enzymes.
Classification: understanding the diversity and principles of
Enzymes Biological Catalysts.
Amino Acids An amino acid is any compound that contains an amino group (—NH2) and a carboxyl group (—COOH) in the same molecule.
RNA Read the lesson title aloud to students..
Unit Genomic sequencing
Basics of Comparative Genomics
Chapter 18 Classification.
Presentation transcript:

Enzyme Evolution John Mitchell, February 2010

Theories of Enzyme Evolution

Overview (1) Divergent retrograde evolution, recruiting adjacent enzymes in pathway and constrained by binding similar molecules as substrates or products.

Overview (2) Divergent patchwork evolution, recruiting enzymes catalysing similar chemical reactions, typically from other pathways, constrained by supporting similar catalytic chemistry.

Overview (3) Convergent evolution, reinventing similar chemistry in a different evolutionary family.

Retrograde Evolution (Horowitz,1945) Pathways evolve backwards: the end product of the newly evolved reaction is the substrate of the existing one.

Picture adapted from Betts & Russell, 2009

Successive reactions in the pathway would therefore be catalysed by homologous enzymes Picture adapted from Betts & Russell, 2009

Patchwork Evolution (Jensen,1976) Recruitment of enzymes for new reactions was based on similarity of reactions catalysed and possibly on substrate ambiguity. It did not necessarily require the sequential and backwardly evolving progression of steps.

Patchwork recruitment: Recruit an enzyme with a chemically similar catalytic function from a quite different pathway.

Picture from Betts & Russell, 2009

The Importance of Moonlighting A moonlighting enzyme has a second job.

The Importance of Moonlighting Patchwork recruitment is most likely to occur when the original enzyme already has some low level of activity for catalysing a different reaction.

The Importance of Moonlighting This allows the enzyme to be recruited to carry out the new function.

Main reaction Minor side reaction

A starting point for evolving a new catalytic function! Main reaction Minor side reaction

How Might Divergent Evolution Occur? At the level of the gene, the most obvious idea is via gene duplication with one copy being free to mutate away from its original function

Picture adapted from Todd, Orengo & Thornton, 1999

Original functionNew function

Picture adapted from Todd, Orengo & Thornton, 1999 Original functionNew function This way, the original function is maintained and a new one evolved.

How Might Divergent Evolution Occur? There are other possible routes to diverged functions (Orengo, Thornton, Todd & others)

Picture adapted from Todd, Orengo & Thornton, 1999

Models for Divergent Enzyme Evolution Two main models of divergent enzyme evolution discussed by Gerlt and Babbitt: Chemistry is conserved, substrate specificity changes. Substrate binding is conserved, chemistry changes. Evidence for both models in different cases, but conserved chemistry is likely to be more common.

Models for Divergent Enzyme Evolution Two main models of divergent enzyme evolution discussed by Gerlt and Babbitt: Chemistry is conserved, substrate specificity changes. If true, implies that chemical reactions are harder to evolve than is substrate binding. Fits well with patchwork recruitment model.

Enzyme Nomenclature and Classification EC Classification Class Subclass Sub-subclass Serial number

Phylogeny of Enzymes (Caetano-Annolés) Taking advantage of the genomic data now available, Caetano-Annolés and group attempted to build a phylogeny of enzymes based on the occurrence of their folds in sequenced genomes.

Phylogeny of Enzymes (Caetano-Annolés) In principle, this could “age” enzymes – the more universal the older.

Picture from Caetano-Annolés et al. (2007)

The diverse mix of “ages” within metabolic networks seems to support the patchwork model. Picture from Kim et al. (2006)

Using similar ideas of universality of superfamilies, tried to reconstruct proteome of LUCA. Interesting, but speculative?

Both Divergent & Convergent Evolution are Important Divergent evolution leads to one fold performing a plurality of functions. Convergent evolution leads to a plurality of folds performing the same function

Takes advantage of folds being both structural and evolutionary units of protein structure.

Divergence

Convergence

Those figures are based on available structures. As more become available, we will find more functions for existing folds, and more folds with existing functions. So these are underestimates! Convergent Divergent

Caveat: Our working definition of “Convergent Evolution” is dependent on the EC classification, which is not a perfect gold standard.

MACiE Mechanism, Annotation and Classification in Enzymes. The MACiE Database G.L. Holliday et al., Nucl. Acids Res., 35, D515-D520 (2007) Gemma Holliday, Daniel Almonacid, Noel O’Boyle, Janet Thornton, Peter Murray-Rust, Gail Bartlett, James Torrance, John Mitchell

Enzyme Nomenclature and Classification EC Classification Class Subclass Sub-subclass Serial number

The EC Classification  Deals with overall reaction, not mechanism  Reaction direction arbitrary  Cofactors and active site residues ignored  Doesn’t deal with structural and sequence information  However, it was never intended to do so

A New Representation of Enzyme Reactions?  Should be complementary to, but distinct from, the EC system  Should take into account:  Reaction Mechanism  Structure  Sequence  Active Site residues  Cofactors  Need a database of enzyme mechanisms

MACiE Mechanism, Annotation and Classification in Enzymes. MACiE Database

Difficulties of Hierarchical Classification Very similar mechanisms can end up in different first level classes. In the case of phosphoinositide-specific phospholipases C, this is due to a slow final hydrolysis step occurring in one of the two enzymes.

Classifying Related Enzymes: Phosphoinositide-specific Phospholipases C Eukaryotic (rat) Cell Signalling Multidomain Catalytic TIM Barrel EC Hydrolase Final hydrolysis step Prefers bisphosphate Acid-base mechanism Calcium dependent Prokaryotic (B. cereus) Virulence factor Single domain Catalytic TIM Barrel EC Lyase No/slow final hydrolysis Disfavours bisphosphate Acid-base mechanism Not calcium dependent Evolutionarily related

Similar reactions end up far apart EC EC

Difficulties of Hierarchical Classification Different mechanisms can occur with exactly the same EC number. MACiE has six beta-lactamases, all with different mechanisms but the same overall reaction.

MACiE Mechanisms are Sourced from the Literature

Coverage of MACiE Representative – based on a non-homologous dataset, and chosen to represent each available EC sub-subclass.

EC Coverage of MACiE Representative – based on a non-homologous dataset, and chosen to represent each available EC sub-subclass. Structures exist for: 6 EC EC EC EC MACiE covers: 6 EC EC EC EC

Convergent Evolution of Enzyme Function D.E. Almonacid et al., PLoS Computational Biology, accepted N.M. O’Boyle et al., J. Molec. Biol., 368, (2007)

We use a combination of bioinformatics & chemoinformatics to identify similarities between enzyme-catalysed reaction mechanisms

Similarity of Overall Reactions: Compare Bond Changes

Similarity of Mechanisms: Compare Steps

Just like sequence alignment! We can measure their similarity … Similarity of Mechanisms: Compare Steps

Carrying out an analysis of pairwise similarity of reactions in MACiE...

Find only a few similar pairs

Identify convergent evolution

Check MACiE for duplicates!

Mechanistic similarity is only weakly related to proximity in the EC classification

EC in common c c.s c.s.ss.-

Similarity of Analogous Reactions We take all possible pairs of analogous enzyme reactions from MACiE Analogous means that they carry out similar functions (EC conserved) and that the enzymes are not homologous We find 95 analogous pairs (convergent evolution).

43 out of 95 pairs that are analogous according to EC have no significant reaction or mechanistic similarity Shared EC sub-subclass and Bond Change based reaction similarity are quite different criteria.

Has implications for using EC as a gold standard of similarity in studies of evolution. Shared EC sub-subclass and Bond Change based reaction similarity are quite different criteria.

One third of analogous pairs with significantly similar overall reactions have significantly similar mechanisms.

Most often, similar overall reactions that evolve convergently will have quite different mechanisms.

For analogous pairs, we find that mechanistic similarity is less than overall similarity (almost always); these lie in the lower triangle.

Conclusions for Analogous Enzymes Conservation of EC sub-subclass does not imply quantitative reaction similarity. One third of analogous pairs with significantly similar overall reactions have significantly similar mechanisms. Mechanistic similarity is less than overall similarity (unlike homologues).

Theories of Enzyme Evolution

Overall Conclusions Both convergent and divergent evolution are important. Patchwork evolution is quite common. Retrograde evolution does sometimes occur (adjacent reactions in pathways have similar enzymes more often than by chance), but is not strongly prevalent. Consistent with the above, conservation of chemistry is more important than conservation of substrate binding.

ACKNOWLEDGEMENTS Cambridge Overseas Trust