Protein Disordered Regions and the Evolution of Eukaryotes Allan Wu Phar 201 Phil Bourne.

Slides:



Advertisements
Similar presentations
EVIDENCE OF EVOLUTION.
Advertisements

1 Introduction to Sequence Analysis Utah State University – Spring 2012 STAT 5570: Statistical Bioinformatics Notes 6.1.
Protein Targetting Prokaryotes vs. Eukaryotes Mutations
Heredity Review. Asexual reproduction Hydra (related to jelly-fish Single celled eukaryotes like this protist.
Using phylogenetic profiles to predict protein function and localization As discussed by Catherine Grasso.
Riboswitches Sharon Epstein 30/03/2006 Frontiers in Metabolome sciences Feinberg Graduate School.
Essentials of Glycobiology Lecture 43 June 10, 2004 Ajit Varki
Measuring the degree of similarity: PAM and blosum Matrix
Current Topics of Genomics and Epigenomics. Outline  Motivation for analysis of higher order chromatin structure  Methods for studying long range chromatin.
Introduction to yeast genetics Michelle Attner July 24, 2012.
Extracting Essential Features of Biological Networks Natalie Arkus, Michael P. Brenner School of Engineering and Applied Sciences Harvard University.
Systems Biology Biological Sequence Analysis
Sequence similarity.
CDC42 a low molecular weight GTP-binding protein originally designated G(p) and also called G25K Chromosomal location: 1p36.1 The CDC42 protein binds to.
The Evolution of Selfing in Arabidopsis thaliana Tang et al., 2007.
TGCAAACTCAAACTCTTTTGTTGTTCTTACTGTATCATTGCCCAGAATAT TCTGCCTGTCTTTAGAGGCTAATACATTGATTAGTGAATTCCAATGGGCA GAATCGTGATGCATTAAAGAGATGCTAATATTTTCACTGCTCCTCAATTT.
Alignment Statistics and Substitution Matrices BMI/CS 576 Colin Dewey Fall 2010.
Protein Bioinformatics Course
Synthetic biology: New engineering rules for emerging discipline Andrianantoandro E; Basu S; Karig D K; Weiss R. Molecular Systems Biology 2006.
CRB Journal Club February 13, 2006 Jenny Gu. Selected for a Reason Residues selected by evolution for a reason, but conservation is not distinguished.
Evolution and Scoring Rules Example Score = 5 x (# matches) + (-4) x (# mismatches) + + (-7) x (total length of all gaps) Example Score = 5 x (# matches)
Friday 17 rd December 2004Stuart Young Capstone Project Presentation Predicting Deleterious Mutations Young SP, Radivojac P, Mooney SD.
Mechanisms of Acquired Resistance to Epidermal Growth Factor Receptor Tyrosine Kinase Inhibitors (EGFR-TKI) in Non-Small Cell Lung Cancer (NSCLC) Victor.
Previously in Cell Bio Hypotheses for ‘problem’ in Graves’ Disease Positive signals (TRH or TSH) altered to increase amount or affinity for their receptor.
Genetics Control of Eukaryotic Genes Genetics The BIG Questions… How are genes turned on & off in eukaryotes? How do cells with the same genes.
PART II. Prediction of functional regions within disordered proteins Zsuzsanna Dosztányi MTA-ELTE Momentum Bioinformatics Group Department of Biochemistry.
Codon usage bias Ref: Chapter 9 Xuhua Xia dambe.bio.uottawa.ca.
Construction of Substitution Matrices
Sequence Alignment Csc 487/687 Computing for bioinformatics.
P53 Missense Mutation Cancer. Outline Disease related to p53 Role and regulation pathway Structure of p53 Missense mutation and consequences Experiment’s.
AP Biology Control of Eukaryotic Genes. AP Biology The BIG Questions… How are genes turned on & off in eukaryotes? How do cells with the same genes differentiate.
Biological Signal Detection for Protein Function Prediction Investigators: Yang Dai Prime Grant Support: NSF Problem Statement and Motivation Technical.
Codon usage bias Ref: Chapter 9
Control of Eukaryotic Genome
Recombination breakpoints Family Inheritance Me vs. my brother My dad (my Y)Mom’s dad (uncle’s Y) Human ancestry Disease risk Genomics: Regions  mechanisms.
AP Biology Control of Eukaryotic Genes.
Russell Group, Protein Evolution _________ ____ Rob Russell Cell Networks University of Heidelberg Interactions and Modules: the how and why of molecular.
NEW TOPIC: MOLECULAR EVOLUTION.
Construction of Substitution matrices
A genetic polymorphism in the Drosophila insulin receptor suggests adaptation to climate variation across continents Annalise Paaby a, Mark Blacket b,
The statistics of pairwise alignment BMI/CS 576 Colin Dewey Fall 2015.
Chapter 19 The Organization & Control of Eukaryotic Genomes.
Previously in Cell Bio Signals are detected via binding interactions Binding interactions governed by protein folding Protein folding dictated by amino.
In populations of finite size, sampling of gametes from the gene pool can cause evolution. Incorporating Genetic Drift.
TSC1/Hamartin and Facial Angiofibromas Biology 169 Ann Hau.
Previously in Cell Bio Signals are detected via binding interactions Binding interactions governed by protein folding Protein folding dictated by amino.
A high-resolution map of human evolutionary constraints using 29 mammals Kerstin Lindblad-Toh et al Presentation by Robert Lewis and Kaylee Wells.
1 From Bi 150 Lecture 0 October 4, 2012 An introduction to molecular biology... but you will learn the cell biology in this course.
Enhancers and 3D genomics Noam Bar RESEARCH METHODS IN COMPUTATIONAL BIOLOGY.
Substitution Matrices and Alignment Statistics BMI/CS 776 Mark Craven February 2002.
Last time … * Constraint on transcription factor binding sites Sites with the most ‘information content’ generally evolve slowest * Stabilizing selection.
Change in Pufs and their RNA InteractionsAnalogous change in transcription factors and their gene regulation Puf binding specificity tends to be conserved.
Bioinformatics Overview
The neutral theory of molecular evolution
EVIDENCE OF EVOLUTION.
Molecular Docking Profacgen. The interactions between proteins and other molecules play important roles in various biological processes, including gene.
Gene Regulation Ability of an organisms to control which genes are present in response to the environment.
binding sites 58 of the 473 unambiguously assigned phosphorylation sites are predicted by Scansite to be sites for binding. 50 of these correspond.
Sourav Roy School of Informatics Indiana University
Protein Bioinformatics Course
BIOL 2416 Chapter 1: Genetics: An Introduction
Eukaryote Regulation and Gene Expression
Epigenetics Study of the modifications to genes which do not involve changing the underlying DNA
Agenda 3/16 Eukaryotic Control Introduction and Reading
Control of Eukaryotic Genes
Control of Eukaryotic Genes
Control of Eukaryotic Genes
Chp.19: Eukaryotic Gene Regulation Notes Please Print!
TGFβ Signaling in Growth Control, Cancer, and Heritable Disorders
Control of Eukaryotic Genes
Presentation transcript:

Protein Disordered Regions and the Evolution of Eukaryotes Allan Wu Phar 201 Phil Bourne

Initial motivation for project: The Number of New Folds are Decreasing wthChart.do?content=fold-scop

Introduction to protein disordered regions They are regions of proteins without an inherent structure (though they might have transient structure). Many studies have found that they evolve faster than ordered regions. (Kim et. al, Dunker et. al, Brown et al.)‏ Amount of disordered regions in prokaryotes:  2-4% Amount of disordered regions in eukaryotes:  33%  In addition, much of these disordered regions appear to be in kinase targets, and proteins in signaling pathways. (Kim et. al.)‏ What are the advantage of having so much disordered regions?

Binding affinity variability and multiple ligand model Easy way to adjust binding affinity for proteins and ligand specificity independently Multiple proteins can bind to the same target (Dunker et. al.)‏

Example of protein staying disordered upon binding to ligand Sic1 (cyclin dependent kinase inhibitor)‏ However, the conformations In which the Protein exists mybe limited. (Mittag et. al.)‏

Hypothesis Why do disordered regions evolve faster? Reason protein disordered regions evolve faster is that they are more tolerant of the types of mutations which they could take and still remain functional. Idea of a greater mutation range, or range of amino acid replacement which would preserve function.

Preliminary study to prove hypothesis: Mapping Human SNPs onto Disordered Regions Method: Used Disprot as a database of protein disordered regions, and LS-SNP as a database of human SNPs Mapped all human SNPs onto Disprot, see what percentage falls on disordered regions and what percentage falls on ordered regions Calculate a “conservation” score based on BLOSUM62 matrix of SNPs that are in disordered vs. ordered regions using the equation S s in S and w in W score(w,s)/number of SNPs, where score(w,s) is the BLOSUM62 score between the wildtype amino acid and the amino acid in the SNP.

Results of preliminary study Percentage of disordered regions in 122 proteins: 23% Percentage of SNPs in disordered region: 31% Conservation score of SNPs in ordered regions: -0.4 Conservation score of SNPs in disordered regions: Redid the work of Brown et. al. and Kim et. al., but confirms that what happens on an evolutionary time- scale between two different species also occurs on a time-scale within a species. Interesting to note that the change in conservation score is more dramatic than change in the number of SNPs between ordered and disordered regions.

Implication of my preliminary study Difference in conservation score between ordered and disordered regions more dramatic than change in number of SNPs in number of SNPs in ordered vs. disordered regions. Disordered regions can evolve faster because they could substitute for a wider range of amino acids. Suggests an comparative genomic method of discovering protein disordered regions.

Mutation range of disordered regions is greater than ordered regions We could think of the mutation range as the range of amino acids that can be replaced without loosing function, which is greater in disordered than ordered regions.

Further experiments to confirm the hypothesis Find even more SNPs and see if their pattern of evolution is similar to this more limited study, and also include allelic requencies. Better methods for detection of protein disordered regions. Mutagenesis studies of protein disordered regions  Create SNPs using induced mutagenesis to compare how well ordered vs. disordered regions retain their functions after mutagenesis Test if ability to mutate more freely than ordered region give rise to new functions.

Implications of Study: Better method for detecting disordered regions in proteins Suggests an comparative genomic method of discovering protein disordered regions There is a signature pattern of evolution for protein disorder regions one could use this to determine disordered regions. Could take advantage of cheap sequencing, which may be cheaper than determining structure

Impact on Understanding Evolution of Eukaryotes It has been shown that the three kingdom uses different protein folds We could consider disorder regions to be “folds” which are used only by eukaryotes The use of disordered regions, and perhaps other adaptations, allowed them to evolve beyond prokaryotes

Evidence that disordered region allowed development of multicellularity Suggested by Ward et. al. Disordered regions evolve faster than ordered regions Disordered proteins tend to be involved in cell cycle, transcriptional and translational regulation (Fink 2005)‏ Many of them are kinases  Kinases are involved in differentiation  Many protooncogenes are kinases

Advantages of fast evolution of signaling networks Fast  Kinases cascades work at a time scale much faster than control at the transcription level Faster degradation of signaling proteins?  Since disordered proteins tend to be degraded faster in proteasomes, maybe it's easier to do it this way?

Citations Tanja Mittaga, Stephen Orlickyb, Wing-Yiu Choyc, Xiaojing Tangb, Hong Lina, Frank Sicherib, Lewis E. Kay Mike Tyers and Julie D. Forman-Kaya (2008) Dynamic equilibrium engagement of a polyvalent ligand with a single-site receptor. PNAS 105:17772– Philip M Kim1,5,*, Andrea Sboner1,5, Yu Xia2 and Mark Gerstein1,3,4 (2008) The role of disorder in interaction networks: a structural analysis. Molecular Systems Biology 4. K. DUNKER, E. GARNER, S. GUILLIOT. PROTEIN DISORDER AND THE EVOLUTION OF MOLECULAR RECOGNITION: THEORY, PREDICTIONS AND OBSERVATIONS (1998) Pac Symp Biocomput J. J. Ward, J. S. Sodhi, L. J. McGuffin, B. F. Buxton and D. T. Jones. Prediction and Functional Analysis of Native Disorder in Proteins from the Three Kingdoms of Life. (2004) J. Mol. Biol. 337: 635–645. Brown CJ, Takayama S, Campen AM, Vise P, Marshall TW, Oldfield CJ, Williams CJ, Dunker AK (2002) Evolutionary rate heterogeneity in proteins with long disordered regions. J Mol Evol 55: 104–110. Anthony L Fink. Natively unfolded proteins. Current Opinion in Structural Biology 2005, 15:35–41.