What forces constrain/drive protein evolution? Looking at all coding sequences across multiple genomes can shed considerable light on which forces contribute how much to the rates of protein evolution.

Presentation transcript:

1 What forces constrain/drive protein evolution? Looking at all coding sequences across multiple genomes can shed considerable light on which forces contribute how much to the rates of protein evolution.

2 Insights from Genomics: What features explain the variation in rates of protein evolution?
1. Rate of mutation/recombination of the locus (more recombination = more efficient selection = easier to select adaptive alleles)
2. Number of constrained residues (‘functional density’)
3. Protein fold (structure, stability, folding)
4. Protein essentiality (i.e. essential proteins evolve more slowly) … explains very little of the variation
5. Number of protein-protein interactions (‘connectivity’): initially reported, but now largely refuted as a global constraint
6. Pleiotropy (i.e. number of processes in which the protein is involved) … explains only 1% of variation in evo. rates
7. And the #1 best predictor is … expression level of the underlying transcript, which explains % of the variation in protein evo. rates!

3 Assessed the % variation explained by:
* expression level
* dispensability
* protein abundance
* codon bias
* gene length
* # protein-protein interactions
* centrality in protein-protein networks
Previous studies: linear and multiple regressions.
Here: they argue that the interdependence of these features makes multiple regression inappropriate … use principal component analysis instead.

4 Principal Component Analysis (PCA)
Takes complex (perhaps correlated) measurements for each item* and identifies uncorrelated ‘components’ (= abstract summaries of the measurements, each a weighted combination of the original variables). The first component (PC1) is the axis along which the items vary the most (i.e. it explains the largest share of the variance); each later component explains the most of what remains. Items that fall into subgroups often separate along the first few components.
* Each item (e.g. gene, protein, dog skull) can be plotted as a point in PC space.
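A minimal sketch of PCA on made-up data may help make the description above concrete. Everything here is an assumption for illustration: the four simulated "features" (three driven by one shared hidden factor, one independent) are hypothetical and are not the data from the study discussed in the slides, and the SVD route shown is just one common way to compute principal components.

```python
import numpy as np

def pca(X):
    """PCA via SVD. Rows of X are items (e.g. genes), columns are features.

    Returns (components, explained_variance_ratio, scores).
    """
    X = np.asarray(X, dtype=float)
    # Standardize each feature so that units do not dominate the result.
    Z = (X - X.mean(axis=0)) / X.std(axis=0, ddof=1)
    U, S, Vt = np.linalg.svd(Z, full_matrices=False)
    variances = S**2 / (Z.shape[0] - 1)   # variance along each component
    ratio = variances / variances.sum()   # share of total variance
    scores = Z @ Vt.T                     # coordinates of each item in PC space
    return Vt, ratio, scores

# Hypothetical toy data: three correlated features plus one independent one.
rng = np.random.default_rng(0)
n = 500
shared = rng.normal(size=n)  # hidden factor driving the first three features
X = np.column_stack([
    shared + 0.3 * rng.normal(size=n),  # stand-in for expression level
    shared + 0.3 * rng.normal(size=n),  # stand-in for codon bias
    shared + 0.3 * rng.normal(size=n),  # stand-in for protein abundance
    rng.normal(size=n),                 # stand-in for an unrelated feature
])

components, ratio, scores = pca(X)
print("share of variance per component:", np.round(ratio, 3))
print("PC1 loadings:", np.round(components[0], 3))
```

Because the first three columns are strongly correlated, PC1 absorbs most of their shared variation, which is the situation in which regressing on each feature separately becomes hard to interpret.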

5 Gene expression/codon bias/protein abundance (*all related) explain 43% of variation in Ka and 52% of variation in Ks! The same holds for Ka and Ks, but less so for Ka/Ks … because selection is likely acting on BOTH Ka AND Ks.
[Figure: % variation in Ka explained by 7 principal components]
Their model: selection is acting on translation to minimize protein unfolding

6 From Pal et al. Integrated View of Protein Evolution

7 Of course, phenotypes can also evolve through regulatory changes
Seminal paper by King & Wilson: # of genes can’t be the only answer … must involve regulatory differences

8 Of course, phenotypes can also evolve through regulatory changes
i.e. when, where, how much, and in what context a protein is present
[Figure: ORF with TF, RNAP, poly(A) tail, RBP, anti-sense RNA and RNAi]
Affect translation rates, RNA decay, RNA localization (some affect splice sites)

9 Of course, phenotypes can also evolve through regulatory changes
i.e. when, where, how much, and in what context a protein is present
[Figure: ORF with TF, RNAP, poly(A) tail, anti-sense RNA and RNAi]
Affect translation rates, RNA decay, RNA localization
Some effectors are encoded at the gene affected (local or cis effectors)

10 Of course, phenotypes can also evolve through regulatory changes
i.e. when, where, how much, and in what context a protein is present
[Figure: ORF with TF, RNAP, poly(A) tail, anti-sense RNA and RNAi]
Affect translation rates, RNA decay, RNA localization
Other effectors are encoded far from the gene affected (trans effectors)

11 The Coding vs. Noncoding Debate
Which type of change is ‘more important’ in evolution?
Are some genes/processes/functions more likely to evolve by one or the other?
What are the features that dictate coding vs. noncoding evolution?
A major advantage of non-coding regulatory changes: minimizing pleiotropic effects, because cis-regulatory information is often modular.

12 EVE regulatory elements in D. melanogaster: a model of modularity From Developmental Biology, 6th Edition

13 The Coding vs. Noncoding Debate
Which type of change is ‘more important’ in evolution?
Are some genes/processes/functions more likely to evolve by one or the other?
What are the features that dictate coding vs. noncoding evolution?
Before considering selection, it’s important to characterize how gene expression varies within and between species. What evolutionary forces act on gene expression regulation?

14 Next-generation (‘deep’) sequencing can also be applied to quantify mRNA (or other RNA) levels
[Figure: DNA (ORF) -> RNA (poly(A) tail) -> cDNA -> Seq reads]
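As a rough illustration of how read counts become expression levels, the sketch below converts per-gene read counts into transcripts per million (TPM). TPM is one widely used normalization and is an assumption here; the slides do not say which measure was used. The counts and gene lengths are invented.

```python
import numpy as np

def tpm(counts, lengths_bp):
    """Transcripts per million from raw read counts and gene lengths in bp."""
    counts = np.asarray(counts, dtype=float)
    lengths_kb = np.asarray(lengths_bp, dtype=float) / 1000.0
    rate = counts / lengths_kb        # reads per kilobase: corrects for gene length
    return rate / rate.sum() * 1e6    # rescale so the sample sums to one million

# Hypothetical example: genes A and B have equal counts, but B is twice as long.
counts = [100, 100, 300]
lengths_bp = [1000, 2000, 1000]
values = tpm(counts, lengths_bp)
print(np.round(values, 1))
```

Dividing by length first matters because a longer transcript yields more reads at the same abundance, so equal counts for A and B imply that B is expressed at half the level of A.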

15 What facilitates regulatory evolution?
By now, many studies have looked at natural variation in transcript abundance, simply to look qualitatively at which genes vary more/less.
Features that influence how variable a gene’s expression is across individuals:
* Gene dispensability: genes with variable expression within species are heavily enriched for non-essential genes
* Genes with upstream TATA elements: TATA regulation in yeast (and other organisms?) is associated with variable expression
* Redundancy: either gene or regulatory redundancy
* Modularity in regulation: genes with more upstream elements or greater environmental responsiveness

16 What facilitates regulatory evolution?
But some genes may not vary in expression because of constraint (i.e. purifying selection), while others may not vary in expression due to low rates of mutation/change.
These cases can be distinguished by measuring:
Mutational variance (Vm) = how much expression of a given gene varies in response to mutation in the ABSENCE of selection
Genetic variance (Vg) = how much expression of a given gene varies in natural populations (i.e. influenced by mutation + selection)
Vg/Vm = 1 means no constraint (expression variation in nature is the same as in lab-derived ‘mutation lines’ … must be little selection in nature)
Vg/Vm << 1 means much less variation in natural populations than in mutation lines … this must mean there has been purifying selection to reduce Vg
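The Vg/Vm comparison can be sketched numerically. This is a deliberately simplified illustration: it takes Vm as the plain variance of a gene's expression across mutation lines and Vg as the variance across natural isolates, ignoring the per-generation scaling and measurement-error corrections a real analysis would need. The function name and the expression values are hypothetical.

```python
import numpy as np

def constraint_ratio(mutation_line_expr, natural_expr):
    """Per-gene Vg/Vm. Rows are lines or isolates, columns are genes."""
    vm = np.var(np.asarray(mutation_line_expr, dtype=float), axis=0, ddof=1)
    vg = np.var(np.asarray(natural_expr, dtype=float), axis=0, ddof=1)
    return vg / vm

# Hypothetical expression values for two genes in four lines each.
mutation_lines = np.array([[1.0, 5.0],
                           [3.0, 5.5],
                           [5.0, 4.5],
                           [7.0, 5.0]])
natural_isolates = np.array([[3.9, 5.0],
                             [4.0, 5.5],
                             [4.1, 4.5],
                             [4.0, 5.0]])

ratios = constraint_ratio(mutation_lines, natural_isolates)
print(np.round(ratios, 4))
```

In this toy data the first gene varies widely under mutation alone but hardly at all in nature (Vg/Vm far below 1, consistent with purifying selection), while the second gene varies equally in both settings (Vg/Vm = 1, no evidence of constraint).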

17 Generated ‘mutation accumulation’ (MA) lines in C. elegans
For each line:
- propagated worms for 280 generations
- each generation, randomly picked 1 individual to generate the next generation
Measured whole-genome expression differences in each MA line
- calculated Vm
Measured whole-genome expression differences in each of 5 natural isolates
- calculated Vg
All genes had Vg/Vm < 1 … pervasive purifying selection on expression
Genes with the lowest Vg/Vm: enriched for signaling proteins and TFs
Genes with the highest Vg/Vm: enriched for carbon and amino acid metabolism

18 Expression can vary by the single gene (due to cis polymorphisms) or for modules of coregulated genes (due to trans-acting effects)
[Figure: ORFs, upstream elements, TF]