Evolution of genes & proteins

Slides:



Advertisements
Similar presentations
Micro Evolution -Evolution on the smallest scale
Advertisements

Evolution of genomes.
Genetica per Scienze Naturali a.a prof S. Presciuttini Homologous genes Genes with similar functions can be found in a diverse range of living things.
Whole Genome Duplications (Polyploidy) Made famous by S. Ohno, who suggested WGD can be a route to evolutionary innovation (focusing on neofunctionalization)
R ATES OF P OINT M UTATION. The rate of mutation = the number of new sequence variants arising in a predefined target region per unit time. Target region.
Chapter 19 Evolutionary Genetics 18 and 20 April, 2004
Sampling distributions of alleles under models of neutral evolution.
MIGRATION  Movement of individuals from one subpopulation to another followed by random mating.  Movement of gametes from one subpopulation to another.
Change in frequency of the unbanded allele (q) as a function of q for island populations. Equilibrium points a)Strong selection for q, little migration.
Duplication, rearrangement, and mutation of DNA contribute to genome evolution Chapter 21, Section 5.
Forward Genealogical Simulations Assumptions:1) Fixed population size 2) Fixed mating time Step #1:The mating process: For a fixed population size N, there.
14 Molecular Evolution and Population Genetics
Effective Population Size Real populations don’t satisfy the Wright-Fisher model. In particular, real populations exhibit reproductive structure, either.
From population genetics to variation among species: Computing the rate of fixations.
The origins & evolution of genome complexity Seth Donoughe Lynch & Conery (2003)
Molecular Evolution with an emphasis on substitution rates Gavin JD Smith State Key Laboratory of Emerging Infectious Diseases & Department of Microbiology.
Sources of variation. Mutation produces variation at multiple scales:
Evolutionary Concepts: Variation and Mutation 6 February 2003.
TGCAAACTCAAACTCTTTTGTTGTTCTTACTGTATCATTGCCCAGAATAT TCTGCCTGTCTTTAGAGGCTAATACATTGATTAGTGAATTCCAATGGGCA GAATCGTGATGCATTAAAGAGATGCTAATATTTTCACTGCTCCTCAATTT.
Population Genetics 101 CSE280Vineet Bafna. Personalized genomics April’08Bafna.
Chapter 3 Substitution Patterns Presented by: Adrian Padilla.
- any detectable change in DNA sequence eg. errors in DNA replication/repair - inherited ones of interest in evolutionary studies Deleterious - will be.
MIGRATION  Movement of individuals from one subpopulation to another followed by random mating.  Movement of gametes from one subpopulation to another.
The Evolution of Populations.  Emphasizes the extensive genetic variation within populations and recognizes the importance of quantitative characteristics.
Part VI and Chapter 20 Biology Sixth Edition Raven/Johnson (c) The McGraw-Hill Companies, Inc.
Lecture 25 - Phylogeny Based on Chapter 23 - Molecular Evolution Copyright © 2010 Pearson Education Inc.
Mehdi Layeghifard Evolutionary Mechanisms Underlying the Functional Divergence of Vertebrates’ Circadian Rhythm Genes.
Chapter 4: Clusters and repeats. 4.1 Introduction  Duplication of DNA is a major force in evolution.
Ch. 21 Genomes and their Evolution. New approaches have accelerated the pace of genome sequencing The human genome project began in 1990, using a three-stage.
Chapter 21 Eukaryotic Genome Sequences
Anatomy of a Genome Project A.Sequencing 1. De novo vs. ‘resequencing’ 2.Sanger WGS versus ‘next generation’ sequencing 3.High versus low sequence coverage.
1 Evolutionary Change in Nucleotide Sequences Dan Graur.
Chapter 24: Molecular and Genomic Evolution CHAPTER 24 Molecular and Genomic Evolution.
Models of Molecular Evolution III Level 3 Molecular Evolution and Bioinformatics Jim Provan Page and Holmes: Sections 7.5 – 7.8.
Coalescent Models for Genetic Demography
Selectionist view: allele substitution and polymorphism
Table 8.3 & Alberts Fig.1.38 EVOLUTION OF GENOMES C-value paradox: - in certain cases, lack of correlation between morphological complexity and genome.
LESSON # An Introduction to Evolution 2- The Means of Evolution: Microevolution.
Copyright © 2008 Pearson Education, Inc., publishing as Pearson Benjamin Cummings PowerPoint ® Lecture Presentations for Biology Eighth Edition Neil Campbell.
© 2014 Pearson Education, Inc. Objective 5: TSWBAT recognize that genetic variation makes evolution possible.
Evolution at the Molecular Level
Early Ideas About Evolution Darwin’s Observations Natural Selection Evidence of Evolution Evolutionary Biology Today
Mutation & genetic variation. Mutations gene – stretch of dna that codes for a distinctive type of rna or protein allele – versions of the same gene.
Evolution at the Molecular Level. Outline Evolution of genomes Evolution of genomes Review of various types and effects of mutations Review of various.
NEW TOPIC: MOLECULAR EVOLUTION.
By Mireya Diaz Department of Epidemiology and Biostatistics for EECS 458.
Measuring genetic change Level 3 Molecular Evolution and Bioinformatics Jim Provan Page and Holmes: Section 5.2.
Evolution at the Molecular Level. Outline Evolution of genomes Evolution of genomes Review of various types and effects of mutations Review of various.
Mutation: Origin of genetic variation sources of new alleles rate and nature of mutations sources of new genes highly repeated functional sequences.
Testing the Neutral Mutation Hypothesis The neutral theory predicts that polymorphism within species is correlated positively with fixed differences between.
06_20_cancer_age.jpg. Gene Mutation: Origins and Repair Processes GAATTC  GTATTC A  a Skin cancers from Xeroderma pigmentosum, a recessive hereditary.
In populations of finite size, sampling of gametes from the gene pool can cause evolution. Incorporating Genetic Drift.
Modelling evolution Gil McVean Department of Statistics TC A G.
LECTURE PRESENTATIONS For CAMPBELL BIOLOGY, NINTH EDITION Jane B. Reece, Lisa A. Urry, Michael L. Cain, Steven A. Wasserman, Peter V. Minorsky, Robert.
Evolution of Populations. Individual organisms do not evolve. This is a misconception. While natural selection acts on individuals, evolution is only.
Evolution of Populations
Random Change. In terms of genetics, it is any change in allele frequencies within a population. The H-W, provided conditions that evolution would not.
Lecture 6 Genetic drift & Mutation Sonja Kujala
Evolution of genes & proteins
Evolution of eukaryotic genomes
Evolution of gene function
The neutral theory of molecular evolution
Human Chimp How does DNA evolve? Nucleotide substitutions
The Evolution of Populations
Genomes and Their Evolution
Gene duplications: evolutionary role
Gene Density and Noncoding DNA
Chapter 7: Mechanisms of Mutation
Chapter 6 Clusters and Repeats.
Reminder The AP Exam registration is open in Naviance. The Exam is on Monday, May 13. I’ll let you know when the next test/homework will be.
Presentation transcript:

Evolution of genes & proteins Course code: ZOO560 Week 2 Evolution of genes & proteins Advanced molecular biology (ZOO560) by Rania M. H. Baleela is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License.

This week lectures content Evolution of duplicate genes (paralogs) Evolution of Pseudogenes Mathematical models of substitution Models of mutation: IAM ISM SMM Wright-Fisher Coalescence

Genome Evolution Genome changes due to Mutation Recombination Transposition Gene transfer Deletion and duplication major mechanism for the expansion in the size of genomes as organisms evolved from simple to more complex is duplication of whole genomes as well as duplication of specific sequences

Gene duplication is an important source of phenotypic change and adaptive evolution (Dennis et al., 2012)

Fates of duplicate genes 1.Subfunctionalization 2.Neofunctionalization 3.Nonfunctionalization (psuedogene) 4.Evolve in concert

Definitions Subfunctionalization: a pair of duplicate genes are said to be subfuctionalized if each of the 2 copies of the gene performs only a subset of the functions of the ancestral single copy gene. Neofunctionalization: a pair of duplicate genes in a population are said to be neofunctionalized if one of the 2 genes possesses a new selectively beneficial function that was absent in the population before the duplication (e.g. hepatocyte growth factor vs. plasminogen)

Duplicate genes fate models Duplication Nonfunctionalization Neofunctionalization Subfunctionalization (pseudogene)

A duplicated gene is unlikely to be fixed unless it acquires a novel & useful function

Sub/neo-functionalization Evolution of paralogs Sub/neo-functionalization

Gene duplication: history Time= 1936, Scientist= Bridges Finding= gene duplication of a chromosomal band in a mutant of the fruit fly Drosophila melanogaster, Observed result= extreme reduction in eye size Gene duplicates= paralogs

X-linked duplication The Bar gene duplication = chromosomal duplication or gene amplification Is any duplication of a region of DNA that contains a gene. http://www.nature.com/scitable/content/25319/pierce_9_7_large_2.jpg

Gene duplication May occur due to: an error in homologous recombination, a retrotransposition event, duplication of an entire chromosome.

Which genes undergo duplication? Features that might allow or prohibit the fixation of a duplicate copy of a gene in the population: Functional biases in the types of genes that survive in duplicate (e.g. in: yeast, humans, insects & bacteria). Belong to certain categories: such as genes encoding transcription factors, kinases and particular enzymes & transporters have unexpectedly high numbers of duplicates

Duplications can be advantageous, deleterious or neutral If an organism is exposed to a toxic environment, there may be an advantage in overproduction of detoxifying enzymes Disadvantage will result of overproduction of a protein that upsets the regulatory balance Most duplications are neutral=> their fate is determined by selection and drift

Transition from Australopithecus to Homo, why Transition from Australopithecus to Homo, why? and the beginning of neocortex expansion, how? The cortical development gene Slit-Robo Rho GTPase-activating protein 2 (SRGAP2) duplicated 3 times exclusively in humans: from (SRGAP2A) to (SRGAP2B), 2 larger duplications later copied SRGAP2B to (SRGAP2C) and to proximal (SRGAP2D). SRGAP2C is the most likely duplicate to encode a functional protein (one of the most fixed human-specific duplicate genes). Incomplete duplication created a novel gene function at birth 2–3 mya, Gene duplication is an important source of phenotypic change and adaptive evolution. We leverage a haploid hydatidiform mole to identify highly identical sequences missing from the reference genome, confirming that the cortical development gene Slit-Robo Rho GTPase-activating protein 2 (SRGAP2) duplicated three times exclusively in humans. We show that the promoter and first nine exons of SRGAP2 duplicated from 1q32.1 (SRGAP2A) to 1q21.1 (SRGAP2B) ∼3.4 million years ago (mya). Two larger duplications later copied SRGAP2B to chromosome 1p12 (SRGAP2C) and to proximal 1q21.1 (SRGAP2D) ∼2.4 and ∼1 mya, respectively. Sequence and expression analyses show that SRGAP2C is the most likely duplicate to encode a functional protein and is among the most fixed human-specific duplicate genes. Our data suggest a mechanism where incomplete duplication created a novel gene function—antagonizing parental SRGAP2 function—immediately “at birth” 2–3 mya, which is a time corresponding to the transition from Australopithecus to Homo and the beginning of neocortex expansion Dennis et al., 2012

Evolution of (paralogs) Pseudogenes “fossil records” Nonfunctionalization

Pseudogenes A pseudogene is a DNA sequence that is nearly identical to that of a functional gene, but contains one or more mutations, making it non-functional. First recognized and named pseudogenes during the late 1970s

Regulatory role has been observed for human pseudogenes. 2 types of pseudogenes 1. Unprocessed (duplicated): From genome duplication. Subsequently lost its function . Rapid degeneration observed in prokaryotes. 2. Processed (retrotransposed): From reverse transcription (no intron). Regulatory role has been observed for human pseudogenes.

Many changes have occurred in a beta-globin gene since it became a pseudogene

Pseudogene descendants of human ribosomal protein gene (RPL21)

Pseudogenes may represent reservoirs of genetic information that participate in the evolution of new genes, not only relics of inactivated genes whose fate is genomic extinction.

21-hydroxylase (cytP21) gene One of the cytochrome P450 gene family. cytP21 is located on chromosome 6 in humans. Has a paralogous pseudogene in the vicinity. 100s of mutations in the 21-hydroxylase gene have been described. 75% of them are due to gene conversion.

ξ-globin duplication The equine ξ-globin locus consists of a gene and a pseudogene.  The duplication of the ξ -globin genes predates placental mammals radiation. Because of repeated GC events, the gene and the pseudogene are identical in their align-able part.

“Natural selection merely modified, while redundancy created” Evolution by Gene Duplication Susumu Ohno, 1970 “Natural selection merely modified, while redundancy created”

Source of variation mutation

Mutation Mutation is any heritable change in the genetic material. Is the ultimate source of genetic variation. Include: Changes of DNA sequence (e.g. substitution) Chromosomal rearrangements (e.g. inversion)

Mutation rate= probability of mutation. Most wild type (wt) genes mutate at a very low rate Typical mutation rate= to new mutations/gene/generation. In a population of size N diploid organisms, there are 2N copies of each gene, each of which can mutate in any generation. Mutation rate= probability of mutation.

Substitution models

Types of point mutations In DNA sequences: Transitions: Point mutations substituting a purine (A or G) for a purine (A or G) or a pyrimidine (T or C) for a pyrimidine (T or C) . Transversions: substituting a purine (A or G) by a pyrimidine (T or C). Transitions are more common than transversions.

Mathematical models of substitution Are essential to study the dynamics of nucleotides substitutions: Jukes & Cantor one-parameter model (JC) (1969), Kimura‘s two-parameters model (K2P) (1980), Felsenstein model (F81) (1981), Hasegawa, Kishino & Yano model (HKY85) (1985), A general reversible model (REV) (Rodríguez et al., 1990).

Jukes & Cantor model (JC) assumes no bias in the direction of change so that substitutions occur randomly among the four types of nucleotides with equal probability.

Kimura‘s two-parameters model (K2P) incorporate the observation that the transition rate per site (α) may differ from that of transversion (β).

allows the frequencies of the 4 nucleotides to be different Felsenstein model (F81) allows the frequencies of the 4 nucleotides to be different But assumes that they are approximately the same over all the sequences.

Both K2P & F81 extend JC

Hasegawa, Kishino & Yano model (HKY85) merges K2P & F81 models By: allowing transitions & transversions to occur at different rates, allowing base frequencies to vary as well

The general reversible model (REV) adopts a more general approach, with a probability matrix of six parameters making it possible to generate any of the previous models.

Models of mutation

1. The infinite-alleles model (IAM) assumes that every new mutation that arises in a population creates a new allele that had not existed previously (Kimura & Crow, 1964).

2. The infinite-sites model (ISM) assumes that a new mutation alters sites (i.e. nucleotides) in sequences or alleles instead of creating an entire new allele and that it makes all polymorphic sites segregating for just two nucleotides if the mutation rate is sufficiently low (Kimura, 1969).

3. The stepwise-mutation model (SMM) initially developed for allozyme variation and then adopted for microsatellites mutations; assumes that mutation only occurs to adjacent states (Ohta and Kimura, 1973); in the case of microsatellites, different alleles have different number of repeats that mutation occurs only by adding or deleting one repeat. Unlike IAM, in the SMM mutation may produce alleles that are already present in the population (Hedrick, 2005).

4. The Wright-Fisher model is a simple representation of a population that Sewall Wright (1931) and Ronald Fisher (1930) used in developing the principles of population genetics (Hedrick, 2005). Assumes non-overlapping generations of individuals, random mating and a constant population size of N diploid individuals resulting in a Poisson distribution (Hey and Machado, 2003).

Non-overlapping generations mathematical model (adopted from Hartl and Clark, 1997, with modifications) In this model, all organisms from one generation die before the members of the next generation mature. It applies literally only to organisms with a very simple life history such as short-lived insects, but this model can be used in population genetics as a first approximation to populations with more complex life histories.

5. The coalescent approach Credited to Kingman (1982), in brief this theory depends on the Wright-Fisher model and works by tracing alleles back to their ancestors and calculating the times to the common ancestry allele. The point at which the ancestor allele is detected is called coalescence (Hedrick, 2005).

It takes almost half of the The coalescent The most recent common ancestor MRCA Tn = 2Ne Time is running backwards Time of coalescence for n lineages coalescent coalescent n(n-1) Tn= 4Ne D C B A It takes almost half of the time for the last two lineages to coalesce Sequences