Families with >5 genes are more common in plants than in animals adapted from Lockton S, Gaut BS. 2005. Trends Genet 21: 60-65.

Slides:



Advertisements
Similar presentations
Genetica per Scienze Naturali a.a prof S. Presciuttini Homologous genes Genes with similar functions can be found in a diverse range of living things.
Advertisements

Gene an d genome duplication Nadia El-Mabrouk Université de Montréal Canada.
Whole Genome Duplications (Polyploidy) Made famous by S. Ohno, who suggested WGD can be a route to evolutionary innovation (focusing on neofunctionalization)
Chapter 19 Evolutionary Genetics 18 and 20 April, 2004
Phylogenetic Trees Understand the history and diversity of life. Systematics. –Study of biological diversity in evolutionary context. –Phylogeny is evolutionary.
Summer Bioinformatics Workshop 2008 Comparative Genomics and Phylogenetics Chi-Cheng Lin, Ph.D., Professor Department of Computer Science Winona State.
History, protohistory and prehistory of the Arabidopsis thaliana chromosome complement Henry Yves et al 2006, in press.
Duplication, rearrangement, and mutation of DNA contribute to genome evolution Chapter 21, Section 5.
Molecular Clock I. Evolutionary rate Xuhua Xia
The Statistical Significance of Max-gap Clusters Rose Hoberman David Sankoff Dannie Durand.
Genetica per Scienze Naturali a.a prof S. Presciuttini Human and chimpanzee genomes The human and chimpanzee genomes—with their 5-million-year history.
BIOE 109 Summer 2009 Lecture 4- Part I Mutation and genetic variation.
02_13.jpg Human chromosome 4 02_15.jpg 02_15_2.jpg.
Genetica per Scienze Naturali a.a prof S. Presciuttini Mutation Rates Ultimately, the source of genetic variation observed among individuals in.
Bioinformatics Unit 1: Data Bases and Alignments Lecture 3: “Homology” Searches and Sequence Alignments (cont.) The Mechanics of Alignments.
Bioinformatics Genome anatomy Comparisons of some eukaryotic genomes Allignment of long genomic sequences Comparative genomics Oxford Grid Reconstruction.
Outline Arabidopsis gene expression (MPSS) Two evolutionary issues in the evolution of expression profiles: –Physical clustering of co-expressed genes.
Plant genomes: phenotypes evolving by new rules Todd J. Vision Department of Biology University of North Carolina at Chapel Hill.
Genetica per Scienze Naturali a.a prof S. Presciuttini Evolution of the eukaryotic nuclear genome The nuclear genome of eukaryotes is thought to.
Scientific FieldsScientific Fields  Different fields of science have contributed evidence for the theory of evolution  Anatomy  Embryology  Biochemistry.
TGCAAACTCAAACTCTTTTGTTGTTCTTACTGTATCATTGCCCAGAATAT TCTGCCTGTCTTTAGAGGCTAATACATTGATTAGTGAATTCCAATGGGCA GAATCGTGATGCATTAAAGAGATGCTAATATTTTCACTGCTCCTCAATTT.
Origins and impact of constraints in evolution of gene families Boris E. Shakhnovich and Eugene V.Koonin Genome Research 2006, October 19 Stella Veretnik.
What is comparative genomics? Analyzing & comparing genetic material from different species to study evolution, gene function, and inherited disease Understand.
Chapter 5 Genome Sequences and Gene Numbers. 5.1Introduction  Genome size vary from approximately 470 genes for Mycoplasma genitalium to 25,000 for human.
Molecular Clock. Rate of evolution of DNA is constant over time and across lineages Resolve history of species –Timing of events –Relationship of species.
Lecture 25 - Phylogeny Based on Chapter 23 - Molecular Evolution Copyright © 2010 Pearson Education Inc.
Genome Alignment. Alignment Methods Needleman-Wunsch (global) and Smith- Waterman (local) use dynamic programming Guaranteed to find an optimal alignment.
20.1 Structural Genomics Determines the DNA Sequences of Entire Genomes The ultimate goal of genomic research: determining the ordered nucleotide sequences.
ANALYSIS AND VISUALIZATION OF SINGLE COPY ORTHOLOGS IN ARABIDOPSIS, LETTUCE, SUNFLOWER AND OTHER PLANT SPECIES. Alexander Kozik and Richard W. Michelmore.
TGCAAACTCAAACTCTTTTGTTGTTCTTACTGTATCATTGCCCAGAATAT TCTGCCTGTCTTTAGAGGCTAATACATTGATTAGTGAATTCCAATGGGCA GAATCGTGATGCATTAAAGAGATGCTAATATTTTCACTGCTCCTCAATTT.
whole-genome duplications and large segmental duplications… …seem to be a common feature in eukaryotic genome evolution …play a crucial role in the evolution.
1 Genome Evolution Chapter Introduction Genomes contain the raw material for evolution; Comparing whole genomes enhances – Our ability to understand.
Chapter 24: Molecular and Genomic Evolution CHAPTER 24 Molecular and Genomic Evolution.
Models of Molecular Evolution III Level 3 Molecular Evolution and Bioinformatics Jim Provan Page and Holmes: Sections 7.5 – 7.8.
Comparative genomics Haixu Tang School of Informatics.
Using blast to study gene evolution – an example.
Molecular and Genomic Evolution Getting at the Gene Pool.
Table 8.3 & Alberts Fig.1.38 EVOLUTION OF GENOMES C-value paradox: - in certain cases, lack of correlation between morphological complexity and genome.
Human Genomics. Writing in RED indicates the SQA outcomes. Writing in BLACK explains these outcomes in depth.
Significance Tests for Max-Gap Gene Clusters Rose Hoberman joint work with Dannie Durand and David Sankoff.
5.4 Cladistics Essential idea: The ancestry of groups of species can be deduced by comparing their base or amino acid sequences. The images above are both.
Comparative genomics of Gossypium and Arabidopsis: Unraveling the consequences of both ancient and recent polyploidy Junkang Rong, John E. Bowers, Stefan.
Chapter 3 The Interrupted Gene.
Lesson Overview 17.4 Molecular Evolution.
Opener Chapter 24 – Genome Evolution. Comparative Genomes Powerful tool for exploring evolutionary divergence among organisms Footprints on the evolutionary.
How many genes are there?
Statistical Tests We propose a novel test that takes into account both the genes conserved in all three regions ( x 123 ) and in only pairs of regions.
Supplementary Fig. 1 Supplementary Figure 1. Distributions of (A) exon and (B) intron lengths in O. sativa and A. thaliana genes. Green bars are used for.
The Haplotype Blocks Problems Wu Ling-Yun
Taxonomy & Phylogeny. B-5.6 Summarize ways that scientists use data from a variety of sources to investigate and critically analyze aspects of evolutionary.
Supplementary Fig. 1 Supplementary Figure 1. Distributions of (A) exon and (B) intron lengths in O. sativa and A. thaliana genes. Green bars are used.
Evolution of eukaryotic genomes
Reconstructing the Evolutionary History of Complex Human Gene Clusters
Evolution of gene function
Genetics and Evolutionary Biology
Lesson Overview 17.4 Molecular Evolution.
Pipelines for Computational Analysis (Bioinformatics)
Very important to know the difference between the trees!
Genomes and Their Evolution
Genome Projects Maps Human Genome Mapping Human Genome Sequencing
families with >5 genes are more common in plants than in animals
Volume 8, Issue 6, Pages (June 2015)
Volume 11, Issue 3, Pages (March 2018)
First Draft of Chimpanzee Genome
Evolutionary genetics
BIOL 433 Plant Genetics Term 2,
Chapter 6 Clusters and Repeats.
Jeffrey A. Fawcett, Hideki Innan  Trends in Genetics 
Unit Genomic sequencing
Volume 11, Issue 3, Pages (March 2018)
Presentation transcript:

families with >5 genes are more common in plants than in animals adapted from Lockton S, Gaut BS Trends Genet 21: 60-65

alternative splicing (AS) is more common in animals than in plants Boue S, et al BioEssays 25: ; Iida K, et al Nucleic Acids Res 32: ; Kikuchi S, et al Science 301: Arabidopsis and rice AS

duplications occur on any length scale, from individual genes (where tandem refers to a gene and its duplicate being adjacent), to multi-gene segments of the chromosome, to an entire genome e.g. wild wheat is diploid 2n, domestication gave a tetraploid 4n (pasta) and a hexaploid 6n (bread) synteny is when 2 or more genes are found in the same order/orientation on the chromosomes of related species

polyploidy (whole genome duplication) events among plants adapted from Blanc G, Wolfe KH Plant Cell 16: ; Paterson AH, et al Proc Natl Acad Sci USA 101: monocot dicot

phylogeny of the favored plants there is extensive synteny among Gramineae but between Gramineae and Arabidopsis there is essentially no synteny sorghum maize Arabidopsis barley wheat rice Gramineae 55~70 Mya monocot-dicot 170~235 Mya

the duplication history of rice every cDNA-defined gene is assigned a duplication category using the methods of Yu J, et al PLoS Biol 3: e38 1.analysis relies entirely on 19,079 full length cDNAs; had we used predicted genes instead many of the duplications would have been missed 2.a homolog pair refers to a cDNA and its TblastN match (i.e. comparisons done at amino acid level to genome translation in all 6 reading frames) at an expectation value of 1E-7 and requiring that >50% be aligned; note that the TblastN match is not necessarily expressed itself 3.if a gene has any homologs at all, the mean(median) number of homologs is 40(5) 4.multiple duplications are difficult to analyze; so consider the cDNAs with 1-and-only-1 homolog

ONE whole genome duplication, a recent segmental duplication, and many individual gene duplications birth death whole genome individual genes recent segmental time

18 pairs of duplicated segments covering 65.7% of rice genome higher order homologs used to backfill established trend lines segmental

ancient whole genome duplication (WGD) in rice

uninterpretable plot if use cDNAs with more than one homolog in rice mean (median) number of homologs per duplicated gene is 40 (5)

unmarked trend along diagonal from tandem gene duplications there were NO segmental duplications within a chromosome tandem background

computing molecular clocks and indicators of evolutionary selection Ka = non-synonymous changes per available site Ks = synonymous changes per available site available site corrects for fact that 76% of substitutions, or 438 of 3  3  64, encode a different amino acid Ka/Ks < 1 is evidence of purifying selection Ka/Ks = 1 is evidence of no selection (pseudogene) Ka/Ks > 1 is evidence of adaptive selection mean Ka/Ks is 0.20 in primates and 0.14 in rodents

from neutral substitution rate to time since divergence of species neutral substitution rates vary with genes and evolutionary lineages but on average they are 2.2×10 -9 for mammals and 6.5×10 -9 for Gramineae Kumar S, Hedges SB Nature 392: common ancestor species1species2 time since divergence equals species2-species1 divided by (2 × neutral substitution rate)

17 of 18 segments are attributable to a whole genome duplication just before the Gramineae divergence timing of WGD relative to Gramineae divergence is based on observed syntenies and not Ks

background duplications have Ks signature like tandem duplications except that they are more ancient peak at zero Ks and exponential decay thereafter is indicative of ongoing duplication process

duplicated genes undergo periods of relaxed selection and are usually silenced within 4~17 million years hypothesis introduced by Lynch M, Conery JS Science 290: 1151; with details in Lynch M, Conery JS J Struct Funct Genomics 3: 35 one copy left alone one copy to modify eventual death novel function progenitor gene relaxed selection reduced expression post-duplicative ‘transient’ of duration 4~17 million years

rice analysis succeeded only because duplication is not too old when the duplication is old: an analysis from yeast comparing related genomes with and without the duplication Kellis M, et al Proof and evolutionary analysis of ancient genome duplication in the yeast Saccharomyces cerevisiae. Nature 428: when the duplication is extremely new: an analysis from human Bailey JA, et al Recent segmental duplications in the human genome. Science 297:

proof of whole genome duplication in Saccharomyces cerevisiae by comparison to sequence of Kluyveromyces waltii duplication mutation gene death interleaving genes from sister segments in comparison to K. waltii

gene and regional correspondences with K. waltii

ancient whole genome duplication in S. cerevisiae

identifying recent segmental duplications in human assembly whole genome shotgun (WGS) reads from Celera are aligned to map-based genome from IHGSC; recent segmental duplications are detected in similarity and read depth anomalies

patterns of intra-chromosomal and inter-chromosomal duplication recent segmental duplications of length>10-kb & identity>95%; intra-chromosomal (blue lines) and inter-chromosomal (red bars) duplication; unique regions surrounded by intra-chromosomal duplications (gold bars) are hot spots for genomic disorders

recent segmental duplications in IHGSC and Celera genomes proportion of Celera aligned bases falls rapidly as identity exceeds 97% or length exceeds 15-kb, but the total sequence lost is still only 2%~3% NB: search of the map-based rice genome revealed no segmental duplications of recent origins (Yu J, et al Trends Plant Sci 11:

“Although it is clear that the detailed clone-ordered approach is superior in the resolution of segmental duplications, it would be unrealistic to propose that the sequencing community should abandon whole- genome-shotgun based approaches. These are the most efficient cost-effective means of capturing the bulk of the euchromatic sequence.” Evan E. Eichler (21 October 2004)