Outline Arabidopsis gene expression (MPSS) Two evolutionary issues in the evolution of expression profiles: –Physical clustering of co-expressed genes.

Slides:



Advertisements
Similar presentations
Evolution of genomes.
Advertisements

Supplementary Figure S1 Distribution of observed (blue) and Poisson expected (red) standard deviation of human-chimpanzee divergence over different window.
Atelier INSERM – La Londe Les Maures – Mai 2004
Signatures of Selection
Genome evolution There are both proximate and ultimate explanations in molecular biology Mutation continually generates variation in genome content and.
Molecular Evolution Revised 29/12/06
This presentation was originally prepared by C. William Birky, Jr. Department of Ecology and Evolutionary Biology The University of Arizona It may be used.
Genomics An introduction. Aims of genomics I Establishing integrated databases – being far from merely a storage Linking genomic and expressed gene sequences.
. Class 1: Introduction. The Tree of Life Source: Alberts et al.
Variation in Natural Populations. Overview of Evolutionary Change Natural Selection: variation among individuals in heritable traits lead to variation.
From population genetics to variation among species: Computing the rate of fixations.
The evolution of expression patterns in the Arabidopsis genome Todd Vision Department of Biology University of North Carolina at Chapel Hill.
Microarrays and Cancer Segal et al. CS 466 Saurabh Sinha.
Alternative splicing and evolution Daniel Jeffares.
BACKGROUND E. coli is a free living, gram negative bacterium which colonizes the lower gut of animals. Since it is a model organism, a lot of experimental.
1 Is Gene Position Adaptively Favored?. 2 Why do we care? Genomic clusters of genes Yeast 98% of genes in metabolic pathways cluster (Lee & Sonnhammer)
Lecture Outline 12/7/05 The human genome
The phylogenetics project data revealed! October 4, 2010 OEB 192.
EVOLUTIONARY AND COMPUTATIONAL GENOMICS Shin-Han Shiu Plant Biology / CMB / EEBB / Genetics / QBMI.
The origin of genetic variation
Quantitative Genetics
Genome projects and model organisms Level 3 Molecular Evolution and Bioinformatics Jim Provan.
Large-Scale Copy Number Polymorphism in the Human Genome J. Sebat et al. Science, 305:525 Luana Ávila MedG 505 Feb. 24 th /24.
1 Genetic Variability. 2 A population is monomorphic at a locus if there exists only one allele at the locus. A population is polymorphic at a locus if.
- any detectable change in DNA sequence eg. errors in DNA replication/repair - inherited ones of interest in evolutionary studies Deleterious - will be.
Genomics BIT 220 Chapter 21.
Proliferation cluster (G12) Figure S1 A The proliferation cluster is a stable one. A dendrogram depicting results of cluster analysis of all varying genes.
This presentation was originally prepared by C. William Birky, Jr. Department of Ecology and Evolutionary Biology The University of Arizona It may be used.
Genomics Lecture 8 By Ms. Shumaila Azam. 2 Genome Evolution “Genomes are more than instruction books for building and maintaining an organism; they also.
Bioinformatics 2011 Molecular Evolution Revised 29/12/06.
Mehdi Layeghifard Evolutionary Mechanisms Underlying the Functional Divergence of Vertebrates’ Circadian Rhythm Genes.
Genomes & their evolution Ch 21.4,5. About 1.2% of the human genome is protein coding exons. In 9/2012, in papers in Nature, the ENCODE group has produced.
Models of Molecular Evolution III Level 3 Molecular Evolution and Bioinformatics Jim Provan Page and Holmes: Sections 7.5 – 7.8.
Table 8.3 & Alberts Fig.1.38 EVOLUTION OF GENOMES C-value paradox: - in certain cases, lack of correlation between morphological complexity and genome.
Families with >5 genes are more common in plants than in animals adapted from Lockton S, Gaut BS Trends Genet 21:
Copyright © 2008 Pearson Education, Inc., publishing as Pearson Benjamin Cummings PowerPoint ® Lecture Presentations for Biology Eighth Edition Neil Campbell.
Maria Warnefors, Vini Pereira, Adam Eyre-Walker
Asymmetric Sequence Divergence of Duplicate Genes Experimented By: Gavin Conant and Andreas Wagner Presented By: Jennifer Case and Jonathan Hobbs.
Evolution at the Molecular Level. Outline Evolution of genomes Evolution of genomes Review of various types and effects of mutations Review of various.
Eukaryotic Genomes 11 November, 2005 Text Chapter 19.
How many genes are there?
Statistical Tests We propose a novel test that takes into account both the genes conserved in all three regions ( x 123 ) and in only pairs of regions.
Evolution at the Molecular Level. Outline Evolution of genomes Evolution of genomes Review of various types and effects of mutations Review of various.
Evolution of Duplicated Genomes Talline Martins
The Evolution of Populations Chapter Weaknesses  He didn’t know how heritable traits pass from one generation to the next  Although variation.
Mutation: Origin of genetic variation sources of new alleles rate and nature of mutations sources of new genes highly repeated functional sequences.
In populations of finite size, sampling of gametes from the gene pool can cause evolution. Incorporating Genetic Drift.
VI. Mutation A.Overview B.Changes in Ploidy C.Changes in ‘Aneuploidy’ (changes in chromosome number) D. Change in Gene Number/Arrangement.
Data Analysis. Statistics - a powerful tool for analyzing data 1. Descriptive Statistics - provide an overview of the attributes of a data set. These.
CSCOPE Unit: 09 Lesson: 01.  Be prepared to share your response to the following: ◦ Biological evolution happens at the __________ level, not the individual.
Quest for epigenetic determinants of local coexpression clusters Wieslawa Mentzen Labrador and Corces, 2002.
Transcriptional Enhancers Looking out for the genes and each other Sridhar Hannenhalli Department of Cell Biology and Molecular Genetics Center for Bioinformatics.
Enhancers and 3D genomics Noam Bar RESEARCH METHODS IN COMPUTATIONAL BIOLOGY.
Evolution of eukaryotic genomes
Segmental, Componential, & Contextual Changes
Evolution of gene function
Genetics and Evolutionary Biology
Signatures of Selection
SGN23 The Organization of the Human Genome
families with >5 genes are more common in plants than in animals
Germplasm Issues Chapter 3. Variation: Type, Origin, and Scale
Fig Figure 21.1 What genomic information makes a human or chimpanzee?
Descriptive Statistics
Genetic Drift, followed by selection can cause linkage disequilibrium
Evolutionary genetics
Chapter 6 Clusters and Repeats.
1. Unequal Crossing-Over a. process: If homologs line up askew:
Parallel evolution of coding sequences above neutral expectation.
Volume 7, Issue 2, Pages (August 2010)
Analysis of the nonsynonymous to synonymous substitutions dN/dS (ω) for the 5,493 1:1 ortholog transcripts. Analysis of the nonsynonymous to synonymous.
Presentation transcript:

Outline Arabidopsis gene expression (MPSS) Two evolutionary issues in the evolution of expression profiles: –Physical clustering of co-expressed genes –Divergence of duplicated genes

Physical clustering of co-expression Caenorhabditis elegansRoy et al., (2002) Nature 418, 975 Lercher et al (2003) Genome Research 13, 238 Drosophila melanogasterBoutanaev et al (2002) Nature 420, 666 Spellman and Rubin (2002) J Biology 1, 5 Homo sapiens Caron et al (2001) Science 291, 1289 Lercher et al (2002) Nature Genetics 31, 180 Saccharomyces cerevisiae Cohen et al (2000) Nature Genetics 26, 183 Hurst et al (2002) Trends in Genetics 18, 604 Mannila et al (2002) Bioinformatics 18, 482 ‘ What are the proximate explanations? –shared cis-regulatory elements –chromatin packaging, etc. What are the ultimate explanations? –Adaptive: greater transcriptional efficiency/accuracy? –Maladaptive: mutational rain chipping away at insulators and other mechanisms that over-ride regional controllers of gene expression?

Measuring expression distance library 1 library 2 library 3

Clustering of tissue-specific expression Flower (red) Silique (violet) Leaf (green) Root (blue) Callus (white) Chromosome 1

Statistical tests of coexpression clustering Measured median pairwise expression distance (MPED) in non-overlapping windows of 20 genes –Summed unique class 1 and 2 signatures for each gene –Only one gene within each tandemly arrayed family was counted Out of 100 shuffles of gene order –Zero shuffles had as many windows with small MPED (less than 1.5) as the unshuffled data –Zero shuffles had as large a variance in MPED among windows as the unshuffled data

Coexpression in Arabidopsis

Selection and recombination In regions of low recombination –deleterious mutations can hitch-hike to high frequency along with favorable ones –favorable mutations are kept at low frequency by linkage to deleterious ones Therefore, the effectiveness of natural selection is causally related to recombination rate Are clusters more concentrated in regions of –high recombination (i.e. are they adaptive) –low (i.e. are they maladaptive)?

Measuring recombination rate Chromosome 1

Co-expression is greater in low recombination regions

Co-expression clusters MPSS data provides evidence for clusters of co-expression among non-related genes in Arabidopsis Co-expression is greater in regions of low recombination Thus, co-expression clusters may be maladapative, at least on average

Outline Arabidopsis gene expression (MPSS) Two evolutionary issues in the evolution of expression profiles: –Physical clustering of co-expressed genes –Divergence of duplicated genes

Divergence of duplicated genes Age of duplication Expression distance

Duplicated genes in Arabidopsis

Modes of gene duplication Tandem (unequal crossing-over) Dispersed (transposition) Segmental (polyploidy)

Divergence of duplicated genes All gene families of size 2 in Arabidopsis were classified as ‘dispersed’, ‘segmental’ or ‘tandem’ Expression distance was calculated for each The number of silent (i.e. synonymous) substitutions per site was calculated for each (as a proxy for age since duplication)

Divergence and mode of duplication

Divergence of duplicated genes Almost all expression divergence occurs during (or immediately following) duplication Initial expression divergence is more extreme for tandem than dispersed duplicates Tandem and dispersed duplicates with the most divergent expression profiles are quickly lost Segmental duplicates plateau at a lower level of expression divergence than dispersed duplicates The average divergence in relative expression level in each tissue is about 8-fold.

Lessons learned Clusters of co-expression in Arabidopsis may be largely the result of a rain of weakly deleterious mutations that homogenize the expression profiles of neighboring genes Divergence in expression profile between duplicated genes is dependent on the nature of the mutation that gave rise to the duplication