By Chunfang Zheng and David Sankoff, 2014

Slides:



Advertisements
Similar presentations
1 Orthologs: Two genes, each from a different species, that descended from a single common ancestral gene Paralogs: Two or more genes, often thought of.
Advertisements

. Class 9: Phylogenetic Trees. The Tree of Life Evolution u Many theories of evolution u Basic idea: l speciation events lead to creation of different.
Gene an d genome duplication Nadia El-Mabrouk Université de Montréal Canada.
Sequence allignement 1 Chitta Baral. Sequences and Sequence allignment Two main kind of sequences –Sequence of base pairs in DNA molecules (A+T+C+G)*
GENE TREES Abhita Chugh. Phylogenetic tree Evolutionary tree showing the relationship among various entities that are believed to have a common ancestor.
© 2005 Prentice Hall Inc. / A Pearson Education Company / Upper Saddle River, New Jersey Puffer fish phylogeny From Figure 1 Mulley, J and Holland,
Phylogenetic Trees Understand the history and diversity of life. Systematics. –Study of biological diversity in evolutionary context. –Phylogeny is evolutionary.
GenomePixelizer - a visualization tool for comparative genomics within and between species. A. Kozik, E. Kochetkova, and R. Michelmore (Department of Vegetable.
The Statistical Significance of Max-gap Clusters Rose Hoberman David Sankoff Dannie Durand.
Genetica per Scienze Naturali a.a prof S. Presciuttini Human and chimpanzee genomes The human and chimpanzee genomes—with their 5-million-year history.
Finding Orthologous Groups René van der Heijden. What is this lecture about? What is ‘orthology’? Why do we study gene-ancestry/gene-trees (phylogenies)?
Genomic Rearrangements CS 374 – Algorithms in Biology Fall 2006 Nandhini N S.
Short Primer on Comparative Genomics Today: Special guest lecture 12pm, Alway M108 Comparative genomics of animals and plants Adam Siepel Assistant Professor.
Comparative Expression Moran Yassour +=. Goal Build a multi-species gene-coexpression network Find functions of unknown genes Discover how the genes.
Genetica per Scienze Naturali a.a prof S. Presciuttini Evolution of the eukaryotic nuclear genome The nuclear genome of eukaryotes is thought to.
Sequencing a genome and Basic Sequence Alignment
TGCAAACTCAAACTCTTTTGTTGTTCTTACTGTATCATTGCCCAGAATAT TCTGCCTGTCTTTAGAGGCTAATACATTGATTAGTGAATTCCAATGGGCA GAATCGTGATGCATTAAAGAGATGCTAATATTTTCACTGCTCCTCAATTT.
Comparative Genomics of the Eukaryotes
Genome projects and model organisms Level 3 Molecular Evolution and Bioinformatics Jim Provan.
Alignment Statistics and Substitution Matrices BMI/CS 576 Colin Dewey Fall 2010.
Pairwise Alignment How do we tell whether two sequences are similar? BIO520 BioinformaticsJim Lund Assigned reading: Ch , Ch 5.1, get what you can.
Sequence Analysis Alignments dot-plots scoring scheme Substitution matrices Search algorithms (BLAST)
Multiple Sequence Alignments and Phylogeny.  Within a protein sequence, some regions will be more conserved than others. As more conserved,
What is comparative genomics? Analyzing & comparing genetic material from different species to study evolution, gene function, and inherited disease Understand.
CISC667, S07, Lec5, Liao CISC 667 Intro to Bioinformatics (Spring 2007) Pairwise sequence alignment Needleman-Wunsch (global alignment)
A computational study of protein folding pathways Reducing the computational complexity of the folding process using the building block folding model.
Chapter 26: Phylogeny and the Tree of Life Objectives 1.Identify how phylogenies show evolutionary relationships. 2.Phylogenies are inferred based homologies.
Chap. 7 Genome Rearrangements Introduction to Computational Molecular Biology Chap ~
Sequencing a genome and Basic Sequence Alignment
whole-genome duplications and large segmental duplications… …seem to be a common feature in eukaryotic genome evolution …play a crucial role in the evolution.
Inferring Functional Information from Domain co-evolution Yohan Kim, Mehmet Koyuturk, Umut Topkara, Ananth Grama and Shankar Subramaniam Gaurav Chadha.
Construction of Substitution Matrices
Greedy Algorithms CS 498 SS Saurabh Sinha. A greedy approach to the motif finding problem Given t sequences of length n each, to find a profile matrix.
Chapter 24: Molecular and Genomic Evolution CHAPTER 24 Molecular and Genomic Evolution.
Complementarity of network and sequence information in homologous proteins March, Department of Computing, Imperial College London, London, UK 2.
Identification of Ortholog Groups by OrthoMCL Protein sequences from organisms of interest All-against-all BLASTP Between Species: Reciprocal best similarity.
Orthology & Paralogy Alignment & Assembly Alastair Kerr Ph.D. [many slides borrowed from various sources]
Comparative genomics Haixu Tang School of Informatics.
Using blast to study gene evolution – an example.
Molecular and Genomic Evolution Getting at the Gene Pool.
Table 8.3 & Alberts Fig.1.38 EVOLUTION OF GENOMES C-value paradox: - in certain cases, lack of correlation between morphological complexity and genome.
Significance Tests for Max-Gap Gene Clusters Rose Hoberman joint work with Dannie Durand and David Sankoff.
Orthology & Paralogy Alignment & Assembly Alastair Kerr Ph.D. WTCCB Bioinformatics Core [many slides borrowed from various sources]
Comparative genomics of Gossypium and Arabidopsis: Unraveling the consequences of both ancient and recent polyploidy Junkang Rong, John E. Bowers, Stefan.
Construction of Substitution matrices
1 Genome Rearrangements (Lecture for CS498-CXZ Algorithms in Bioinformatics) Dec. 6, 2005 ChengXiang Zhai Department of Computer Science University of.
Sequence Search Abhishek Niroula Department of Experimental Medical Science Lund University
Statistical Tests We propose a novel test that takes into account both the genes conserved in all three regions ( x 123 ) and in only pairs of regions.
Supplementary Fig. 1 Supplementary Figure 1. Distributions of (A) exon and (B) intron lengths in O. sativa and A. thaliana genes. Green bars are used for.
Substitution Matrices and Alignment Statistics BMI/CS 776 Mark Craven February 2002.
The Artemis Comparison Tool
Phylogeny and the Tree of Life
Supplementary Fig. 1 Supplementary Figure 1. Distributions of (A) exon and (B) intron lengths in O. sativa and A. thaliana genes. Green bars are used.
Reconstructing the Evolutionary History of Complex Human Gene Clusters
Original Synteny Vincent Ferretti, Joseph H. Nadeau, David Sankoff, 1996 Presented by: Suzy Sun.
Genome alignment Usman Roshan.
Research in Computational Molecular Biology , Vol (2008)
BIOLOGY EOC 10-DAY REVIEW.
A Hybrid Algorithm for Multiple DNA Sequence Alignment
Tests for Gene Clustering
Mattew Mazowita, Lani Haque, and David Sankoff
I. Statistical Tests: Why do we use them? What do they involve?
Multiple Genome Rearrangement
Evolutionary genetics
Pairwise Sequence Alignment
Chapter 6 Clusters and Repeats.
CSCI 1810 Computational Molecular Biology 2018
Understanding biological Evolution and the Diversity of Life
Volume 11, Issue 7, Pages (May 2015)
Presentation transcript:

By Chunfang Zheng and David Sankoff, 2014 Practical halving; the Nelumbo nucifera evidence on early eudicot evolution By Chunfang Zheng and David Sankoff, 2014 Presented by: Nuray Kabasakal

Introduction Practical Halving and the N. nucifera genome The search for subgenomes The comparison of Nelumbo and Vitis ancestors Gene divergence evidence Conclusion

1. Introduction

Arabidopsis Utricularia Fractionation and rearrangement Signature pattern to recode evolutionary history Pattern of m genome set and k mutually synthenic chromosome fragments of genes .

Genome Halving: Genome halving (k = 2) and genome aliquoting (k > 2) are computational procedures to find the number of chromosomes in the ancestor. Genome halving algorithm designed for large eukaryote genomes with largely single-copy genes, taking advantage of a signature pattern of paralog distribution in ancient polyploids. A “practical halving” approach to reconstructing the pre-doubling ancestor, derived from the practical aliquoting procedure which prioritizes evidence for this signature pattern.

Nelumbo nucifera (sacred lotus) Hypothesis about eudicot genome history

The genome publication for Nelumbo suggests that γ can happen as two successive tetraploidizations B + B’ and A + BB’, A is earlier diverging sister genome. Q= Did A diverged before or after Nelumbo-core eudicot split?

2. Practical halving and the N. nucifera genome 2 important characteristic of genome organization: Paralogy Genome bipartition, has to do with homology among genes within a doubled genome, more particularly the pairs of paralogous genes created by a whole genome doubling. Synteny Double synteny, involves gene positions on the chromosome.

3. The search for subgenomes Each gene is identified by a distinct label and its only two relevant properties are: Its position on specific chromosome Identity of its paralog SynMap procedure in CoGe was used to extract these data via a self comparison of the genome. Parameters used for practical halving algorithm: Short gap reward r>0, jumping j>0, halving defect h>0, threshold t≥0. A fragment: contiguous set of genes on a chromosome of the input genome. A region: an ordered set of fragments, with successive fragments being separated by a gap of one or more genes on a chromosome, or by a chromosome jump, i.e., the two fragments are on different chromosomes. In a pair of regions, ideally all the paralogs of all the genes are between the regions and none are within a single region. Pairs of paralogs that are exceptions to this rule are called halving defects.

The main step of the algorithm is keep merging 2 existing clusters, pair of regions, together. If merging reduce number of gapped genes; it is assigned score r if the number of chromosomes of the input genome in the two regions being merged is less than the number in the output, there is a penalty j . if the number of halving defects in the merged regions is d greater than that in both of the original regions, a penalty of hd is given.

4. The comparison of Nelumbo and Vitis ancestors Comparison of Nelumbo vs Vitis chromosomes

Syntheny Blocks

5. Gene divergence evidence N & V orthologs equal Vitis paralogs equal. ii. & iii. N & V orthologs less similar for dominant Vitis subgenome. Vitis paralogs more similar for non-similar subgenomes. iv. Vitis paralogs more similar for non-similar subgenomes v. N & V orthologs more similar for dominant Vitis subgenome. vi. The similarities between paralogs in Nelumbo should have a bimodal distribution.

6. Conclusion

Thank you for listening! Have a happy summer!