The Cobweb of Life Revealed by Genome-Scale Estimates of Horizontal Gene Transfer By Fan Ge, Li-San Wang, Junhyong Kim Published: August 30, 2005 Presented.

Slides:



Advertisements
Similar presentations
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 1.1 Chapter Five Data Collection and Sampling.
Advertisements

1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de Computação Universidade Federal de Sergipe.
Scale of human cooperation an outlier in the animal kingdom Cooperative activities paradoxical: costly to the individual without yielding any direct benefits.
EVIDENCE OF EVOLUTION.
1 A Systematic Review of Cross- vs. Within-Company Cost Estimation Studies Barbara Kitchenham Emilia Mendes Guilherme Travassos.
A Separate Analysis Approach to the Reconstruction of Phylogenetic Networks Luay Nakhleh Department of Computer Sciences UT Austin.
1 Orthologs: Two genes, each from a different species, that descended from a single common ancestral gene Paralogs: Two or more genes, often thought of.
The World Bank Human Development Network Spanish Impact Evaluation Fund.
 Aim in building a phylogenetic tree is to use a knowledge of the characters of organisms to build a tree that reflects the relationships between them.
GENE TREES Abhita Chugh. Phylogenetic tree Evolutionary tree showing the relationship among various entities that are believed to have a common ancestor.
18.2 Modern Evolutionary Classification
18.2 Modern Evolutionary Classification
Phylogenetic reconstruction
18.2 Modern Evolutionary Classification
PHYLOGENY AND SYSTEMATICS
The Cobweb of life revealed by Genome-Scale estimates of Horizontal Gene Transfer Fan Ge, Li-San Wang, Junhyong Kim Mourya Vardhan.
Current Approaches to Whole Genome Phylogenetic Analysis Hongli Li.
. Class 1: Introduction. The Tree of Life Source: Alberts et al.
Bioinformatics and Phylogenetic Analysis
Tree Pattern Matching in Phylogenetic Trees Automatic Search for Orthologs or Paralogs in Homologous Gene Sequence Databases By: Jean-François Dufayard,
Gene transfer Organismal tree: species B species A species C species D Gene Transfer seq. from B seq. from A seq. from C seq. from D molecular tree: speciation.
Slide 1 Statistics Workshop Tutorial 4 Probability Probability Distributions.
Bell Work Dogs of a certain breed can have black fur or white fur. Black fur is dominant, but the breeder only wants puppies with white fur. Cross two.
Classification and Phylogenies Taxonomic categories and taxa Inferring phylogenies –The similarity vs. shared derived character states –Homoplasy –Maximum.
Biology Chapter 1 The Science of Biology
TGCAAACTCAAACTCTTTTGTTGTTCTTACTGTATCATTGCCCAGAATAT TCTGCCTGTCTTTAGAGGCTAATACATTGATTAGTGAATTCCAATGGGCA GAATCGTGATGCATTAAAGAGATGCTAATATTTTCACTGCTCCTCAATTT.
Input for the Bayesian Phylogenetic Workflow All Input values could be loaded as text file or typing directly. Only for the multifasta file is advised.
Molecular phylogenetics
The Evolutionary History of Biodiversity
The Horizontal Gene Transfer Database Jeramy Brewster, Edward Simpson and Spandana Kommalapati Mentor: Mark Goebl, PhD.
7-1 Estim Unit 7 Statistical Inference - 1 Estimation FPP Chapters 21,23, Point Estimation Margin of Error Interval Estimation - Confidence Intervals.
Chapter 26: Phylogeny and the Tree of Life Objectives 1.Identify how phylogenies show evolutionary relationships. 2.Phylogenies are inferred based homologies.
 Read Chapter 4.  All living organisms are related to each other having descended from common ancestors.  Understanding the evolutionary relationships.
Work Day 1 Today we will: Discuss scientific writing
TGCAAACTCAAACTCTTTTGTTGTTCTTACTGTATCATTGCCCAGAATAT TCTGCCTGTCTTTAGAGGCTAATACATTGATTAGTGAATTCCAATGGGCA GAATCGTGATGCATTAAAGAGATGCTAATATTTTCACTGCTCCTCAATTT.
Phylogeny GENE why is coalescent theory important for understanding phylogenetics (species trees)? coalescent theory lets us test our assumptions.
Phylogenetic Trees  Importance of phylogenetic trees  What is the phylogenetic analysis  Example of cladistics  Assumptions in cladistics  Frequently.
Chapter 26 Phylogeny and the Tree of Life
Evolutionary Biology Concepts Molecular Evolution Phylogenetic Inference BIO520 BioinformaticsJim Lund Reading: Ch7.
Introduction to Phylogenetics
Introduction to History of Life. Biological evolution consists of change in the hereditary characteristics of groups of organisms over the course of generations.
Introduction to Phylogenetic trees Colin Dewey BMI/CS 576 Fall 2015.
Phylogeny Ch. 7 & 8.
National Research Council Of the National Academies
Ayesha M.Khan Spring Phylogenetic Basics 2 One central field in biology is to infer the relation between species. Do they possess a common ancestor?
ASSEMBLY AND ALIGNMENT-FREE METHOD OF PHYLOGENY RECONSTRUCTION FROM NGS DATA Huan Fan, Anthony R. Ives, Yann Surget-Groba and Charles H. Cannon.
1.  The practice or science of collecting and analyzing numerical data in large quantities, especially for the purpose of inferring* proportions in a.
Section 26.5: Horizontal Gene Transfer By Monica Macaro.
Chapter 26 Phylogeny and the Tree of Life
Lesson Overview Lesson Overview Modern Evolutionary Classification 18.2.
General Microbiology (Micr300)
Phylogeny and the Tree of Life
Introduction to Bioinformatics Resources for DNA Barcoding
phylogenetic inferences
EVIDENCE OF EVOLUTION.
Chapter 12 Using Descriptive Analysis, Performing
Evolution of Biodiversity
Why could a gene tree be different from the species tree?
Fig. 1. (A) Acclimated specific growth rates of Picochlorum species in media with varying salinity (10 mM–1.2 M). (B) Growth rates of P. oklahomensis and.
Genome Evolution: Horizontal Movements in the Fungi
Volume 8, Issue 2, Pages (February 2015)
Genome Evolution: Horizontal Movements in the Fungi
Reverend Thomas Bayes ( )
Analysis of the complete genome sequences of human rhinovirus
Pier Francesco Palamara, Todd Lencz, Ariel Darvasi, Itsik Pe’er 
Volume 12, Issue 9, Pages (April 2002)
Phylogeny and the Tree of Life
Matthew A. Campbell, Piotr Łukasik, Chris Simon, John P. McCutcheon 
Volume 26, Issue 23, Pages (December 2016)
Presentation transcript:

The Cobweb of Life Revealed by Genome-Scale Estimates of Horizontal Gene Transfer By Fan Ge, Li-San Wang, Junhyong Kim Published: August 30, 2005 Presented By: Vishal S. Doshi Course: CS th April 2014

Fact of the day "Biology is the only science in which multiplication means the same thing as division."

Agenda Terminologies Motivation Overview Experiment Conclusion

Terminologies Phylogeny: History of the Evolution of a species or group, especially lines of descent and relationships among broad groups.

Tree of Life Terminologies

Horizontal Gene Transfer (HGT): Horizontal gene transfer occurs when genes jump between unrelated organisms.

Terminologies Horizontal Gene Transfer (HGT): Horizontal gene transfer occurs when genes jump between unrelated organisms. E.g. Think of it as acquiring genes from your neighbor instead of your parents.

Terminologies Horizontal Gene Transfer (HGT)

Pun Intended Why is the mushroom always asked to a party?

Pun Intended Why is the mushroom always asked to a party? Because he’s a fungi (fun guy).

Agenda Terminologies Motivation Overview Experiment Conclusion

Motivation There are 10 types of people in the world:

Motivation There are 10 types of people in the world: 01. Those who understand binary, and

Motivation There are 10 types of people in the world: 01. Those who understand binary, and 10. those who don't.

Motivation Similarly, there are 2 types of people in the world:

Motivation Similarly, there are 2 types of people in the world: 1.Who believe HGT constitutes only minor interference when inferring phylogeny. Tree of Life

Motivation Similarly, there are 2 types of people in the world: 2.And others who believe HGT has an impact on Phylogeny ( the tree-like structure of life) and the old structure should be replaced with a new net-like structure.

Motivation Similarly, there are 2 types of people in the world: 2.And others who believe HGT has an impact on Phylogeny ( the tree-like structure of life) and the old structure should be replaced with a new net-like structure.

Motivation This established the aim of this experiment. The results would be breakthrough.

Agenda Terminologies Motivation Overview Experiment Conclusion

Overview Horizontal Gene Transfer (HGT): – Its role in speciation, adaptation and evolution is studied intensively –Growing evidence of lateral transfer of genes among species.

Overview Debate: Whole-Genome analysis of different prokaryotes indicate rampant HGTs. Suggesting role of HGT to be: ⁻ Pivotal in prokaryotic evolution. ⁻ To be considered essence of Phylogeny. Therefore life history cannot be well represented by tree-like form, but rather a network-like form.

Overview Debate : Therefore life history cannot be well represented by tree-like form, but rather a network-like form.

Overview Debate : Therefore life history cannot be well represented by tree-like form, but rather a network-like form.

Overview Unresolved issues in the debate : Estimation of HGT Frequency. Thus, inferring its impact on phylogeny. And the experiment conducted by our authors is to deal with them.

Overview Suggested HGT frequency: –At 24% in Thermatoga –A range up to 17% among different prokaryotes Led some researchers to believe absence of tree-like structure.

Overview Other researchers propose: –HGT constitutes only minor interference when inferring phylogeny.

Overview Other researchers propose: –HGT constitutes only minor interference when inferring phylogeny. –Methods for inferring HGT have various problems leading to Over-estimation E.g. : Different methods of estimation gave different HGT candidates applied to same genome.

Overview Other researchers propose: –HGT constitutes only minor interference when inferring phylogeny. –Methods for inferring HGT have various problems leading to Over-estimation E.g. : Different methods of estimation gave different HGT candidates applied to same genome. –Phylogeny can be sufficiently retrieved via a core of genes that may be resistant to HGT

Overview Inference of HGT can be done by tree comparisons. But it should be done under a proper statistical framework. Even though, if HGT events are randomly distributed across lineage: there still exist a backbone tree structure.

Agenda Terminologies Motivation Overview Experiment Conclusion

Experiment Specific questions asked in this paper are: 1.What is the fundamental structure of the whole-genome (W-G) tree?

Experiment Specific questions asked in this paper are: 1.What is the fundamental structure of the whole- genome (W-G) tree? 2.How do individual gene trees differ from this tree, especially in terms of general HGT events? 3.How do individual gene trees differ from one another in terms of HGT? 4.What is the rate of HGT events per genome? 5.What kind of genome-specific patterns or gene- specific patterns are evident for HGT events?

Experiment Specific questions asked in this paper are: 1.What is the fundamental structure of the whole-genome (W-G) tree? 2.How do individual gene trees differ from this tree, especially in terms of general HGT events?

Experiment Specific questions asked in this paper are: 1.What is the fundamental structure of the whole-genome (W-G) tree? 2.How do individual gene trees differ from this tree, especially in terms of general HGT events? 3.How do individual gene trees differ from one another in terms of HGT?

Experiment Specific questions asked in this paper are: 1.What is the fundamental structure of the whole-genome (W-G) tree? 2.How do individual gene trees differ from this tree, especially in terms of general HGT events? 3.How do individual gene trees differ from one another in terms of HGT? 4.What is the rate of HGT events per genome?

Experiment Specific questions asked in this paper are: 1.What is the fundamental structure of the whole- genome (W-G) tree? 2.How do individual gene trees differ from this tree, especially in terms of general HGT events? 3.How do individual gene trees differ from one another in terms of HGT? 4.What is the rate of HGT events per genome? 5.What kind of genome-specific patterns or gene- specific patterns are evident for HGT events?

Experiment Figure 1. Flowchart of the HGT Inference Procedure Ge F, Wang L-S, Kim J (2005) The Cobweb of Life Revealed by Genome-Scale Estimates of Horizontal Gene Transfer. PLoS Biol 3(10): e316. doi: /journal.pbio

Experiment: Step 1 High Quality Clusters of Orthologous Groups (COGs)

Experiment: Step 1 Figure 1. Flowchart of the HGT Inference Procedure Ge F, Wang L-S, Kim J (2005) The Cobweb of Life Revealed by Genome-Scale Estimates of Horizontal Gene Transfer. PLoS Biol 3(10): e316. doi: /journal.pbio Use the COG database to assemble a set of high-quality orthologous groups for tree inference.

Experiment: Step 1 The COG database used, covered 43 microorganisms. Stringent high-quality COG selection procedure resulted in: –Retention of only 297 out of 3852 COG entries. –These covered 40 genomes and on average ~16 genomes/entry.

Step 1: Outcome Table 1. Number of COG Entries That Contain Each of 40 Genomes Ge F, Wang L-S, Kim J (2005) The Cobweb of Life Revealed by Genome-Scale Estimates of Horizontal Gene Transfer. PLoS Biol 3(10): e316. doi: /journal.pbio

Experiment: Step 2 High-Quality Gene Groups and the W-G Tree

Experiment: Step 2 Figure 1. Flowchart of the HGT Inference Procedure Ge F, Wang L-S, Kim J (2005) The Cobweb of Life Revealed by Genome-Scale Estimates of Horizontal Gene Transfer. PLoS Biol 3(10): e316. doi: /journal.pbio Construct the W-G tree that represents the best treelike description for the genealogical relationship of the genomes.

Experiment: Step 2 Figure 1. Flowchart of the HGT Inference Procedure Ge F, Wang L-S, Kim J (2005) The Cobweb of Life Revealed by Genome-Scale Estimates of Horizontal Gene Transfer. PLoS Biol 3(10): e316. doi: /journal.pbio Construct the W-G tree that represents the best treelike description for the genealogical relationship of the genomes. We estimate a tree for each orthologous group (gene tree)

Experiment: Step 2 Use of median tree estimator designed by Kim and Salisbury to approximate W-G tree. –It is robust –Overcomes major genetic distortions like HGTs

Step 2: Outcome Figure 2. The W-G Tree Based on the Median Tree Algorithm Ge F, Wang L-S, Kim J (2005) The Cobweb of Life Revealed by Genome-Scale Estimates of Horizontal Gene Transfer. PLoS Biol 3(10): e316. doi: /journal.pbio

Step 2: Outcome Branches with bootstrap scores <50% were collapsed into the polytomous form. Three domains of life are shown as –(A) Archaea, –(B–J) Bacteria, and –(K) Eukaryote. Taxonomy labels are: A.Euryarchaea, B.Proteobacteria, C.Chlamydiae, D.Spirochaetes, E.Thermotogae F.Aquificae, G.Actinobacteria, H.Deinococcus, I.Cyanobacteria, J.Firmicutes, and K.Fungi.

Step 2: Outcome Figure 2. The W-G Tree Based on the Median Tree Algorithm Ge F, Wang L-S, Kim J (2005) The Cobweb of Life Revealed by Genome-Scale Estimates of Horizontal Gene Transfer. PLoS Biol 3(10): e316. doi: /journal.pbio Inferred HGT rates: red, >4%; yellow, 3%–4%; pink, 2%–3%; blue, 1%–2%; green,<1%.

Experiment: Step 3 & 4 HGT events for a particular gene or sequence can be detected by comparing the estimated gene tree against: –Other gene trees, or –Some candidate tree that represents the history of the genomes (W-G Tree)

Method Statistical Inference of HGT and Power Test

Method A.Statistical Inference: Method used for comparison: 1.Maximum Agreement Subtree (MAST) algorithm. 2.Symmetric Difference (SD)  Both process return a MAST score and SD score respectively.  Authors used PAUP* software tool to carry out this process.

Maximum Agreement Subtree (MAST) algorithm. Figure 3. Example for MAST

Symmetric Difference (SD) The symmetric difference of: The sets {1,2,3} and {3,4} is {1,2,4} Figure 4. Example for SD

Method Table 2. HGT inference MAST Score SD Score Explanation Small No HGT Large No easily explained by HGT SmallLargeHGT Event LargeSmallCannot occur

MAST= 2, SD= 2 MAST= 2, SD= 8 Figure 5. HGT Inference via Tree Comparison Method Ge F, Wang L-S, Kim J (2005) The Cobweb of Life Revealed by Genome-Scale Estimates of Horizontal Gene Transfer. PLoS Biol 3(10): e316. doi: /journal.pbio

MAST= 2, SD= 2 MAST= 2, SD= 8 HGT Event Figure 5. HGT Inference via Tree Comparison Method Ge F, Wang L-S, Kim J (2005) The Cobweb of Life Revealed by Genome-Scale Estimates of Horizontal Gene Transfer. PLoS Biol 3(10): e316. doi: /journal.pbio

Method B.Power Test A hypothesis test for HGT, using the difference between the normalized values of the two metrics. i.e SD and MAST

Method For a pair of trees T and T′, with m and n splits (i.e., branches), respectively, and with x number of taxa, our statistic γ is defined as: d S is the SD metric d M is the MAST metric

Method For a pair of trees T and T′, with m and n splits (i.e., branches), respectively, and with x number of taxa, our statistic γ is defined as: d S is the SD metric d M is the MAST metric Normalized values of the SD and the MAST metrics

Method Now, compute the p-Value(Significance) of an observed γ-value by generating a nonparametric null distribution based on randomly bootstrapped gene trees (Subtree Pruning-Regrafting)

Experiment Figure 1. Flowchart of the HGT Inference Procedure Ge F, Wang L-S, Kim J (2005) The Cobweb of Life Revealed by Genome-Scale Estimates of Horizontal Gene Transfer. PLoS Biol 3(10): e316. doi: /journal.pbio Assess the difference between the tree structure of each gene tree and that of the W-G tree.

Experiment: Step 3 HGT Estimation via Comparisons between Each Gene Tree and the W-G Tree

Experiment: Step 3 Hypothesis test described before was applied to each of the 297 COG gene trees against the W-G tree.

Step 3: Outcome Figure 6. Power of the γ Test in Detecting HGT Ge F, Wang L-S, Kim J (2005) The Cobweb of Life Revealed by Genome-Scale Estimates of Horizontal Gene Transfer. PLoS Biol 3(10): e316. doi: /journal.pbio

Step 4: Outcome Figure 7. The Relationship between Detecting COG Entries with HGT and the p-Values Ge F, Wang L-S, Kim J (2005) The Cobweb of Life Revealed by Genome-Scale Estimates of Horizontal Gene Transfer. PLoS Biol 3(10): e316. doi: /journal.pbio

Step 3: Outcome At the significance level of 0.05, authors inferred that 33 out of 297 COG entries (i.e., 11.1%) contain putative HGT events

Experiment: Step 4 HGT Estimation via Comparisons among Gene Trees

Experiment Figure 1. Flowchart of the HGT Inference Procedure Ge F, Wang L-S, Kim J (2005) The Cobweb of Life Revealed by Genome-Scale Estimates of Horizontal Gene Transfer. PLoS Biol 3(10): e316. doi: /journal.pbio Assess the difference between the tree structure of each gene tree vs. every other gene tree.

Experiment: Step 4 Same operation was carried out, but this time every gene-tree was compared with every other gene-tree and not W-G Tree. 14,004 pairs were compared. 1,764 pairs were significant. 13% COGs contain HGTs

Experiment: Step 5 HGT Frequency in 40 Genomes

Step 5: Outcome Table 3. Frequency of HGT in 40 Genomes and List of Transferred Genes Ge F, Wang L-S, Kim J (2005) The Cobweb of Life Revealed by Genome-Scale Estimates of Horizontal Gene Transfer. PLoS Biol 3(10): e316. doi: /journal.pbio

Step 5: Outcome The estimated rates of HGT in different genomes are between 0% and 6.7%, with an average of 2.0% among the 40 genomes studied here.

Agenda Terminologies Motivation Overview Experiment Conclusion

Imagine a very large tree 10,000 taxa. 10, 000 Potential “units” of HGT per genome If each such element had, say, 1,000 actual HGTs across the 10,000 taxa. All the HGTs will appear as extremely thin connections like “cobwebs” when layed over the tree.

Conclusion So pictorially:

Conclusion So pictorially:

Conclusion So pictorially:

Questions

I have one for you..

How do you tell the sex of chromosome?

I have one for you.. How do you tell the sex of chromosome? Pull down it's genes!