A Web Interface to analyse SOM of Bipartitions of Gene Phylogenies - A Walk Through J. Peter Gogarten, Maria Poptsova Dept. of Molecular and Cell Biology.

Slides:



Advertisements
Similar presentations
February 11, 2012 Seth Bordenstein Departments of Biological Sciences & Pathology, Microbiology, and Immunology Seth Bordenstein.
Advertisements

Viral Evolution and Recombination Peter Norberg
A Separate Analysis Approach to the Reconstruction of Phylogenetic Networks Luay Nakhleh Department of Computer Sciences UT Austin.
Bioinformatics Phylogenetic analysis and sequence alignment The concept of evolutionary tree Types of phylogenetic trees Measurements of genetic distances.
Phylogenetic analysis To infer and study evolutionary history of homologous gene families Manuel Ruiz (CIRAD, Data Integration team) Alexis Dereeper (IRD)
Wellcome Trust Workshop Working with Pathogen Genomes Module 6 Phylogeny.
Plant Molecular Systematics (Phylogenetics). Systematics classifies species based on similarity of traits and possible mechanisms of evolution, a change.
Phylogenetic trees Sushmita Roy BMI/CS 576 Sep 23 rd, 2014.
New Tools for Visualizing Genome Evolution Lutz Hamel Dept. of Computer Science and Statistics University of Rhode Island J. Peter Gogarten Dept. of Molecular.
Molecular Evolution Revised 29/12/06
© Wiley Publishing All Rights Reserved. Phylogeny.
Systems Biology Existing and future genome sequencing projects and the follow-on structural and functional analysis of complete genomes will produce an.
The Cobweb of life revealed by Genome-Scale estimates of Horizontal Gene Transfer Fan Ge, Li-San Wang, Junhyong Kim Mourya Vardhan.
JYC: CSM17 BioinformaticsCSM17 Week 10: Summary, Conclusions, The Future.....? Bioinformatics is –the study of living systems –with respect to representation,
Adaptive evolution of bacterial metabolic networks by horizontal gene transfer Chao Wang Dec 14, 2005.
Bioinformatics and Phylogenetic Analysis
Introduction to Computational Biology Topics. Molecular Data Definition of data  DNA/RNA  Protein  Expression Basics of programming in Matlab  Vectors.
Bas E. Dutilh Phylogenomics Using complete genomes to determine the phylogeny of species.
The Tree of Life (TOL) in the age of Genomics or a journey through the Phylogenetic Forest Eugene Koonin, NCBI / NLM / NIH RECOM BE, San Diego, May 23,
Example of bipartition analysis for five genomes of photosynthetic bacteria (188 gene families) total 10 bipartitions R: Rhodobacter capsulatus, H: Heliobacillus.
. Class 9: Phylogenetic Trees. The Tree of Life D’après Ernst Haeckel, 1891.
Gene transfer Organismal tree: species B species A species C species D Gene Transfer seq. from B seq. from A seq. from C seq. from D molecular tree: speciation.
MCB 372 #12: Tree, Quartets and Supermatrix Approaches Collaborators: Olga Zhaxybayeva (Dalhousie) Jinling Huang (ECU) Tim Harlow (UConn) Pascal Lapierre.
We are developing a web database for plant comparative genomics, named Phytome, that, when complete, will integrate organismal phylogenies, genetic maps.
MCB 372 #14: Student Presentations, Discussion, Clustering Genes Based on Phylogenetic Information J. Peter Gogarten University of Connecticut Dept. of.
Bioinformatics tools for phylogeny and visualization
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
Department of Biomedical Informatics Biomedical Data Visualization Kun Huang Department of Biomedical Informatics OSUCCC Biomedical Informatics Shared.
9/30/2004TCSS588A Isabelle Bichindaritz1 Introduction to Bioinformatics.
Molecular evidence for endosymbiosis Perform blastp to investigate sequence similarity among domains of life Found yeast nuclear genes exhibit more sequence.
Coalescence and the Cenancestor J. Peter Gogarten University of Connecticut Department of Molecular and Cell Biology.
Chapter 26: Phylogeny and the Tree of Life Objectives 1.Identify how phylogenies show evolutionary relationships. 2.Phylogenies are inferred based homologies.
Self-organizing map Speech and Image Processing Unit Department of Computer Science University of Joensuu, FINLAND Pasi Fränti Clustering Methods: Part.
Christian M Zmasek, PhD Burnham Institute for Medical Research Bioinformatics and Systems Biology
Bioinformatics 2011 Molecular Evolution Revised 29/12/06.
OUTLINE Phylogeny UPGMA Neighbor Joining Method Phylogeny Understanding life through time, over long periods of past time, the connections between all.
Calculating branch lengths from distances. ABC A B C----- a b c.
ARE THESE ALL BEARS? WHICH ONES ARE MORE CLOSELY RELATED?
Phylogenetic analyses of cyanobacterial genomes: Quantification of horizontal gene transfer events Olga Zhaxybayeva, J. Peter Gogarten, Robert L. Charlebois,
Chapter 24: Molecular and Genomic Evolution CHAPTER 24 Molecular and Genomic Evolution.
Phylogeny and Genome Biology Andrew Jackson Wellcome Trust Sanger Institute Changes: Type program name to start Always Cd to phyml directory before starting.
26.1 Organisms Evolve Through Genetic Change Occurring Within Populations. “Nothing in Biology makes sense except in the light of Evolution” –Theodosius.
AdvancedBioinformatics Biostatistics & Medical Informatics 776 Computer Sciences 776 Spring 2002 Mark Craven Dept. of Biostatistics & Medical Informatics.
Phylogenetic Analysis Gabor T. Marth Department of Biology, Boston College BI420 – Introduction to Bioinformatics Figures from Higgs & Attwood.
An Overview of Clustering Methods Michael D. Kane, Ph.D.
Bioinformatics and Comparative Genome Analyses Course Course web page: EMBO Bioinformatics and Comparative.
Announcements Urban Forestry data and photos due next week after the break. Reading. Writing assignment due Oct 18. Choose one of the characteristics out.
Data Mining and Decision Trees 1.Data Mining and Biological Information 2.Data Mining and Machine Learning Techniques 3.Decision trees and C5 4.Applications.
Copyright OpenHelix. No use or reproduction without express written consent1.
Chapter 10 Phylogenetic Basics. Similarities and divergence between biological sequences are often represented by phylogenetic trees Phylogenetics is.
Today in MEGA: Sequence Data Explorer Constructing Phylogenetic Trees
Phylogenetic trees Sushmita Roy BMI/CS 576 Sep 23 rd, 2014.
MCB 3421 class 26.
Ayesha M.Khan Spring Phylogenetic Basics 2 One central field in biology is to infer the relation between species. Do they possess a common ancestor?
Systematics and Phylogenetics Ch. 23.1, 23.2, 23.4, 23.5, and 23.7.
Interdisciplinary Research Interests.
Introductory Phylogenetic Workflows in the Discovery Environment Sheldon McKay iPlant Collaborative, DNALC, Cold Spring Harbor Laboratory Feb 8, 2012.
1 Survey of Biodata Analysis from a Data Mining Perspective Peter Bajcsy Jiawei Han Lei Liu Jiong Yang.
General Microbiology (Micr300)
Phylogeny & the Tree of Life
High-throughput Biological Data The data deluge
Biological Classification: The science of taxonomy
Genome organization and Bioinformatics
Phylogenetic Trees.
Chapter 25 Phylogeny and the Tree of Life
Comments on bipartitions, quartets and supertrees
Chapter 26.5: Horizontal Gene Transfer
Self-organizing map numeric vectors and sequence motifs
Phylogenetics Chapter 26.
Unit Genomic sequencing
Presentation transcript:

A Web Interface to analyse SOM of Bipartitions of Gene Phylogenies - A Walk Through J. Peter Gogarten, Maria Poptsova Dept. of Molecular and Cell Biology University of Connecticut Neha Nahar, Lutz Hamel Department of Computer Science and Statistics University of Rhode Island

BranchClust n Genomes Super Families Gene Families Reconstruct Phylogenetic History for Each Family

Data Matrix Biapartiton #1 (** …. ….) … Biapartiton #k (*******..) Support value vector for a set #1 of orthologous genes P 11 … P 1k Support value vector for a set #2 of orthologous genes P 21 … P 2k … ……… Support value vector for a set #m of orthologous genes P n1 … P nk Number of bipartitions (k) for N genomes is equal to 2 (N-1) -N-1.

Visualizing Multiple Genomes: SOMs SOM  Self-Organizing Map An artificial neural network approach to clustering we are looking for clusters of genes which favor certain tree topologies Advantages over other clustering approaches: No a priori knowledge of how many clusters to expect Explicit summary of commonalities and differences between clusters Visually appealing representation T. Kohonen, Self-organizing maps, 3rd ed. Berlin ; New York: Springer, 2001.

All clusters selected => ATV tree viewer applet (Zmasek & Eddy, Bioinformatics, 17, ) displays plurality consensus of all gene families. ATV allows to modify display

Select branch to place root Select to re-root tree

Cren- archaeota Euryarchaeota Root

List of strongly supported bipartitions, including conflicts

click to open map as pdf

select clusters that support bipartition “well behaved” gene families

gene families that group Archaeoglobus with Methanosarcina

prolyl-tRNA synthetase, a gene family that groups the Halobacteria with the outgroup. This gene was acquired by the halobacterial lineage from the bacteria. These rare inter-domain gene transfers allow to correlate evolution in the three domains of life. (see Huang & Gogarten: Ancient horizontal gene transfer can benefit phylogenetic reconstruction. Trends in Genetics 22 (7): )