Phylogenomics Symposium and Software School Co-Sponsored by the SSB and NSF grant 0733029.

Slides:



Advertisements
Similar presentations
A Separate Analysis Approach to the Reconstruction of Phylogenetic Networks Luay Nakhleh Department of Computer Sciences UT Austin.
Advertisements

Computer Science and Reconstructing Evolutionary Trees Tandy Warnow Department of Computer Science University of Illinois at Urbana-Champaign.
Profile HMMs Tandy Warnow BioE/CS 598AGB. Profile Hidden Markov Models Basic tool in sequence analysis Look more complicated than they really are Used.
Phylogenomics Symposium and Software School Tandy Warnow Departments of Computer Science and Bioengineering The University of Illinois at Urbana-Champaign.
Estimating species trees from multiple gene trees in the presence of ILS Tandy Warnow Joint work with Siavash Mirarab, Md. S. Bayzid, and others.
Recent breakthroughs in mathematical and computational phylogenetics
Supertrees and the Tree of Life
CS/BIOE 598: Algorithmic Computational Genomics Tandy Warnow Departments of Bioengineering and Computer Science
New HMM-based methods for Ultra-large Alignment and Phylogeny Estimation Tandy Warnow Departments of Bioengineering and Computer Science The University.
IPAM Workshop on Multiple Sequence Alignment Tandy Warnow The University of Illinois at Urbana-Champaign.
From Gene Trees to Species Trees Tandy Warnow The University of Texas at Austin.
Introduction to Phylogenomics and Metagenomics Tandy Warnow The Department of Computer Science The University of Texas at Austin.
Computational Phylogenomics and Metagenomics Tandy Warnow Departments of Bioengineering and Computer Science The University of Illinois at Urbana-Champaign.
New techniques that “boost” methods for large-scale multiple sequence alignment and phylogenetic estimation Tandy Warnow Department of Computer Science.
From Gene Trees to Species Trees Tandy Warnow The University of Texas at Austin.
New methods for inferring species trees in the presence of incomplete lineage sorting Tandy Warnow The University of Illinois.
From Gene Trees to Species Trees Tandy Warnow The University of Texas at Austin.
Ultra-large Multiple Sequence Alignment Tandy Warnow Founder Professor of Engineering The University of Illinois at Urbana-Champaign
SuperFine, Enabling Large-Scale Phylogenetic Estimation Shel Swenson University of Southern California and Georgia Institute of Technology.
New methods for inferring species trees in the presence of incomplete lineage sorting Tandy Warnow The University of Illinois.
New methods for inferring species trees in the presence of incomplete lineage sorting Tandy Warnow The University of Illinois.
CS 394C Algorithms for Computational Biology Tandy Warnow Spring 2012.
TIPP: Taxon Identification and Phylogenetic Profiling Tandy Warnow The Department of Computer Science.
Orangutan GorillaChimpanzee Human From the Tree of the Life Website, University of Arizona Species Tree.
New methods for estimating species trees from genome-scale data Tandy Warnow The University of Illinois.
New methods for estimating species trees from genome-scale data Tandy Warnow The University of Illinois.
From Gene Trees to Species Trees Tandy Warnow The University of Texas at Austin.
Algorithms research Tandy Warnow UT-Austin. “Algorithms group” UT-Austin: Warnow, Hunt UCB: Rao, Karp, Papadimitriou, Russell, Myers UCSD: Huelsenbeck.
394C, October 2, 2013 Topics: Multiple Sequence Alignment
SEPP and TIPP for metagenomic analysis Tandy Warnow Department of Computer Science University of Texas.
BBCA: Improving the scalability of *BEAST using random binning Tandy Warnow The University of Illinois at Urbana-Champaign Co-authors: Theo Zimmermann.
TIPP: Taxon Identification and Phylogenetic Profiling Tandy Warnow The Department of Computer Science The University of Texas at Austin.
Constructing the Tree of Life: Divide-and-Conquer! Tandy Warnow University of Illinois at Urbana-Champaign.
New methods for estimating species trees from genome-scale data Tandy Warnow The University of Illinois.
New HMM-based methods for Ultra-large Alignment and Phylogeny Estimation Tandy Warnow Departments of Bioengineering and Computer Science The University.
Using Divide-and-Conquer to Construct the Tree of Life Tandy Warnow University of Illinois at Urbana-Champaign.
SupreFine, a new supertree method Shel Swenson September 17th 2009.
Family of HMMs Nam Nguyen University of Texas at Austin.
New methods for estimating species trees from genome-scale data Tandy Warnow The University of Illinois.
The Mathematics of Estimating the Tree of Life Tandy Warnow The University of Illinois.
Three approaches to large- scale phylogeny estimation: SATé, DACTAL, and SEPP Tandy Warnow Department of Computer Science The University of Texas at Austin.
TIPP: Taxon Identification using Phylogeny-Aware Profiles Tandy Warnow Founder Professor of Engineering The University of Illinois at Urbana-Champaign.
CS/BIOE 598: Algorithmic Computational Genomics Tandy Warnow Departments of Bioengineering and Computer Science
Algorithms for Ultra-large Multiple Sequence Alignment and Phylogeny Estimation Tandy Warnow Department of Computer Science The University of Texas at.
SEPP and TIPP for metagenomic analysis Tandy Warnow Department of Computer Science University of Texas.
Progress and Challenges for Large-Scale Phylogeny Estimation Tandy Warnow Departments of Computer Science and Bioengineering Carl R. Woese Institute for.
Simultaneous alignment and tree reconstruction Collaborative grant: Texas, Nebraska, Georgia, Kansas Penn State University, Huston-Tillotson, NJIT, and.
New methods for inferring species trees in the presence of incomplete lineage sorting Tandy Warnow The University of Illinois.
New methods for inferring species trees in the presence of incomplete lineage sorting Tandy Warnow The University of Illinois.
New HMM-based methods for Ultra-large Alignment and Phylogeny Estimation Tandy Warnow Departments of Bioengineering and Computer Science The University.
Ensembles of HMMs and their use in biomolecular sequence analysis Nam-phuong Nguyen Carl R. Woese Institute for Genomic Biology University of Illinois.
Advancing Genome-Scale Phylogenomic Analysis Tandy Warnow Departments of Computer Science and Bioengineering Carl R. Woese Institute for Genomic Biology.
Scaling BAli-Phy to Large Datasets June 16, 2016 Michael Nute 1.
Constrained Exact Optimization in Phylogenetics
CS 581 / BIOE 540: Algorithmic Computational Genomics
Chalk Talk Tandy Warnow
TIPP: Taxon Identification using Phylogeny-Aware Profiles
Mathematical and Computational Challenges in Reconstructing Evolution
Tandy Warnow The University of Illinois
Large-Scale Multiple Sequence Alignment
Mathematical and Computational Challenges in Reconstructing Evolution
TIPP: Taxon Identification using Phylogeny-Aware Profiles
Tandy Warnow Founder Professor of Engineering
Texas, Nebraska, Georgia, Kansas
Taxonomic identification and phylogenetic profiling
Ultra-large Multiple Sequence Alignment
New methods for estimating species trees from gene trees
Advances in Phylogenomic Estimation
Advances in Phylogenomic Estimation
Scaling Species Tree Estimation to Large Datasets
Presentation transcript:

Phylogenomics Symposium and Software School Co-Sponsored by the SSB and NSF grant

NSF DEB Large-Scale Simultaneous Multiple Alignment and Phylogeny Estimation PIs: – Tandy Warnow (UT-Austin) – Randy Linder (UT-Austin) – Mark Holder (Kansas) – Jim Leebens-Mack (Georgia) – Etsuko Moriyama (Nebraska)

Not all genes present in all species gene 1 S1S1 S2S2 S3S3 S4S4 S7S7 S8S8 TCTAATGGAA GCTAAGGGAA TCTAAGGGAA TCTAACGGAA TCTAATGGAC TATAACGGAA gene 3 TATTGATACA TCTTGATACC TAGTGATGCA CATTCATACC TAGTGATGCA S1S1 S3S3 S4S4 S7S7 S8S8 gene 2 GGTAACCCTC GCTAAACCTC GGTGACCATC GCTAAACCTC S4S4 S5S5 S6S6 S7S7

Combined analysis gene 1 S1S1 S2S2 S3S3 S4S4 S5S5 S6S6 S7S7 S8S8 gene 2gene 3 TCTAATGGAA GCTAAGGGAA TCTAAGGGAA TCTAACGGAA TCTAATGGAC TATAACGGAA GGTAACCCTC GCTAAACCTC GGTGACCATC GCTAAACCTC TATTGATACA TCTTGATACC TAGTGATGCA CATTCATACC TAGTGATGCA ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ?

... Analyze separately Supertree Method Two competing approaches gene 1 gene 2... gene k... Combined Analysis Species

Red gene tree ≠ species tree (green gene tree okay)

1KP: Thousand Transcriptome Project 1200 plant transcriptomes More than 13,000 gene families (most not single copy) iPLANT (NSF-funded cooperative) Gene sequence alignments and trees computed using SATé G. Ka-Shu Wong U Alberta N. Wickett Northwestern J. Leebens-Mack U Georgia N. Matasci iPlant T. Warnow, S. Mirarab, N. Nguyen, Md. S.Bayzid UT-Austin UT-Austin Gene Tree Incongruence

Avian Phylogenomics Project E Jarvis, HHMI G Zhang, BGI Approx. 50 species, whole genomes genes, UCEs Gene sequence alignments computed using SATé MTP Gilbert, Copenhagen S. Mirarab Md. S. Bayzid, UT-Austin UT-Austin T. Warnow UT-Austin Plus many many other people… Gene Tree Incongruence

Software School, Rooms 302 A-C 8:30-9:45: Tutorial 1 (PASTA: Multiple Sequence Alignment), Rooms 302A and 302B. 9:45-10:15 Break 10:15-11:45: Tutorial 3 (GARLI, 302A), Tutorial 6 (Phyldog, 302B), and Tutorial 7 (MulRF, 302C) 11:45-1:15 Lunch 1:15-2:30: Tutorial 5 (ASTRAL, 302A and 302B) and Tutorial 9 (HMMER, 302C) 2:30-3:45: Tutorial 2 (BAli-Phy, 302C), Tutorial 4 (Phylonet, 302B), and Tutorial 8 (Metagenomics, 302A)

Preparing for the Software School Today: revisit the Software School webpage to make sure you have downloaded and installed everything, and tested the software on your laptop. Contact the organizer of the tutorials you are taking if you are having trouble. Tomorrow: come with a fully charged laptop. You may want to plan to go back to your hotel room to recharge your laptop during the lunch break!

Feedback Your feedback would be very welcome. Please the tutorial providers directly if you have suggestions for improvements to the software. If you use the software in a subsequent study, please let us know!

Symposium 9:00-9:45, Tandy Warnow (large-scale multiple sequence alignment) 9:45-10:15, Coffee Break 10:15-11:00, Nam Nguyen (Taxon identification of metagenomic data) 11:00-11:45, Siavash Mirarab (The Avian Phylogenomics Project) 11:45-1:15, Lunch Break 1:15-2:00, Jim Leebens-Mack (The Thousand Transcriptome Project) 2:00-2:45, Luay Nakhleh (Phylogenetic Networks) 2:45-3:15 Coffee Break 3:15-4:45 Discussion