iPlant Collaborative Tools and Services Workshop
Building and Using Workflows Within the DE; Phylogenetics
Workflows within the DE; Phylogenetics
Why is the Tree of Life Important?
“Knowledge of evolutionary relationships is fundamental to biology, yielding new insights across the plant sciences, from comparative genomics and molecular evolution, to plant development, to the study of adaptation, speciation, community assembly, and ecosystem functioning.”
We like to put things into categories!
Why is the Tree of Life Important?
"Midway between the unintelligible and the commonplace, it is metaphor which produces most knowledge." (Aristotle, Rhetoric III)
Classifications represent inferred evolutionary relationships*
[Figure: example tree with tip labels A through F]
* But not always
Classifications represent inferred evolutionary relationships*
Consider primates: Do humans make up a monophyletic group?
[Figure label: (E) human]
What is a monophyletic group, again?
Classifications represent inferred evolutionary relationships*
[Figure labels: (E) human; families Hylobatidae, Pongidae, Hominidae]
Classifications represent inferred evolutionary relationships*
A phylogeny based on a globin pseudogene suggests that humans and chimpanzees make up a single monophyletic group.
[Figure label: outgroup]
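A monophyletic group (clade) is an ancestor together with all of its descendants, so a set of taxa is monophyletic on a given tree only if some subtree contains exactly those taxa. The check can be sketched in a few lines of Python. This is a toy illustration, not part of the workshop materials: the nested-tuple tree, the one-tip-per-genus simplification, and the function names are assumptions made for the example.

```python
# Toy rooted tree as nested tuples; each string is one tip.
# Topology: gibbon branches first, then orangutan, then gorilla,
# with chimpanzee and human as sister taxa (illustrative simplification).
TREE = ("gibbon", ("orangutan", ("gorilla", ("chimpanzee", "human"))))


def leaves(node):
    """Return the set of tip names found under a node."""
    if isinstance(node, str):
        return frozenset([node])
    return frozenset().union(*(leaves(child) for child in node))


def is_monophyletic(tree, taxa):
    """True if some subtree of `tree` contains exactly the tips in `taxa`."""
    target = frozenset(taxa)
    if leaves(tree) == target:
        return True
    if isinstance(tree, str):
        return False
    return any(is_monophyletic(child, target) for child in tree)


if __name__ == "__main__":
    # Humans plus chimpanzees form a clade on this tree.
    print(is_monophyletic(TREE, {"human", "chimpanzee"}))
    # A great-ape group that leaves out humans does not.
    print(is_monophyletic(TREE, {"orangutan", "gorilla", "chimpanzee"}))
```

On this toy tree a group made of orangutan, gorilla, and chimpanzee fails the check because the smallest subtree containing all three also contains human, which is the sense in which a classification can disagree with the inferred relationships.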
How can iPlant help with phylogenetics?
Scalability
Trees also present computational challenges
[Figure label: Number of atoms in the universe]
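The comparison with the number of atoms in the universe can be made concrete with a short calculation. The number of distinct unrooted, fully bifurcating trees on n labeled tips is the double factorial (2n - 5)!!, which grows faster than exponentially. The sketch below assumes the commonly quoted order-of-magnitude figure of 10^80 atoms; that figure and the function names are assumptions for illustration, not values taken from the slides.

```python
# Assumed order-of-magnitude estimate of atoms in the observable universe.
ATOMS_ESTIMATE = 10 ** 80


def unrooted_tree_count(n_taxa):
    """Number of unrooted bifurcating trees on n_taxa labeled tips: (2n - 5)!!"""
    if n_taxa < 3:
        raise ValueError("need at least 3 taxa")
    count = 1
    # Multiply the odd numbers 3, 5, ..., 2n - 5.
    for odd in range(3, 2 * n_taxa - 4, 2):
        count *= odd
    return count


def smallest_n_exceeding(limit):
    """Smallest number of taxa whose tree count is greater than `limit`."""
    n = 3
    while unrooted_tree_count(n) <= limit:
        n += 1
    return n


if __name__ == "__main__":
    for n in (5, 10, 20):
        print(n, unrooted_tree_count(n))
    print("taxa needed to exceed the atom estimate:",
          smallest_n_exceeding(ATOMS_ESTIMATE))
```

The threshold is reached with well under a hundred taxa, which is why exhaustive search over tree space is not an option for data sets with thousands of species.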
Trees also present computational challenges
It can take weeks or months to analyze data sets with more than 100,000 species.
Example of iPlant contribution: NINJA/WINDJAMMER (Neighbor-Joining)
NINJA: 216K species, ~8 days
WINDJAMMER: 216K species, ~4 hours (roughly a 48-fold speedup)
How can we scale up phylogenetic tree visualization?
Largest published tree: Goloboff et al. (73,060 species)
HD TV: 1920 × ,533 names
Largest computer monitors: 3280 × 2048 (can be tiled)
Laser printer: effectively 3600 × 4725 (can be tiled)
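A rough calculation shows why these display sizes fall short for a 73,060-tip tree drawn with one tip per row. The sketch below takes the second number of each resolution as the vertical pixel count and assumes a label needs at least 8 pixels of height to be readable. Both the 8-pixel figure and the function names are assumptions for illustration only.

```python
import math

TIPS = 73_060            # largest published tree cited above
MIN_LABEL_PX = 8         # assumed minimum readable label height, in pixels

# Vertical pixel counts taken from the resolutions listed above.
DEVICE_HEIGHTS = {
    "large monitor": 2048,
    "laser printer page": 4725,
}


def tips_per_pixel_row(tips, height_px):
    """How many tips share each pixel row if the whole tree fits on one display."""
    return tips / height_px


def tiles_needed(tips, height_px, label_px=MIN_LABEL_PX):
    """Displays stacked vertically so every tip label gets `label_px` rows."""
    labels_per_tile = height_px // label_px
    return math.ceil(tips / labels_per_tile)


if __name__ == "__main__":
    for name, height in DEVICE_HEIGHTS.items():
        print(name,
              round(tips_per_pixel_row(TIPS, height), 1), "tips per pixel row;",
              tiles_needed(TIPS, height), "tiles for readable labels")
```

Under these assumptions even the tiled options need hundreds of monitors or more than a hundred printed pages, which motivates a viewer that shows only part of the tree in detail at any one time.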
Prototype iPlant tree viewer
Phylogenetic Workflow Lab
Scientific Question: How are ATP Synthase Subunit B (atpB) genes in basal groups of Magnoliophyta species related?
ATP Synthase Subunit B gene, Basal Magnoliophyta.
Please log in to the Discovery Environment.
Follow along with the instructor, or follow along with the handouts on your own.
Use workflows to simplify your life
Extend your research beyond just next-gen sequence analysis.
Chain together multiple apps and make it easy for your collaborators to copy your exact workflow.
Future improvements will include branching and decision points that further extend capabilities, such as parameter sweeps.
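The idea of chaining apps so that a collaborator can repeat the exact same steps can be illustrated in plain Python. This is a minimal sketch and not the Discovery Environment's interface: `run_workflow` and the toy string-processing steps are hypothetical stand-ins for real apps.

```python
def run_workflow(steps, data):
    """Pass `data` through each (name, function) step in order.

    Returns the final result plus the ordered list of step names, which
    serves as a record someone else could follow to repeat the run.
    """
    log = []
    for name, func in steps:
        data = func(data)
        log.append(name)
    return data, log


# Hypothetical toy steps standing in for real analysis apps.
STEPS = [
    ("strip whitespace", str.strip),
    ("uppercase", str.upper),
    ("reverse", lambda sequence: sequence[::-1]),
]

if __name__ == "__main__":
    result, log = run_workflow(STEPS, "  atgc ")
    print(result)
    print(" -> ".join(log))
```

Because the workflow is just an ordered list of named steps, sharing that list is enough for someone else to rerun the same chain on their own input.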