March 26, 2007 Phyloinformatics of Neuraminidase at Micro and Macro Levels using Grid-enabled HPC Technologies B. Schmidt (UNSW) D.T. Singh (Genvea Biosciences)

Slides:



Advertisements
Similar presentations
Phylogenetic Tree A Phylogeny (Phylogenetic tree) or Evolutionary tree represents the evolutionary relationships among a set of organisms or groups of.
Advertisements

F INDINGS National Institutes of Health National Institute of General Medical Sciences Viral Voyages Structural Biologist Mavis Agbandje-McKenna: Understanding.
The pandemic and a brief ABC of influenza Thomas Abraham JMSC 6090.
1 General Phylogenetics Points that will be covered in this presentation Tree TerminologyTree Terminology General Points About Phylogenetic TreesGeneral.
Phylogenetic Trees Understand the history and diversity of life. Systematics. –Study of biological diversity in evolutionary context. –Phylogeny is evolutionary.
Summer Bioinformatics Workshop 2008 Comparative Genomics and Phylogenetics Chi-Cheng Lin, Ph.D., Professor Department of Computer Science Winona State.
Phylogenetic reconstruction
Molecular Evolution Revised 29/12/06
© Wiley Publishing All Rights Reserved. Phylogeny.
Avian Influenza – What does it all mean? Important Background Information Island Paravets and Residents.
Influenza Ieuan Davies. Signs and Symptoms Influenza is an acute, viral respiratory infection. Fever, chills, headache, aches and pains throughout the.
Bioinformatics and Phylogenetic Analysis
H1N1: “Swine Flu”. Why you should care… Every year between 5 and 20% of the population gets the flu. The CDC estimates that the flu kills 36,000 people.
Methods for Phylogenetics and Evolutionary analysis Jianpeng Xu University of Nebraska-Omah a.
Influenza A Virus Pandemic Prediction and Simulation Through the Modeling of Reassortment Matthew Ingham Integrated Sciences Program University of British.
Probabilistic methods for phylogenetic trees (Part 2)
Materials and Methods Abstract Conclusions Introduction 1. Korber B, et al. Br Med Bull 2001; 58: Rambaut A, et al. Nat. Rev. Genet. 2004; 5:
“Species Trees”. What is the “species tree?” The true tree (when there is one) The population tree The dominant history ????
EPIDEMIOLOGY AND PREVENTION OF INFLUENZA. Introduction Unique epidemiology: – Seasonal attack rates of 10% to 30% – Global pandemics Influenza viruses.
Laboratory Training for Field Epidemiologists Typing May 2007 Sequencing and Phylogeny.
Development in hardware – Why? Option: array of custom processing nodes Step 1: analyze the application and extract the component tasks Step 2: design.
The Avian flu The Next Pandemic. At the end of every summer million of ducks and wild geese mass on Canadian and Siberian lakes for their annual migration.
D-Influenza virus. Influenza epidemiology in humans Fields Virology, 2nd ed, Fields & Knipe, eds, Raven Press, 1990, Fig.40-1.
Microbiology of Influenza
Molecular phylogenetics 1 Level 3 Molecular Evolution and Bioinformatics Jim Provan Page and Holmes: Sections
Bioinformatics 2011 Molecular Evolution Revised 29/12/06.
Using Comparative Genomics to Explore the Genetic Code of Influenza Sangeeta Venkatachalam.
Evolution of influenza A Rachel Albert Craig Bland Evolution of influenza A.
Influenza A/H1N1 W. Rose H1N1 is a subtype of influenza type A Influenza types B and C also exist but less common, less infectious, and drift.
What’s up with the flu? Novel H1N1? SWINE FLU??? Mexican flu? swine-origin influenza A? A(H1N1)? S-OIV? North American flu? California flu? Schweingrippe.
INTRODUCTION Cause RTI Cause RTI Genetic variation (shift and drift) Genetic variation (shift and drift) Estimated million deaths worldwide in pandemic.
Phylogenetic Trees  Importance of phylogenetic trees  What is the phylogenetic analysis  Example of cladistics  Assumptions in cladistics  Frequently.
Calculating branch lengths from distances. ABC A B C----- a b c.
Antigenic Shift v. Drift in Avian and Mammalian Sino- Influenza Type A Viruses. By Charles Hauser, St. Edward’s University Mark Maloney, Spelman College.
REASSORTMENT OF INFLUENZA VIRUS
Evolution and Human Health – Chapter 14 - Flu virus evolution - Antibiotic resistance - Cancer as set of evolving population - Adaptative reasoning and.
More statistical stuff CS 394C Feb 6, Today Review of material from Jan 31 Calculating pattern probabilities Why maximum parsimony and UPGMA are.
Virion Structure and Organization
A rapid visualization pipeline for the 2009 A/H1N1 influenza pandemic Donovan Parks Norm MacDonald Rob Beiko.
Influenza H1N1 A: A close insight Dr. Mustafa Ababneh Molecular Virologist.
Emerging Diseases Lecture 12: Influenza Virus and the 1918 Pandemic 12.1 Overview 12.2 The pathogen-Influenza Virus A 12.3: Naming System 12.4: A Disease.
The New Influenza A/H1N1 Isabelle Thomas May 28-29, 2009 Brussels,
INFLUENZA LUKE UYEMURA ENGLISH 100 ESP. BASIC INFO Definition: Influenza, more commonly know as the flu, is a viral infection that attacks your respiratory.
SAN DIEGO SUPERCOMPUTER CENTER at the UNIVERSITY OF CALIFORNIA, SAN DIEGO Hybrid MPI/Pthreads Parallelization of the RAxML Phylogenetics Code Wayne Pfeiffer.
Applied Bioinformatics Week 8 Jens Allmer. Theory I.
Ayesha M.Khan Spring Phylogenetic Basics 2 One central field in biology is to infer the relation between species. Do they possess a common ancestor?
Virologia Applicata E.A. Influenza VIROLOGIA. Virologia Applicata E.A. Influenza The virus and its replication.
“Neutralizing Antibodies Derived from the B Cells of 1918 Influenza Pandemic Survivors” (Yu et. al) Daniel Greenberg.
Bayesian Evolutionary Analysis by Sampling Trees (BEAST) LEE KIM-SUNG Environmental Health Institute National Environment Agency.
Viral Genetics. The objectives What genetic changes are detected in viruses? The effects of viral genetic changes on viral virulence. The application.
Abstract Our research mainly applies Maximum Likelihood Method (MLE), Dynamic Programming, and Neighbor Joining Method in an attempt of shortening the.
Viral evolution and adaption
Phylogeny and the Tree of Life
Bioinformatics Overview
Introduction to Bioinformatics Resources for DNA Barcoding
Emerging Diseases Lecture 12: Influenza Virus and the 1918 Pandemic
Comparative genotypic and phenotypic characterization of
Emerging Diseases Lecture 12: Influenza Virus and the 1918 Pandemic
Influenza Virus: Evolution in real time
Phylogenetic Inference
What makes H5N1 Avian influenza “Avian”
Influenza يك بيماري بسيار مسري عفوني ويروسي است.
Di Liu, Weifeng Shi, George F Gao
Business Process Management Software
Origins of Human Virus Diversity
Emerging Diseases Lecture 12: Influenza Virus and the 1918 Pandemic
A Web-based Interactive Genome Library for Surveillance, Detection, Characterization and Drug-Resistance Monitoring of Influenza Virus Infection in the.
Chapter 19 Molecular Phylogenetics
J. -H. Lin, S. -C. Chiu, J. -C. Cheng, H. -W. Chang, K. -L. Hsiao, Y
Neonatal HSV-2 genomes are genetically distinct from one another and encompass a broad range of known HSV-2 genetic diversity. Neonatal HSV-2 genomes are.
Presentation transcript:

March 26, 2007 Phyloinformatics of Neuraminidase at Micro and Macro Levels using Grid-enabled HPC Technologies B. Schmidt (UNSW) D.T. Singh (Genvea Biosciences) R. Trehan, T. Bretschneider (NTU, Singapore)

March 26, 2007 Contents H5N1 Genetics H5N1 Phyloinformatics Design Principles of Quascade H5N1 Phyloinformatics with Quascade Results Conclusion and Future work

March 26, 2007 H5N1 Genetics Belongs to the Influenza A virus type Segmented RNA genome 8 genes, 11 proteins Classification based on: –Hemagglutinin (HA): 15 subtypes –Neuraminidase (NA): 9 subtypes Genetic variations in HA/NA Genetic drift –Point mutations –1918 Spanish flu Genetic shift –Reassortment of the segmented genome –1957, 1968, 1997 pandemics –2003 Z strain of H5N1

March 26, 2007 H5N1 Phyloinformatics Essential to monitor new emerging strains –Molecular evolution at gene and genome level –Phylogenetic analysis for determining the origin of new strains Phylogenetics –How fast do proteins evolve? –What is the best method to measure the evolution? –How to obtain the best phylogenetic tree? Phylogenetic algorithms –Character based Maximum Parsimony, Maximum Likelihood (ML) –Distance based UPGMA, Neighborhood Join (NJ) –Bayesian MCMC based Mr. Bayes, BEAST

March 26, 2007 Quascade – User Interface Example Communication A data-flow tool in which each black-box represents Java objects running on different computers! Assignment of objects to available computers done automatically (manually if required) Communication between objects done transparently Configuration of objects done before run-time Processing pipeline

March 26, 2007 Java Object Java Object Java Object Coding in regular Java/ C/ C++ Persistent – activated whenever all data-inputs present No explicit messaging protocol required No distributed computing concepts need to be understood Objects automatically or manually assigned to computers / CPU-cores Object Features

March 26, 2007 Phyloinformatics Workflow with Quascade

March 26, 2007 Parallelized Phyloinformatics Workflow

March 26, 2007 Data and Algorithms Core Group –22 H5N1 NA sequences from SwissProt and TREMBL Medium Set –581 NA H5N1 sequences from Uniprot Large Set –909 NA Influenza A sequences from Uniprot ProtDist –NJ –UPGMA ProtPars ProtML Mr. Bayes

March 26, 2007 Runtime and Scalability (NA Bird Flu Protein) 25 processors sequences 581 sequences Processing time [h] Distance-based workflow sequences 581 sequences Processing time [h] MP workflow 1 processor

March 26, 2007 Mr Bayes – Tree Core Set

March 26, 2007 Analysis and Observations Clustering possibilities –Temporal, host-based, geographical Algorithms –Mr. Bayes and ProtML are most consistent in their performance –Too compute-intensive for the larger “macro” sets Observed pattern –All phylograms yielded geographic-based clustering rather than time- based clustering –Host ranges along clustered clades vary –Same strain with identical NA sequences can infect different hosts –NA may not be the sole factor responsible for determining the diverse host range –Glycan site acquisition or loss seems to play a critical role in the molecular evolution of H5N1 NA –Identification of “bridging isolates” may help in rapid monitoring and development of global scale warning system for H5N1

March 26, 2007 Conclusion and Future Work Quascade –New graphical data-flow tool to design automatically grid-enabled pipelines / workflows –Supports implicit high-performance parallelization –Supports persistent components –Can be used with Java / C/ C++ code or application-binaries H5N1 Phyloinformatics –Can take advantage of workflow system and HPC –Can be easily used and modified by biologists –Use H5N1 NA sequences to better understand evolution of H5N1 –Analysis of H5N1 NA data with different algorithms indicates spatial clustering based on geographical distribution rather than temporal or host. Future work –Studies in conjunction with other proteins such as HA, Polymerase etc., and also at gene and genome level