Www.fludb.org Sequence Feature Variant Type and Evolutionary Trajectory Analysis using the Influenza Research Database (IRD) 19 July 2011 Richard H. Scheuermann,

Slides:



Advertisements
Similar presentations
John R. LaMontagne Memorial Symposium on Pandemic Influenza Research Working Group 8 Virus Transmission: Understanding and Predicting Pandemic Risk.
Advertisements

The pandemic and a brief ABC of influenza Thomas Abraham JMSC 6090.
Virus Pathogen Resource (ViPR) 26 September 2011 Richard H. Scheuermann, Ph.D. Department of Pathology U.T. Southwestern Medical Center.
Centers of Excellence for Influenza Research and Surveillance 6 th Annual Meeting Aug 1, 2012 Status of IRD Development.
Avian Influenza – The Bird Flu
Introduction to Bioinformatics Richard H. Scheuermann, Ph.D. Director of Informatics JCVI.
Host cell responses to viral infection can be monitored by a variety of different high throughput experimental methodologies in order to understand the.
Influenza Sara Finestone April 8, The influenza virus causes 3-5 million cases of severe illness and up to 500,000 deaths annually.
Bioinformatics Resource Centers Influenza Research Database (IRD) Virus Pathogen Database and Analysis Resource (ViPR) 8 December 2010 Richard.
Influenza A Virus Pandemic Prediction and Simulation Through the Modeling of Reassortment Matthew Ingham Integrated Sciences Program University of British.
Materials and Methods Abstract Conclusions Introduction 1. Korber B, et al. Br Med Bull 2001; 58: Rambaut A, et al. Nat. Rev. Genet. 2004; 5:
Integrated Bioinformatics Data and Analysis Tools for Herpesviridae Viruses in the Virus Pathogen Resource (ViPR) Yun Zhang 1, Brett Pickett 1, Eva Sadat.
Large-Scale Sequencing of the Influenza Virus Genome Steven Salzberg Center for Bioinformatics and Computational Biology University of Maryland Institute.
Pandemic Influenza Preparedness Kentucky Department for Public Health Department for Public Health.
Epidemic Vs Pandemic 8.L.1.2.
EPIDEMIOLOGY AND PREVENTION OF INFLUENZA. Introduction Unique epidemiology: – Seasonal attack rates of 10% to 30% – Global epidemics Influenza viruses.
EPIDEMIOLOGY AND PREVENTION OF INFLUENZA. Introduction Unique epidemiology: – Seasonal attack rates of 10% to 30% – Global pandemics Influenza viruses.
Evolution as a Confounding Factor in Genetic Association Studies 14 December 2011 Richard H. Scheuermann, Ph.D. Department of Pathology U.T. Southwestern.
Laboratory Training for Field Epidemiologists Typing May 2007 Sequencing and Phylogeny.
The evolution of infectious disease. Influenza We generally think of the flu as nothing more than a minor annoyance In an average year, however, the flu.
Databases and tools to study the genomes of hundreds of pathogens, plants, and mammals Richard H. Scheuermann, Ph.D. Director of Informatics J. Craig Venter.
Influenza Research Database (IRD): A Web-based Resource for Influenza Virus Data and Analysis Victoria Hunt 1 *, R. Burke Squires 1, Jyothi Noronha 1,
Laboratory of Molecular Pathology Retreat - 10 MAR 2011
Pandemic Influenza; A Harbinger of Things to Come Michael T Osterholm PhD, MPH Director, Center for Infectious Disease Research and Policy Associate Director,
Learning from the 2009 H1N1 Pandemic Response 1 Daniel S. Miller MD, MPH Director, International Influenza Unit Office of the Secretary Office of Global.
Data Mining in the Influenza Research Database (IRD) and the Virus Pathogen Resource (ViPR) JCVI-GSCID/NIAID Workshop University of Limpopo 01 June 2011.
Comparative Genomics in the Influenza Research Database 17 June 2011 Richard H. Scheuermann, Ph.D. Department of Pathology U.T. Southwestern.
4th Year MPharm SRP: Introduction to Pseudotype Viruses
Sequence Variation Identification and Functional/Structural Inference in the Influenza Research Database (IRD) and Virus Pathogen Resource (ViPR) Yun Zhang.
Richard H. Scheuermann, Ph.D. Department of Pathology, UT Southwestern March 30, 2011 Virus Bioinformatics Resource Centers – ViPR & IRD.
Influenza A H1N1: A Pandemic in Real Time – What’s Next? Danny Chen, MD FRCPC MSc Infectious Disease Specialist Grand Rounds, York Central Hospital September.
Influenza Research Database (IRD) 26 September 2011 Richard H. Scheuermann, Ph.D. Department of Pathology U.T. Southwestern Medical Center.
Influenza Michelle Lin.
BioHealthBase: The Bioinformatics Resource Center for Francisella tularensis Shubhada Godbole 1, Stephen M. Beckstrom-Sternberg 2,3, Paul S. Keim 2,3,
Statistical Tool for Identifying Sequence Variations That Correlate with Virus Phenotypic Characteristics in the Virus Pathogen Resource (ViPR) July 22,
BioHealthBase: A Web-based Database and Analysis Resource for Francisella Shubhada Godbole 1, Jyothi Noronha 1, Burke Squires 1, Victoria Hunt 1, Ed Klem.
Evolution of influenza A Rachel Albert Craig Bland Evolution of influenza A.
THE QUESTION: SHOULD I GET A FLU SHOT EACH YEAR?.
Evolution and Human Health. I.Motivation Evolutionary principles can contribute to understanding of origin and treatment of human disease Evolutionary.
What’s up with the flu? Novel H1N1? SWINE FLU??? Mexican flu? swine-origin influenza A? A(H1N1)? S-OIV? North American flu? California flu? Schweingrippe.
Yun Zhang J. Craig Venter Institute San Diego, CA, USA August 4, 2012 Integrated Bioinformatics Data and Analysis Tools for Herpesviridae.
Statistical Tool for Identifying Sequence Variations that Correlate with Virus Phenotypic Characteristics in the Virus Pathogen Resource (ViPR) Brett E.
P ANDEMICS T HROUGHOUT H ISTORY. A pandemic is defined as an unusually high outbreak of a new infectious disease that is spreading through the human population.
BIG Data: Knowledge for Improving Vaccine Virus Selection Richard H. Scheuermann, Ph.D. Director of Informatics JCVI.
John R. LaMontagne Memorial Symposium on Pandemic Influenza Research April 4-5, 2005 Institute of Medicine Working Group One: Influenza Virulence and Antigenic.
REASSORTMENT OF INFLUENZA VIRUS
Molecular Dynamics of the Avian Influenza Virus Team Members: Ashvin Srivatsa, Michael Fu, Ellen Chuang, Ravi Sheth Team Leader: Yuan Zhang.
Integration of Host Factor Data into the Virus Pathogen Database and Analysis Resource (ViPR) and the Influenza Research Database (IRD) Brett E. Pickett.
Virion Structure and Organization
The Informatics Crystal Ball: Mining the Past to Predict the Species Jump Event 19 April 2011 Richard H. Scheuermann, Ph.D. Department of.
Richard H. Scheuermann, Ph.D. November 5, 2012 Support for Systems Biology Data in IRD/ViPR.
Influenza H1N1 A: A close insight Dr. Mustafa Ababneh Molecular Virologist.
Emerging Diseases Lecture 12: Influenza Virus and the 1918 Pandemic 12.1 Overview 12.2 The pathogen-Influenza Virus A 12.3: Naming System 12.4: A Disease.
The New Influenza A/H1N1 Isabelle Thomas May 28-29, 2009 Brussels,
Viral Genomics: Strength in Numbers David Spiro Assistant Investigator J. Craig Venter Institute
Avian Influenza: A Zoonotic Disease of International Importance 1.
Influenza Ontology Infectious Disease Ontology Workshop 2008 Burke Squires.
Virologia Applicata E.A. Influenza VIROLOGIA. Virologia Applicata E.A. Influenza The virus and its replication.
P ANDEMICS T HROUGHOUT H ISTORY. A pandemic is defined as an unusually high outbreak of a new infectious disease that is spreading through the human population.
“Neutralizing Antibodies Derived from the B Cells of 1918 Influenza Pandemic Survivors” (Yu et. al) Daniel Greenberg.
Daniel Janies, Ph.D. Carol Grotnes Belk Distinguished Professor of Bioinformatics and Genomics College of Computing and Informatics University of North.
Emerging Diseases Lecture 12: Influenza Virus and the 1918 Pandemic
Emerging Diseases Lecture 12: Influenza Virus and the 1918 Pandemic
Influenza Virus: Evolution in real time
OUTBREAK.
What makes H5N1 Avian influenza “Avian”
وبائية أنفلونزا الطيور والإجراءات المتخذة لمواجهة الوباء العالمي
Emerging Diseases Lecture 12: Influenza Virus and the 1918 Pandemic
Decision Making During a Novel H1N1 Influenza Epidemic
NP sequence alignment of human H1N1 strains and phylogenetic analysis of representative NP from pandemic and seasonal H1N1 strains. NP sequence alignment.
Presentation transcript:

Sequence Feature Variant Type and Evolutionary Trajectory Analysis using the Influenza Research Database (IRD) 19 July 2011 Richard H. Scheuermann, Ph.D. Department of Pathology U.T. Southwestern Medical Center

Outline Brief overview of NIAID-Sponsored Influenza Research Database (IRD) – Comprehensive integrated database – Analysis and visualization tools – U.S. NIH-funded, free access, open to all – Developed by a team of research scientists, bioinformaticians and professional software developers – – for other human viral pathogens Novel approach to genotype-phenotype association studies – Sequence Feature Variant Type (SFVT) analysis Evolutionary Trajectory analysis of the pandemic (H1N1) 2009 strain

Public Health Impact of Influenza Seasonal flu epidemics occur yearly during the fall/ winter months and result in 3-5 million cases of severe illness worldwide. More than 200,000 people are hospitalized each year with seasonal flu-related complications in the U.S. Approximately 36,000 deaths occur due to seasonal flu each year in the U.S. Populations at highest risk are children under age 2, adults age 65 and older, and groups with other comorbidities. Pandemics – 1918 Spanish flu (H1N1); million deaths – 1957 Asian flu (H2N2); million deaths – 1968 Hong Kong flu (H3N2); 750, million deaths – 2009 Swine origin (H1N1); > 16,000 deaths as of March 2010 Source: World Health Organization -

Influenza Virus Orthomyxoviridae family Negative-strand RNA Segmented Enveloped 8 RNA segments encode 11 proteins Classified based on serology of HA and NA

IRD Overview

Search Access to Data

Data Types

Core Query Attributes

Advanced Query Options

Segment search results

Analysis and Visualization

Analysis and Visualization Tools

Workbench Access

My Private Workbench

IRD Summary Funded by U.S. National Institute of Allergy and Infectious Diseases (NIAID) Free and open access with no use restrictions Developed by a team of research scientists, bioinformaticians and professional software developers Comprehensive collection of public data Novel derived data, novel analytical tools, unique functions Integration – Integration – Integration

NOVEL APPROACH TO GENOTYPE-PHENOTYPE ASSOCIATION STUDIES – SEQUENCE FEATURE VARIANT TYPE (SFVT) ANALYSIS

Limitations to Phylogenetics Traditional virus phylogenetics focuses on comparative analysis of whole genome/genome segments, and is most useful to understand virus evolution However, the genetic determinants of important viral phenotypes, e.g. virulence, host range, replication efficiency, immune response evation, etc., are determined by focused functional regions of viral proteins Therefore, specific genotype-phenotype association can be masked by other evolutionary factors that contribute to traditional phylogenetic analysis

SFVT approach VT-1I F D R L E T L I L VT-2I F N R L E T L I L VT-3I F D R L E T I V L VT-4L F D Q L E T L V S VT-5I F D R L E N L T L VT-6I F N R L E A L I L VT-7I Y D R L E T L I L VT-8I F D R L E T L V L VT-9I F D R L E N I V L VT-10I F E R L E T L I L VT-11 L F D Q M E T L V S Influenza A_NS1_nuclear-export-signal_137(10) Identify regions of protein/gene with known structural or functional properties – Sequence Features (SF) an alpha-helical region, the binding site for another protein, an enzyme active site, an immune epitope Determine the extent of sequence variation for each SF by defining each unique sequence as a Variant Type (VT) High-level, comprehensive grouping of all virus strains by VT membership for each SF independently Genotype-phenotype association statistical analysis, e.g. genetic determinants of host range, virulence, replication rate Influenza A_NS1_alpha-helix_171(17)

SF definition Based on experimentation reported in the literature and 3D protein structures (PDB records) Captured by manual curation Defined by the specific amino acid positions in the polypeptide chain Annotated with the know structural or functional properties

Influenza A Sequence Features as of 18JUL SFs total

NS1 Sequence Features

SF8 (nuclear export signal)

VT for SF8 (nuclear export signal)

VT-1 strains

DO VARIATIONS IN NS1 SEQUENCE FEATURES INFLUENCE INFLUENZA VIRUS HOST RANGE?

NS1 Sequence Features

VT for SF8 (nuclear export signal)

VT distribution by host

Causes of apparent NS1 VT- associated host range restriction Virus spread - capability + opportunity – Phenotypic property of the virus – limited capacity – Restricted founder effect – limited opportunity Restricted spatial-temporal distribution Sampling bias – assumption of random sampling – Oversampling – avian H5N1 in Asia; 2009 H1N1 – Undersampling – large and domestic cats Linkage to causative variant

VT-11 strains

VT for SF8 (nuclear export signal)

VT lineages

VT-4 lineage

VT-4 lineage = B allele/group

VT-16 & VT-9 lineages

VT-7 lineage

EVOLUTIONARY TRAJECTORY ANALYSIS OF THE PANDEMIC (H1N1) 2009 STRAIN

Phylogenetic Analysis Evolutionary origin – Select a representative pandemic (H1N1) 2009 sequence from the IRD database – BLAST to identify most similar sequences – Assess phylogenetic relationships

Pandemic (H1N1) 2009 selection

BLAST Result

Segment 1 phylogenetic tree Swine/Ohio/2004 Duck/USA/2000s Human/USA/2007 (seasonal) Swine/USA/1990s Pandemic (H1N1) 2009

Temporal component Reference strain – A/California/04/2009 BLAST – Return top 1000 results Normalize data Graph nucleotide differences versus isolation year differences

NP chart

NS chart

HA chart

Group 1 Group 3 Group 2

<= Cali/04/09 NS blue cluster (G1)

<= Cali/04/09 NS green cluster (G2)

Phylogenetic Trees Quantification Analysis method – Build tree for Group 1 and Group 2 strains separately – Analyze branch lengths of trees Results – Avg. Group 1 Branch Length: (S.D ) – Avg. Group 2 Branch Length: (S.D ) – T-test (2 sample, unequal variance):

Group 1 Group 3 Group 2

HA trendline

Evolutionary Trajectory Slopes vs. Mutation Rate SegmentGroup 1 SlopeGroup 2 SlopeMutation Rate PB PB PA HA NP NA M NS Substitutions/segment/year

Evolutionary Trajectory (E.T.) Similar but Distantly Related (SDR)

Garten, et al. Science 2009

Garten, et al. Science 2009

<= Cali/04/09 ET

<= Cali/04/09 SDR

North American H1N1 Lineage - HA H1N American Swine, 2000’s North American H1N1 Lineage HA – Group 1 American Swine, 90’s American Swine, 80’s American Swine, 70’s American Swine, ’s

Evolutionary Trajectory Plots Evolutionary Trajectory of a strain, with candidates displayed.

Summary The Influenza Research Database (IRD) provides a comprehensive resource of data, analysis and visualization tools about influenza virus – SFVT represents a novel tool that can be used to better understand genotype-phenotype relationships for flu Use of IRD to illuminate the viral origins of the pandemic (H1N1) 2009 virus IRD is continually evolving to capture and integrate addition data and analytical tools to support the needs of the influenza research community

72 U.T. Southwestern – Richard Scheuermann (PI) – Burke Squires – Jyothi Noronha – Victoria Hunt – Shubhada Godbole – Brett Pickett – Yun Zhang MSSM – Adolfo Garcia-Sastre – Eric Bortz – Gina Conenello – Peter Palese Vecna – Chris Larsen – Al Ramsey LANL – Catherine Macken – Mira Dimitrijevic U.C. Davis – Nicole Baumgarth Northrop Grumman – Ed Klem – Mike Atassi – Kevin Biersack – Jon Dietrich – Wenjie Hua – Wei Jen – Sanjeev Kumar – Xiaomei Li – Zaigang Liu – Jason Lucas – Michelle Lu – Bruce Quesenberry – Barbara Rotchford – Hongbo Su – Bryan Walters – Jianjun Wang – Sam Zaremba – Liwei Zhou IRD SWG – Gillian Air, OMRF – Carol Cardona, Univ. Minnesota – Adolfo Garcia-Sastre, Mt Sinai – Elodie Ghedin, Univ. Pittsburgh – Martha Nelson, Fogarty – Daniel Perez, Univ. Maryland – Gavin Smith, Duke Singapore – David Spiro, JCVI – Dave Stallknecht, Univ. Georgia – David Topham, Rochester – Richard Webby, St Jude USDA – David Suarez Sage Analytica – Robert Taylor – Lone Simonsen CEIRS Centers Acknowledgments N01AI40041

Segment 6 (NA) By Host