Sequence Variation Identification and Functional/Structural Inference in the Influenza Research Database (IRD) and Virus Pathogen Resource (ViPR) Yun Zhang.

Slides:



Advertisements
Similar presentations
1.1.3 MI.
Advertisements

Office of Infectious Diseases Computational Challenges for Infectious Diseases Michael Shaw, PhD OID/Office of the Director.
Representing the Immune Epitope Database in OWL Jason A. Greenbaum 1, Randi Vita 1, Laura Zarebski 1, Hussein Emami 2, Alessandro Sette 1, Alan Ruttenberg.
Virus Pathogen Resource (ViPR) 26 September 2011 Richard H. Scheuermann, Ph.D. Department of Pathology U.T. Southwestern Medical Center.
Centers of Excellence for Influenza Research and Surveillance 6 th Annual Meeting Aug 1, 2012 Status of IRD Development.
Standardizing Metadata Associated with NIAID Genome Sequencing Center Projects Richard H. Scheuermann, Ph.D. Department of Pathology Division of Biomedical.
Avian Influenza – The Bird Flu
Introduction to Bioinformatics Richard H. Scheuermann, Ph.D. Director of Informatics JCVI.
January 25, Current and Future Database (CH)  Indexing vgd_common (JM; 1Q)  Fully implement Taxonomy tables (JO, DD; 2Q)  Allow subspecies-level.
Host cell responses to viral infection can be monitored by a variety of different high throughput experimental methodologies in order to understand the.
Bioinformatics Resource Centers Influenza Research Database (IRD) Virus Pathogen Database and Analysis Resource (ViPR) 8 December 2010 Richard.
A Genomic Survey of Polymorphism and Linkage Disequilibrium Imran Mohiuddin Magnus Nordborg, Ph.D. University of Southern California.
Biological Databases Chi-Cheng Lin, Ph.D. Associate Professor Department of Computer Science Winona State University – Rochester Center
Influenza A Virus Pandemic Prediction and Simulation Through the Modeling of Reassortment Matthew Ingham Integrated Sciences Program University of British.
Integrated Bioinformatics Data and Analysis Tools for Herpesviridae Viruses in the Virus Pathogen Resource (ViPR) Yun Zhang 1, Brett Pickett 1, Eva Sadat.
Towards Personal Genomics Tools for Navigating the Genome of an Individual Saul A. Kravitz J. Craig Venter Institute Rockville, MD Bio-IT World 2008.
Richard H. Scheuermann, Ph.D. Department of Pathology Division of Biomedical Informatics U.T. Southwestern Medical Center Standardizing Metadata Associated.
Evolution as a Confounding Factor in Genetic Association Studies 14 December 2011 Richard H. Scheuermann, Ph.D. Department of Pathology U.T. Southwestern.
ASCR Scientific Data Management Analysis & Visualization PI Meeting Exploration of Exascale In Situ Visualization and Analysis Approaches LANL: James Ahrens,
Standardizing Metadata Associated with NIAID Genome Sequencing Center Projects and their Implementation in NIAID Bioinformatics Resource Centers Richard.
Laboratory Training for Field Epidemiologists Typing May 2007 Sequencing and Phylogeny.
Databases and tools to study the genomes of hundreds of pathogens, plants, and mammals Richard H. Scheuermann, Ph.D. Director of Informatics J. Craig Venter.
Sequence Feature Variant Type and Evolutionary Trajectory Analysis using the Influenza Research Database (IRD) 19 July 2011 Richard H. Scheuermann,
Influenza Research Database (IRD): A Web-based Resource for Influenza Virus Data and Analysis Victoria Hunt 1 *, R. Burke Squires 1, Jyothi Noronha 1,
Erice 2008 Introduction to PDB Workshop From Molecules to Medicine: Integrating Crystallography in Drug Discovery Erice, 29 May - 8 June Peter Rose
Laboratory of Molecular Pathology Retreat - 10 MAR 2011
Data Mining in the Influenza Research Database (IRD) and the Virus Pathogen Resource (ViPR) JCVI-GSCID/NIAID Workshop University of Limpopo 01 June 2011.
Comparative Genomics in the Influenza Research Database 17 June 2011 Richard H. Scheuermann, Ph.D. Department of Pathology U.T. Southwestern.
1 Workshop on Infectious Disease Ontology Influenza Informatics in the BioHealthBase Bioinformatics Resource Center Richard H. Scheuermann, Ph.D. Department.
Richard H. Scheuermann, Ph.D. Department of Pathology, UT Southwestern March 30, 2011 Virus Bioinformatics Resource Centers – ViPR & IRD.
Influenza Research Database (IRD) 26 September 2011 Richard H. Scheuermann, Ph.D. Department of Pathology U.T. Southwestern Medical Center.
BioHealthBase: The Bioinformatics Resource Center for Francisella tularensis Shubhada Godbole 1, Stephen M. Beckstrom-Sternberg 2,3, Paul S. Keim 2,3,
Statistical Tool for Identifying Sequence Variations That Correlate with Virus Phenotypic Characteristics in the Virus Pathogen Resource (ViPR) July 22,
BioHealthBase: A Web-based Database and Analysis Resource for Francisella Shubhada Godbole 1, Jyothi Noronha 1, Burke Squires 1, Victoria Hunt 1, Ed Klem.
Conclusions and Future Work (301) Kamal Kumar, Valmik Desai, Li Cheng, Maxim Khitrov, Deepak Grover, Ravi Vijaya Satya,
Yun Zhang J. Craig Venter Institute San Diego, CA, USA August 4, 2012 Integrated Bioinformatics Data and Analysis Tools for Herpesviridae.
© 2005 Prentice Hall Inc. / A Pearson Education Company / Upper Saddle River, New Jersey “Bird flu”  Caused by avian influenza virus (AIV)  Endemic.
Statistical Tool for Identifying Sequence Variations that Correlate with Virus Phenotypic Characteristics in the Virus Pathogen Resource (ViPR) Brett E.
Swine Flu Presenter: Ali Azarashk. Overview Introduction Classification History Transmission Signs and Symptoms Treatment.
Richard H. Scheuermann, Ph.D. November 5, 2012 Support for Systems Biology Data in IRD/ViPR - Proteomics.
BIG Data: Knowledge for Improving Vaccine Virus Selection Richard H. Scheuermann, Ph.D. Director of Informatics JCVI.
Influenza Infectious Disease Ontology (Influenza-IDO) Status August 2010.
John R. LaMontagne Memorial Symposium on Pandemic Influenza Research April 4-5, 2005 Institute of Medicine Working Group One: Influenza Virulence and Antigenic.
Antigenic Shift v. Drift in Avian and Mammalian Sino- Influenza Type A Viruses. By Charles Hauser, St. Edward’s University Mark Maloney, Spelman College.
Molecular Dynamics of the Avian Influenza Virus Team Members: Ashvin Srivatsa, Michael Fu, Ellen Chuang, Ravi Sheth Team Leader: Yuan Zhang.
Introduction to Bioinformatics Dr. Rybarczyk, PhD University of North Carolina-Chapel Hill
Integration of Host Factor Data into the Virus Pathogen Database and Analysis Resource (ViPR) and the Influenza Research Database (IRD) Brett E. Pickett.
EGEE-II INFSO-RI Enabling Grids for E-sciencE WISDOM in EGEE-2, biomed meeting, 2006/04/28 WISDOM : Grid-enabled Virtual High Throughput.
BRC 2011 Session #4 – “Omics” Data. Session #4 - Outline Challenges and Opportunities  pathogen datasets; host datasets; integrating pathogen-host datasets.
The Informatics Crystal Ball: Mining the Past to Predict the Species Jump Event 19 April 2011 Richard H. Scheuermann, Ph.D. Department of.
Valentina Di Francesco Senior Program Officer for Bioinformatics, Structural Genomics and Systems Biology Microbial Genomics.
Richard H. Scheuermann, Ph.D. November 5, 2012 Support for Systems Biology Data in IRD/ViPR.
Viral Genomics: Strength in Numbers David Spiro Assistant Investigator J. Craig Venter Institute
3DM: Protein Super-family Platforms 3DM Protein super-family data integration Tom van den Bergh Bio-Prodict.
3DM: Protein engineering Super-family platforms Bio-Prodict DM super-family systems Henk-Jan Joosten Remko Kuipers Tom v/d Bergh Bas Vroling.
Genomic Analysis of Wetland Sediment as a Tool for Avian Influenza Virus Surveillance in Wild Waterfowl Chelsea Himsworth DVM, MVetSc, PhD, Dipl ACVP Leader,
Influenza Ontology Infectious Disease Ontology Workshop 2008 Burke Squires.
No reference available
Genome sequence of the dissimilatory metal ion–reducing bacterium Shewanella oneidensis Heidelberg, J. F., Paulsen, I. T., Nelson, K. E., Gaidos, E. J.,
__________________________________________________________________________________________________ Fall 2015GCBA 815 __________________________________________________________________________________________________.
Current Data And Future Analysis Thomas Wieland, Thomas Schwarzmayr and Tim M Strom Helmholtz Zentrum München Institute of Human Genetics Geneva, 16/04/12.
Hierarchy of Biological Complexity Interactions of machines (molecular and cellular dynamics) Macromolecular machines Proteins and nucleic acids Sequences.
High throughput biology data management and data intensive computing drivers George Michaels.
Milanesi Luciano Catania, Italy 13/03/2007 Bioinformatics challenges in European projects in Grid. Milanesi Luciano National Research Council Institute.
Daniel Janies, Ph.D. Carol Grotnes Belk Distinguished Professor of Bioinformatics and Genomics College of Computing and Informatics University of North.
U.S. Influenza Surveillance Sabrina Swenson, dvm ms phd
Bioinformatics Madina Bazarova. What is Bioinformatics? Bioinformatics is marriage between biology and computer. It is the use of computers for the acquisition,
Pathweavers Elizabeth McClellan Ribble, Ph.D.
A Web-based Interactive Genome Library for Surveillance, Detection, Characterization and Drug-Resistance Monitoring of Influenza Virus Infection in the.
1.1.3 MI.
Presentation transcript:

Sequence Variation Identification and Functional/Structural Inference in the Influenza Research Database (IRD) and Virus Pathogen Resource (ViPR) Yun Zhang J. Craig Venter Institute June 23, 2014

Challenges for Sequence Analysis Sequence Variation Identification Functional/Structural Inference Influenza virus A_NS1_nuclear-export-signal_137(11) Sequence Feature (SF) Curated from literature, public archives, direct submission 2,747 influenza SFs, 543 Dengue SFs, 301 HCV SFs, 296 Vaccinia SFs Clin Infect Dis. (2013) 57 (4): Manual analysis Data amount Subjective Novel analysis tool Statistical Genotype-phenotype correlation Buried in literature

Sequence Variation Analysis Workflow Search for sequences metadata/BLAST Run statistical analysis: Meta-CATS / SNP Verify results in sequence alignment Determine if positions of interest are located in Sequence Features Visualize positions of interest on protein structure Influenza A_ H3_ experimentally- determined- epitope_156(7)

Use Case: Influenza H7N9 Virus 2013 Influenza virus A H7N9 outbreak – H7 viruses have historically circulated in birds and horses – H7N9 human cases: 1 st human case reported in March 2013, 410 human cases as of April 2014, fatality rate 22% – Sequence variations involved in human adaptation?

Sequence Search – H7N9 HA & Similar Sequences SearchMeta-CATSAlignmentSequence FeaturesProtein Structure 2. HA sequences highly similar to a typical H7N9 human strain 1. H7N9 HA complete sequences

Meta-CATS Analysis – Grouping similar older H7 sequences H7N9 human HA sequences SearchMeta-CATSAlignmentSequence FeaturesProtein Structure H7N9 outbreak HA sequences vs. similar older H7 sequences

Meta-CATS Analysis Results SearchMeta-CATSAlignmentSequence FeaturesProtein Structure

Verify Results on Alignment SearchMeta-CATSAlignmentSequence FeaturesProtein Structure Older H7 avian strains H7N9 human strains 235L/I

Meta-CATS Analysis Results SearchMeta-CATSAlignmentSequence FeaturesProtein Structure

Variant Position Mapped to Sequence Features SearchMeta-CATSAlignmentSequence FeaturesProtein Structure 161 2

Variant Positions Visualized on Protein Structure SearchMeta-CATSAlignmentSequence FeaturesProtein Structure Variant position 235 Ligands 4BSC: H7N9 HA in Complex with 6'-SLN

Summary A novel sequence variation identification and functional/structural inference workflow

Acknowledgments NIAID HHSN C J. Craig Venter Institute Richard Scheuermann (PI) Brian Aevermann, M.S. Douglas Greer, Ph.D. Brett Pickett, Ph.D. Rick Stanton, M.S.E.E Lucy Stewart, MBA Yun Zhang, M.Sc. Vecna Chris Larsen, Ph.D. Al Ramsey, Ph.D. Guangyu Sun, Ph.D. LANL Catherine Macken, Ph.D. Mira Dimitrijevic Southern Methodist Univ Monnie McGee, Ph.D. Mengya Liu, Ph.D. Northrop Grumman Scott Stuart, Program Manager Ed Klem, Ph.D., Project Manager Zhiping Gu, Ph.D. Sherry He Wenjie Hua Wei Jen Sanjeev Kumar Xiaomei Li, Ph.D. Jason Lucas Bruce Quesenberry Barbara Rotchford Tom Smith, Ph.D. Hongbo Su, Ph.D. Bryan Walters Sam Zaremba, Ph.D. Hongtao Zhao, Ph.D. Liwei Zhou, Ph.D. NIAID / DMID Alison Yao, Ph.D., Contracting Officer Representative, Microbial Genomics & Advanced Technologies Andrei Gabrielian, Ph.D., Office of Cyber Infrastructure and Computational Biology Diane Post, Ph.D., CEIRS Project Officer