Centers of Excellence for Influenza Research and Surveillance 6 th Annual Meeting Aug 1, 2012 Status of IRD Development.

Slides:



Advertisements
Similar presentations
Gazetteer Application CODIST II, May 2011 Yoseph Mekasha Yoseph Mekasha E-Application Section.
Advertisements

Prof. Carolina Ruiz Computer Science Department Bioinformatics and Computational Biology Program WPI WELCOME TO BCB4003/CS4803 BCB503/CS583 BIOLOGICAL.
Systems Biology Data Dissemination Working Group 25FEB2015.
Introduction to Bioinformatics Richard H. Scheuermann, Ph.D. Director of Informatics JCVI.
January 25, Current and Future Database (CH)  Indexing vgd_common (JM; 1Q)  Fully implement Taxonomy tables (JO, DD; 2Q)  Allow subspecies-level.
Encyclopedia of Genetics, Genomics, Proteomics and Bioinformatics USC School of Medicine Library.
Bioinformatics at WSU Matt Settles Bioinformatics Core Washington State University Wednesday, April 23, 2008 WSU Linux User Group (LUG)‏
Information Retrieval in Practice
Introduction to Genomics, Bioinformatics & Proteomics Brian Rybarczyk, PhD PMABS Department of Biology University of North Carolina Chapel Hill.
August 29, 2002InforMax Confidential1 Vector PathBlazer Product Overview.
Influenza A Virus Pandemic Prediction and Simulation Through the Modeling of Reassortment Matthew Ingham Integrated Sciences Program University of British.
1 BrainWave Biosolutions Limited Accelerating Life Science Research through Technology.
We are developing a web database for plant comparative genomics, named Phytome, that, when complete, will integrate organismal phylogenies, genetic maps.
Modeling Functional Genomics Datasets CVM Lesson 1 13 June 2007Bindu Nanduri.
NCBI resources III: GEO and expression data analysis Yanbin Yin Fall
Midterm project Course: Statistics in Bioinformatics Date: 指導教授 : 陳光琦 學生 : 吳昱賢.
Improving Quality with the Substance Registry Services (SRS) John Harman U.S. EPA May 14, 2009.
Overview of Search Engines
DEMO CSE fall. What is GeneMANIA GeneMANIA finds other genes that are related to a set of input genes, using a very large set of functional.
1 FACS Data Management Workshop The Immunology Database and Analysis Portal (ImmPort) Perspective Bioinformatics Integration Support Contract (BISC) N01AI40076.
341: Introduction to Bioinformatics Dr. Natasa Przulj Deaprtment of Computing Imperial College London
Knowledge Science & Engineering Institute, Beijing Normal University, Analyzing Transcripts of Online Asynchronous.
EPIDEMIOLOGY AND PREVENTION OF INFLUENZA. Introduction Unique epidemiology: – Seasonal attack rates of 10% to 30% – Global epidemics Influenza viruses.
Gene expression services: ArrayExpress and the Gene Expression Atlas Contact: Gabriella Rustici, PhD Functional Genomics Team EBI-EMBL
Influenza Research Database (IRD): A Web-based Resource for Influenza Virus Data and Analysis Victoria Hunt 1 *, R. Burke Squires 1, Jyothi Noronha 1,
Training Course 2 User Module Training Course 3 Data Administration Module Session 1 Orientation Session 2 User Interface Session 3 Database Administration.
Erice 2008 Introduction to PDB Workshop From Molecules to Medicine: Integrating Crystallography in Drug Discovery Erice, 29 May - 8 June Peter Rose
1 Welcome to the GrameneMart Tutorial A tool for batch data sequence retrieval 1.Select a Gramene dataset to search against. 2.Add filters to the dataset.
Christian M Zmasek, PhD Burnham Institute for Medical Research Bioinformatics and Systems Biology
Introduction to Bioinformatics Spring 2002 Adapted from Irit Orr Course at WIS.
Statistical Tool for Identifying Sequence Variations That Correlate with Virus Phenotypic Characteristics in the Virus Pathogen Resource (ViPR) July 22,
NGS data analysis CCM Seminar series Michael Liang:
Web Apollo and the VectorBase user community Gloria I. Giraldo-Calderón March 31, 2015.
Copyright OpenHelix. No use or reproduction without express written consent1.
Statistical Tool for Identifying Sequence Variations that Correlate with Virus Phenotypic Characteristics in the Virus Pathogen Resource (ViPR) Brett E.
HOW DO VIRUSES CROSS THE SPECIES BARRIER? Rachel Rezabek.
Data Mining in Ensembl with BioMart Nov,
United Nations Economic Commission for Europe Statistical Division The Importance of Databases in the Dissemination Process Steven Vale, UNECE.
John R. LaMontagne Memorial Symposium on Pandemic Influenza Research April 4-5, 2005 Institute of Medicine Working Group One: Influenza Virulence and Antigenic.
Integration of Host Factor Data into the Virus Pathogen Database and Analysis Resource (ViPR) and the Influenza Research Database (IRD) Brett E. Pickett.
AdvancedBioinformatics Biostatistics & Medical Informatics 776 Computer Sciences 776 Spring 2002 Mark Craven Dept. of Biostatistics & Medical Informatics.
Analysis of GEO datasets using GEO2R Parthav Jailwala CCR Collaborative Bioinformatics Resource CCR/NCI/NIH.
Generic Database. What should a genome database do? Search Browse Collect Download results Multiple format Genome Browser Information Genomic Proteomic.
Variation data in VectorBase NIH/NIAID VectorBase site visit March 2015.
Copyright OpenHelix. No use or reproduction without express written consent1.
A collaborative tool for sequence annotation. Contact:
Bioinformatics and Computational Biology
Emerging Diseases Lecture 12: Influenza Virus and the 1918 Pandemic 12.1 Overview 12.2 The pathogen-Influenza Virus A 12.3: Naming System 12.4: A Disease.
PRO and the NIF / ImmPort Antibody Registries Alexander Diehl Protein Ontology Workshop 6/18/14.
Oracle Spatial Network Data Model Overview Oracle Life Sciences User Group Meeting Susie Stephens Life Sciences Product Manager Oracle Corporation.
Data Integration & Data Mining Tool Donald Dunbar BHF CoRE Bioinformatics Team Edinburgh Bioinformatics Meeting April 2013.
Influenza Ontology Infectious Disease Ontology Workshop 2008 Burke Squires.
Welcome to Gramene’s RiceCyc (Pathways) Tutorial RiceCyc allows biochemical pathways to be analyzed and visualized. This tutorial has been developed for.
Copyright OpenHelix. No use or reproduction without express written consent1.
Central hub for biological data UniProtKB/Swiss-Prot is a central hub for biological data: over 120 databases are cross-referenced (EMBL/DDBJ/GenBank,
Tutorial 8 Gene expression analysis 1. How to interpret an expression matrix Expression data DBs - GEO Clustering –Hierarchical clustering –K-means clustering.
Copyright OpenHelix. No use or reproduction without express written consent1.
The role of the National Agricultural Library in arthropod genomics research - implementing and developing tools for genomic data management Monica Poelchau.
Accessing and visualizing genomics data
Ontology Driven Data Collection for EuPathDB Jie Zheng, Omar Harb, Chris Stoeckert Center for Bioinformatics, University of Pennsylvania.
Welcome to the GrameneMart Tutorial A tool for batch data sequence retrieval 1.Select a Gramene dataset to search against. 2.Add filters to the dataset.
UNEP Live. What is UNEP Live? - An on-line knowledge management platform - Focuses on open access to global, regional and national data and knowledge.
National Cancer Institute Uma Mudunuri ABCC, NCI-Frederick ISRCE Monthly Meeting, Nov 9th 2010 bioDBnet The biological DataBase network.
Information Retrieval in Practice
Single-Stranded Positive-Sense RNA Single-Stranded Negative-Sense RNA
The IPT user interface and data quality tools
Over 1,000 books, journals, videos and reference material
Network biology An introduction to STRING and Cytoscape
Session 1: WELCOME AND INTRODUCTIONS
Welcome - webinar instructions
Presentation transcript:

Centers of Excellence for Influenza Research and Surveillance 6 th Annual Meeting Aug 1, 2012 Status of IRD Development

Session Topics Current CEIRS data in IRD Surveillance Serology Immunology & ImmPort IRD enhancements over past year Search improvements Surveillance data from map Support for serology data 3D movies Phylogenetic tree decoration Metadata-driven comparative genomics analysis Sequence feature submission tool Host factor data Publications Plans for future development

Current CEIRS Data in IRD

CEIRS Surveillance Samples 94% avian 5.8% non-human mam. 0.2% human

Surveillance Sample Stats Avian RecordsAvian % Non-Human Mammalian RecordsMammalian % Total197,20714,098 Tested175,74689%12,97392% Flu-positive10,1365.8%5103.9% Linked to sequence7727.6%112.2% *as of May 1, 2012

Serology Samples Species categorySubmission YearSample Count Avian Avian Human Non Human Mammalian Non Human Mammalian Non Human Mammalian TOTAL2675

Influenza Serology Data

CEIRS Immunology Data in IRD

Introduction to ImmPort Immunology Database and Analysis Portal (ImmPort) – Bioinformatics Integration Support Contract (BISC) Purpose – Warehouse for storing immunology experiment data – Integrate data with analysis and visualization tools – Provide access to research community Projects – Population Genetics Analysis Program – HLA Region Genetics in Immune-mediated Diseases – Modeling Immunity for Biodefense – Others

Additional ImmPort Capabilities Integrate data from multiple resources – OMIM, GO, synonyms, protein-protein interactions, etc. Suite of data analysis and visualization tools – Microarray – Flow Cytometry – Other “-omics” platforms

IRD Enhancements Over Past Year

Sequence Search Page Enhancements

Quick Text Search

Surveillance Data from Map

Spinning 3D Protein Structure Movie

Phylogenetic Tree Decoration Decorate by: – Host species Avian: Avian grouped/separated – Country – Year – HA subtype – NA subtype – HA & NA subtype – Geographic region – Flu season – SFVT Manual decoration

Metadata-driven Comparative Analysis Tool

Sequence Feature Variant Type (SFVT)

Sequence Feature Submission Tool

DMID Systems Biology Program

Host Factor Data

IRD/ViPR Publications 2012

Future Development Plans

User Support and Outreach Data – Evaluate feasibility of supporting Antigenic Cartography – Prepare packages of (correctly-formatted) data to export to external tools Outreach – Perform on-site outreach at CEIRS centers – Continue developing tutorials for existing tools & features

Search Query Capabilities – Ability to search for high-path and/or low-path strains (using sequence biomarkers)

Comparative Genomics Develop PCR Primer design tool (exclude orthologs) Increase SF definitions for: virulence, host specificity, replication, etc. Provide a new tool to assign (or convert between) sequence coordinate schemes

Annotation and Host Factor Data Ensure sequence submissions are appropriately prepared (i.e. no primer sequence, etc.) Increase number of host factor datasets Develop method to handle different statistical methods from various “-omics” platforms (e.g. microarray, proteomics, etc.)

Surveillance Identify NIAID-funded human surveillance studies and solicit deposition into IRD Develop additional use-cases to identify additional helpful data types

Immunology Epitopes – Add search options such as: CD4, CD8, host Serology – Solicit feedback from community on use-cases – Identify volunteers for data submission

PA-X Prediction for All Strains Build on analysis performed earlier this year by Jagger et al. Science 2012 Jul 13;337(6091): – Identified new protein on segment 3 using ~1000 sequences Frameshift occurs at codon 190 in PA protein, results in new C-terminus IRD will extend this analysis across all segment 3’s in resource – Add PA-X annotation to existing IRD sequence records – Allow users to search for PA-X protein sequences – Provide data that can assist in downstream comparative genomics analyses

H5 clade annotation tool Automated clade determination for any query HA sequence Match WHO clade definitions

NGS Deep Sequencing Data Primary data in SRA Derived data in IRD – Positions with sequence variation – Proportion of read with a particular sequence variation Metadata to understand the context