VectorBase A Resource Centre for Invertebrate Hosts of Human Pathogens Bob MacCallum Imperial College London.

Slides:



Advertisements
Similar presentations
© 2002 The MITRE Corporation. ALL RIGHTS RESERVED. Co-Chair: Alexander Yeh, MITRE Corp. Data: FlyBase ( July 2002 KDD Cup 2002 Task1:
Advertisements

Genome databases and webtools for genome analysis Become familiar with microbial genome databases Use some of the tools useful for analyzing genome Visit.
Provenance in a Collaborative Bio-database RAASWiki Donald Dunbar & Jon Manning Queen’s Medical Research Institute University of Edinburgh Use Cases for.
The National Center for Biotechnology Information (NCBI) a primary resource for molecular biology information Database Resources.
Welcome to mini-symposium on ontologies for biological sample description EMBL-EBI Wellcome Trust Genome Campus Deceber 5, 2001.
Differential insertion of transposable elements in Anopheles gambiae M & S genomes Jenica L. Abrudan, Ryan C. Kennedy, Maria F. Unger, Michael R. Olson,
The design, construction and use of software tools to generate, store, annotate, access and analyse data and information relating to Molecular Biology.
Bioinformatics for biomedicine Summary and conclusions. Further analysis of a favorite gene Lecture 8, Per Kraulis
1 Enriching UK PubMed Central SPIDER launch meeting, Wolfson College, Oxford Paul Davey, UK PubMed Central Engagement Manager.
Archives and Information Retrieval
BRC6 28 th October 2008 Collective annotation of the Ixodes scapularis genome: VectorBase, MSCs and the tick community. Daniel Lawson, VectorBase.
Genome Browsers Carsten O. Daub Omics Science Center RIKEN, Japan May 2008.
NIAID Bioinformatics Resource Centers Valentina Di Francesco Bioinformatics Program Director Microbial Genomics Program, DMID.
Genome Related Biological Databases. Content DNA Sequence databases Protein databases Gene prediction Accession numbers NCBI website Ensembl website.
Specie: Anopheles gambiae PEST Genome size: 260 Mb Status: 3rd assembly and annotation NIAID funded.
VectorBase BRC VectorBase annotation metrics Daniel Lawson VectorBase-EBI, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton.
We are developing a web database for plant comparative genomics, named Phytome, that, when complete, will integrate organismal phylogenies, genetic maps.
ABSTRACT We have conducted an extensive computational analysis of the Culex quinquefasciatus genome to find and annotate a specific subfamily of the TEs:
Genome database & information system for Daphnia Don Gilbert, October 2002 Talk doc at
The Ensembl Gene set The “Genebuild” 21 April 2008.
Databases in Bioinformatics and Systems Biology Carsten O. Daub Omics Science Center RIKEN, Japan May 2008.
VectorBase Seth Redmond Imperial College, London
Abstract Although transposable elements (TEs) were discovered over 50 years ago, the robust discovery of them in newly sequenced genomes remains a difficult.
Data Curation and Management activities within the UCT Computational Biology Group Dr Nicky Mulder.
Gene Expression Omnibus (GEO)
Taverna and my Grid Basic overview and Introduction Tom Oinn
EBI is an Outstation of the European Molecular Biology Laboratory. Bert Overduin Daniel Rios Stephen Fitzgerald Edinburgh, 24 & 25 February 2009 Ensembl.
Annotation of Anopheline Genomes at VectorBase Dan Lawson, VectorBase & The Anopheles Genomes Cluster Consortium EMBL-EBI.
The new VectorBase: our improved resource for invertebrate vectors Scott Emrich On behalf of VectorBase “bigger, better, faster” Or “ "consolidate, improve.
Taverna and my Grid Open Workflow for Life Sciences Tom Oinn
VectorBase Gene expression data in VectorBase Fotis Kafatos, George Christophides, Bob MacCallum & Seth Redmond Imperial College London (thanks also to.
NCBI Vector-Parasite Genomic Related Databases Chuong Huynh NIH/NLM/NCBI Sao Paulo, Brasil July 12, 2004
Web Apollo and the VectorBase user community Gloria I. Giraldo-Calderón March 31, 2015.
GMOD: Managing Genomic Data from Emerging Model Organisms Dave Clements 1, Hilmar Lapp 1, Brian Osborne 2, Todd J. Vision 1 1 National Evolutionary Synthesis.
Browsing the Genome Using Genome Browsers to Visualize and Mine Data.
EMBL-EBI EMBL-EBI EMBL-EBI What is the EBI's particular niche? Provides Core Biomolecular Resources in Europe –Nucleotide; genome, protein sequences,
Web Databases for Drosophila Introduction to FlyBase and Ensembl Database Wilson Leung6/06.
VectorBase BRC The evolving VectorBase gene build: mixing automated and manual approaches when annotating vector genomes Daniel Lawson VectorBase-EBI,
Gramene Objectives Provide researchers working on grasses and plants in general with a bird’s eye view of the grass genomes and their organization. Work.
Vectorbase and Galaxy Jarek Nabrzyski On behalf of VectorBase Center for Research Computing University of Notre Dame VectorBase Bioinformatics Resource.
A plant-specific annotation and submission tool for the incorporation of Arabidopsis gene expression data into ArrayExpress, the EBI’s public DNA microarray.
Alastair Kerr, Ph.D. WTCCB Bioinformatics Core An introduction to DNA and Protein Sequence Databases.
2009 GMOD Meeting Dhileep Sivam & Isabelle Phan Seattle Biomedical Research Institute.
VectorBase BRC Overview Scott Emrich BRC 2011 – Annual Meeting UT Southwestern Medical Center Dallas, TX September 2011.
Importing Community annotations into VectorBase. Aims Provide the VectorBase community with tools for improving genome annotation. Must have low entry.
Building WormBase database(s). SAB 2008 Wellcome Trust Sanger Insitute Cold Spring Harbor Laboratory California Institute of Technology ● RNAi ● Microarray.
BIOLOGICAL DATABASES. BIOLOGICAL DATA Bioinformatics is the science of Storing, Extracting, Organizing, Analyzing, and Interpreting information in biological.
2009 IADR, MIAMI, FL, USA Hands-on Experience for using the Human Oral Microbiome Database (HOMD) 2009 IADR Workshop, Miami, FL, USA Tsute (George) Chen.
VectorBase Kolymbari Meeting July 2011 new genomes new features and future plans Daniel Lawson (on behalf of VectorBase)
Map-based Exploration of Population Biology Data in VectorBase What is VectorBase? We are a consortium of institutions that hosts the genomes of invertebrate.
Variation data in VectorBase NIH/NIAID VectorBase site visit March 2015.
Introduction to the Gene Ontology GO Workshop 3-6 August 2010.
Bioinformatics and Computational Biology
Overview and History of VectorBase Frank Collins March 31, 2015.
EBI is an Outstation of the European Molecular Biology Laboratory. Gautier Koscielny VectorBase Meeting 08 Feburary 2012, EBI VectorBase Text Search Engine.
VectorBase’s Population Biology Resources and How to Submit to Them Bob MacCallum Imperial College, London, UK July 16, 2013.
Applied Bioinformatics Week 9 Jens Allmer. Theory I Gene Expression Microarray.
GeWorkbench Overview Support Team Molecular Analysis Tools Knowledge Center Columbia University and The Broad Institute of MIT and Harvard.
Bioinformatics Workshops 1 & 2 1. use of public database/search sites - range of data and access methods - interpretation of search results - understanding.
A guided tour of Ensembl This quick tour will give you an outline view of what Ensembl is all about. You will learn: –Why we need Ensembl –What is in the.
An Introduction to NCBI & BLAST National Center for Biotechnology Information Richard Johnston Pasadena City College.
1 of 28 Evaluating Genes and Transcripts (“Genebuild”)
Ontology Driven Data Collection for EuPathDB Jie Zheng, Omar Harb, Chris Stoeckert Center for Bioinformatics, University of Pennsylvania.
NCBI: something old, something new. What is NCBI? Create automated systems for knowledge about molecular biology, biochemistry, and genetics. Perform.
The Bovine Genome Database Abstract The Bovine Genome Database (BGD, facilitates the integration of bovine genomic data. BGD is.
Introduction to Genes and Genomes with Ensembl
VectorBase genome annotation
Functional Annotation of the Horse Genome
Ensembl Genome Repository.
TAMU Bovine QTL db and viewer
Presentation transcript:

VectorBase A Resource Centre for Invertebrate Hosts of Human Pathogens Bob MacCallum Imperial College London

VectorBase Outline Introduction to VectorBase Two important recent developments: –Community Annotations –Gene Expression Data

VectorBase What is VectorBase? Aim  Genomic bioinformatics resource for invertebrate vectors of human pathogens  Data hub for community Funding  US NIAID (National Institute for Allergy and Infectious Diseases)  via its Bioinformatics Resource Centre (BRC) program

VectorBase Why VectorBase? Sequencing initiatives do not include “after-care” Ensembl had no long-term plans for insects

VectorBase Main VectorBase activities –Browse, search & download genomic data Genome annotation –Automatic & manual Functional genomics Ontologies Training/outreach/consultancy

VectorBase Invertebrate vectors SpeciesDiseaseStatusFunder Aedes aegyptiYellow fever Dengue fever Complete†NIAID Anopheles gambiae PESTMalariaComplete†- Anopheles gambiae M & S formMalariaAssembledNHGRI Culex pipiens quinquefasciatusLymphatic filariasisComplete†NIAID Glossina morsitans morsitansSleeping sicknessInitiatedWellcome Trust Ixodes scapularisLyme diseaseDraft gene setNIAID Lutzomyia longipalpisLeishmaniaPlannedNHGRI/Wellcome Trust Pediculus humanusTyphusDraft gene setNHGRI Phlebotomus papatasiLeishmaniaPlannedNHGRI/Wellcome Trust Rhodnius prolixusChagas diseaseInitiatedNHGRI

VectorBase Who is VectorBase? US UK GR

VectorBase Notre Dame PIs Frank Collins, Dave Severson, Greg Madey, Nora Besansky Tasks project coordination core website development community annotation pipeline Aedes and Anopheles community reps.

VectorBase EBI (European Bioinformatics Institute) PI Ewan Birney Tasks “automated” genome annotation comparative genomics Genbank submissions genome browser technology

VectorBase IMBB, Crete PI Kitsos Louis Tasks ontologies for anatomy, insecticide resistance, biological processes population genetics

VectorBase Harvard PI Bill Gelbart Tasks manual annotation

VectorBase Imperial College, London PIs George Christophides, Fotis Kafatos Tasks functional genomics: gene expression, RNAi phenotypes

VectorBase UC Riverside PI Peter Atkinson Tasks Culex pipiens

VectorBase Purdue University PI Catherine Hill Tasks Ixodes scapularis

VectorBase A quick tour of VectorBase Blast Genome browser Search engine BioMart Downloads

VectorBase VectorBase genome browser

VectorBase VectorBase genome browser

VectorBase Genome annotation cycle Automatic gene build Assembly Community annotations Manual annotations Other genomes, gene sets Repeat library (TEs etc) ESTs, cDNAs Protein domains

VectorBase Manual annotation Flybase team (Kathy Campbell) Anopheles 2L completed Sep 2006 Anopheles 2R completed Sep 2007 Anopheles X completed Feb Culex genes completed July 2008 Three mosquitoes better than one

VectorBase Community annotation Expertise from around world Gene models, symbols, literature, function Need system to track contributions Incorporated in gene build updates Credit sources  Community Annotation Pipeline (CAP)

VectorBase CAP: gene model submission Gene symbol Gene description mRNA sequence Translation start Translation stop Determination method GO IDs PubMed IDs Excel spreadsheet

VectorBase

CAP: what happens next Transcript aligned to genome Gene model constructed Reviewed by community representative

VectorBase

CAP: other annotations Publications CV/ontology terms Free text comment* (* unmoderated)

VectorBase

Expression data Many microarray technologies Many experimental designs Large amount of information Many ways to do analysis

VectorBase Microarray repositories Widely adopted standard: MIAME GEO (NCBI) & ArrayExpress (EBI) Repository ≠ Useful data Curation backlog at central repositories VectorBase data is manageable We manage and curate

VectorBase Microarray pipeline at VB WhatWhere Alignments & gene assignmentsEnsembl-style database Microarray data, raw & processedBASE Statistics and web interfaceVB’s GESOL API

VectorBase Web interface PPO*

VectorBase

Overall picture of expression

VectorBase

Genome browser integration

VectorBase Help & Documentation

VectorBase No time today for… Averaging over multiple reporters Ambiguous reporters List of microarray experiments in VB Community microarray data submission Expert analysis & collaboration Future developments

VectorBase

VectorBase’s future directions More genomes & sequencing Population biology, association studies More community involvement in genome annotation Enhanced functional genomics resources

VectorBase Acknowledgements VB team IC PIs VB SWG NIAID Community Organisers Audience