Dan Masiga Molecular Biology and Biotechnology Department International Centre of Insect Physiology and Ecology, Nairobi, Kenya BARCODE Data Standard The.

Slides:



Advertisements
Similar presentations
International Barcode Of Life Initiative
Advertisements

How to publish genomic Data papers based on BOL data - Biodiversity Data Journal Lyubomir Penev Bulgarian Academy of Sciences & Pensoft Publishers ViBRANT.
- Unit VI - IAS Specimen Collection, Identification, and Preservation Rebecca M. Westbrooks Randy G. Westbrooks Steven Manning Global Invasive Species.
Connect.barcodeoflife.org. Promote barcoding as a global standard Build participation Working Groups BARCODE standard International Conferences.
BARCODING LIFE, ILLUSTRATED Goals, Rationale, Results ppt v1
Publish or perish? Linking Scratchpads and the new Biodiversity Data Journal for streamlining publication of botanical data D.N Koureas 1, L. Penev 2 &
Robert Hanner, PhD Database Working Group Chair, CBOL Global Campaign Coordinator, FISH-BOL Associate Director, Canadian Barcode of Life Network Biodiversity.
Creating NCBI The late Senator Claude Pepper recognized the importance of computerized information processing methods for the conduct of biomedical research.
1.
The National Center for Biotechnology Information (NCBI) a primary resource for molecular biology information Database Resources.
Metabarcoding 16S RNA targeted sequencing
Integrated Taxonomic Information System Janet Gomon, Deputy Director, ITIS Smithsonian Institution Museum of Natural History The.
Scratchpads Publishing biodiversity: The interplay between Scratchpads and the Biodiversity Data Journal Dr Dimitrios Koureas Biodiversity Informatics.
Catalogue of Life, Reading, UK, 29 March 2007 Consortium for the Barcode of Life (CBOL): Linking Molecules to the Catalogue of Life David E. Schindel,
DNA Barcodes: Linking GenBank records to Museum Specimens David E. Schindel, Executive Secretary, CBOL Robert Hanner, University of Guelph.
How to use the web for bioinformatics Molecular Technologies February 11, 2005 Ethan Strauss X 1373
Data Analysis Working Group, DIMACS, 26 Sept 2005 DNA Barcoding and the Consortium for the Barcode of Life David E. Schindel, Executive Secretary National.
Biological Databases Chi-Cheng Lin, Ph.D. Associate Professor Department of Computer Science Winona State University – Rochester Center
CBOL, DNA Barcoding and Long-Term Ecological Studies David E. Schindel, Executive Secretary National Museum of Natural History Smithsonian Institution.
BioBarcode: a general DNA barcoding database and server platform for Asian biodiversity resources Jeongheui Lim Korean BioInformation Center Korea Research.
DNA BARCODING CHILLIES BIO-NERDS : Say Wah Yugraj Singh Tanja Obradovic Jenny Pham Lovita Bharossa Buai Chuol Diana Corzo.
Consortium for the Barcode of Life A rapid, cost-effective system for species identification David E. Schindel, Executive Secretary National Museum of.
Moving forward our shared data agenda: a view from the publishing industry ICSTI, March 2012.
Richard White Biodiversity Data. Outline Biodiversity: what is it? – Definitions: is biodiversity: A resource? Something which can be measured? How to.
The Encyclopedia of Life: A Web Site for Every Species James Edwards Executive Director, EOL Barcode of Life Conference Taipei 20 September 2007.
Scratchpads Publication Module - A paradigm shift in publishing RBG Kew, Seminar,
X-ray crystallography NMR cryoEM Experimental approaches for structural biology.
DNA Barcoding Amy Driskell Laboratories of Analytical Biology
Scott Miller – SANBI, 7 April 2006 Overview of DNA Barcoding and the Barcode of Life Initiative Scott E. Miller, Chair, CBOL Executive Committee National.
Census of Marine Life, Amsterdam – 16 May 2006 The Protocol Chain for DNA Barcoding Projects.
CToL Workshop Grahamstown, November 2008 Biomaterials collections and curation in Africa Gavin Gouws & Unathi Lwana South African Institute for Aquatic.
BARCODE OF LIFE DATA SYSTEMS (BOLD) Riadul Mannan Biodiversity Institute of Ontario.
ABBI/FISH-BOL Neotropical Working Group Meeting 14 March 2007
South/Central America Regional Meeting, Campinas, Brazil, 19 March 2007 Overview of Consortium for the Barcode of Life (CBOL) David E. Schindel, Executive.
Freek T. Bakker Nationaal Herbarium Nederland Wageningen University branch DNA barcoding: the CBOL perspective.
Utah State University – 29 Nov 2006 DNA Barcoding: An Emerging Global Standard for Species Identification Consortium for the Barcode of Life National Museum.
From the World Directory of Collections of Cultures of Microorganisms towards Mash-up of Biodiversity and Sequence Databases Director, WFCC-MIRCEN World.
Progress since the February 2005 London DNA Barcode of Life Conference Scott Miller, Chair Consortium for the Barcode of Life Smithsonian Institution.
Aspects for Improving the ABBI Patricia Escalante Instituto de Biología UNAM AOU-Collections Committee member.
Richard White Biodiversity Informatics. What is biodiversity informatics? The preceding project, among others, shows that the challenges facing biodiversity.
Muthama Muasya University of Cape Town Application of DNA barcoding in plant taxonomy, Eastern Africa Experience.
TDWG 2006, Missouri, U.S.A. Exchange of germplasm datasets with PyWrapper/BioCASE October 16, 2006 TDWG annual Meeting 2006 Missouri Botanical Garden St.
Scratchpads The virtual research environment for biodiversity data Simon Rycroft, Dave Roberts, Vince Smith, Alice Heaton, Katherine Bouton, Laurence Livermore,
Consortium for the Barcode of Life
CBoL Taipei, september 2007 BARCODE DATA, MUSEUM CATALOGS AND GBIF Simon Tillier.
DNA TECHNOLOGY AND BIOTECHNOLOGY PAGES Chapter 10.
National Science Foundation – 7 February 2006 Consortium for the Barcode of Life (CBOL) David E. Schindel, Executive Secretary National Museum of Natural.
Eastern Africa Regional Meeting, Nairobi, 18 October 2006 DNA Barcoding and the Consortium for the Barcode of Life (CBOL) Status in 2006, Ambitions for.
Global Biodiversity Information Facility GLOBAL BIODIVERSITY INFORMATION FACILITY DNA Barcoding in Southern Africa Cape Town 7 April
South/Central America Regional Meeting, Campinas, Brazil, 19 March 2007 CBOL Working Groups David E. Schindel, Executive Secretary National Museum of Natural.
DNA Barcoding and the Consortium for the Barcode of Life Katie Ferrell, Project Manager National Museum of Natural History Smithsonian Institution
Overview PlantCollections – Publish information about public garden collections – Using existing infrastructure Morphbank – Goals and capabilities of.
Western and Central Africa: DNA barcoding Meeting One-day course on DNA barcoding: Practical advice Date: 25 October 2008 East African experience and lessons.
Don’t make me think Biodiversity Data Publishing Made Easy Laurence Livermore, Vince Smith, Alice Heaton, Simon Rycroft, Ed Baker, Ben Scott & Lyubomir.
Tephritid Barcoding Initiative
Linking Barcode Data to Multiple Users David E. Schindel, Executive Secretary National Museum of Natural History Smithsonian Institution
Andrew Polaszek Executive Secretary, ICZN, c/o Natural History Museum, London UK
Scratchpads and the new Biodiversity Data Journal Biodiversity Data Publishing made… easier Dimitris Koureas Natural History Museum London.
Primary vs. Secondary Databases Primary databases are repositories of “raw” data. These are also referred to as archival databases. -This is one of the.
The BARCODE Data Standard: CBOL’s Partnership with the International Nucleotide Sequence Database Collaboration (INSDC) David E. Schindel, Executive Secretary.
HISCOM An Australian Virtual Herbarium Jim Croft Australian National Herbarium.
ISI Web of Knowledge update: October What’s New? Conference Proceedings Citation Indexes now in Web of Science –Two editions – Science and Social.
Dan Rosauer Research School of Biology Australian National University Citing data in biogeography: The Atlas of Living Australia.
An Introduction to NCBI & BLAST National Center for Biotechnology Information Richard Johnston Pasadena City College.
GENBANK FILE FORMAT LOCUS –LOCUS NAME Is usually the first letter of the genus and species name, followed by the accession number –SEQUENCE LENGTH Number.
JCU Australian Marine Science Data Network.
Barcode sequences at GenBank
What is Bioinformatics?
Introduction to Bioinformatics
Introduction and background information
Presentation transcript:

Dan Masiga Molecular Biology and Biotechnology Department International Centre of Insect Physiology and Ecology, Nairobi, Kenya BARCODE Data Standard The BARCODE Data Standard: Enabling Molecular Diagnostics for Biodivesity Western and Central Africa: DNA barcoding Meeting One-day course on DNA barcoding: Practical advice 23rd October 2008

New partners

The Infrastructure of Taxonomy Collections and databases of specimens Codes of Taxonomic Nomenclature Compilations of taxonomic names Data repositories (characters, gene sequences, images, trees) Monographs Floristic and faunistic surveys/inventories Revisions The (undigitized) Taxonomic Literature

International Nucleotide Sequence Database Collaboration

Roles of INSDC an archival database/repository for nucleotide sequence Output of Project A Output of Project B Output of Project C Common access interface Standardization of data structure including data items and values Assignment of a unique identifier (an accession number) to a sequence Users

New tools for taxonomy DNA Barcoding The ability to compare genotype information across a huge range of organisms is a powerful tool

“Only [27%] of papers had a legitimate specimens examined section, with museum numbers for each voucher, and names of the museums where the specimens used in the study could be examined ”

DNA Sequence” Couplets Consisting of: “Species Name - DNA Sequence” Basis of a “look-up table” enabling molecular diagnostic applications However, both elements need validation Underlying specimens and associated raw sequence data are not typically available for secondary inspection

Problem Areas TRANSPARENCY AND TRACEABILITY Genetic Data Quality Specimen Data Quality Taxonomy Access to Information

Paradigm Shift Barcoders began calling for a Paradigm Shift Depositing barcode sequences in public database, along with primer sequences, trace files and associated quality scores makes this species identification technique widely accessible. Reference DNA barcode sequences should be derived from, and liked to, specimens of known promenance in web-accessible collections in order to validate this system of molecular diagnostics.

Rationale for Defining “BARCODE” keyword in GenBank Provides the community with reference records with verifiable and retrievable data: –Associated with retrievable voucher specimens (liberally defined: tissue, DNA, etc.) –Linked to on-line metadata –Meet an agreed upon standard of taxonomic identification –Provide an assured level of data completeness –On an agreed upon gene region –Recommended for use in identifying unknowns

Barcode Data Standard The Barcode Data Standard Establishing a new data standard for “BARCODE” keyword records in DDBJ/EMBL/GenBank: 1.Minimum 500bp, <1% ambiguous base calls 2.Double stranded sequence 3.Trace files and associated quality scores 4.Primers used to generate sequence 5.Linkages to: 1.A morphological voucher specimen 2.Structured reference to collections 3.Geospatial reference information 4.Valid species name 5.Who performed the identification 6.Literature citations

Features, Qualifiers and Values The Feature table is updated based on discussions at the International Collaborators meeting of INSDC

NCBI Trace Archive accepts BARCODE as a keyword that identifies “a DNA sequence analysis of a uniform target gene to enable species identification”

Triplet structure for specimen identifiers /specimen_voucher=“ | | ” - abbreviation of the archiving institution - collection within the institution (*) - specimen identifier within the collection The above approach is used in the DarwinCore/GBIF and is parallel to the Life Science Identifier (LSID) that is an Object Management Group (OMG) standard. (*) museums & herbaria culture collections stock centers germplasm repositories (seed banks) frozen tissue banks zoos/aquaria/botanical gardens DNA banks, personal collections e-voucher archives

Link from GenBank to Museums

Process Record

acknowledgments Lee Weight, Smithsonian Institution Scott Miller, PI CBOL David Schindel, Executive Secretary, CBOL Sujeevan Ratnasingham, Biodiversity Institute of Ontario (BIO)/BOLD Robert Hanner (BIO) Organizers: Western and Central Africa DNA barcoding Meeting (NABDA & CBOL Secretariat)