Linking biodiversity data with the Biological Collections Ontology

Presentation transcript:

Linking biodiversity data with the Biological Collections Ontology
Ramona Walls (iPlant Collaborative, University of Arizona)
John Deck (University of California at Berkeley)
Robert Guralnick (University of Colorado at Boulder)
John Wieczorek (University of California at Berkeley)

What it means to be an OBO Foundry Ontology
Shared commitment to creating a suite of interoperable ontologies that span the biological and biomedical domains
– non-redundancy
– re-use of existing terms
Adherence to OBO Foundry principles, including:
– open access, willingness to collaborate
– shared formats, relations, URIs, naming conventions
– good documentation, single locus of authority
Access to OBO Foundry community resources
– tools
– expertise

Scope of the BCO:
Environmental samples [diagram labels: transect, depth, sample collection point, water sample at depth X, aliquot, metagenome]
Collections of organisms and their parts (museum or voucher specimens)
Surveys, ecological observations [diagram labels: plot, sub-plot, transect (within plot), individual (within plot), individual (within sub-plot)]

Initial focus of BCO: tracking materials and data through sampling chains
[Diagram labels: Moorea Biocode bioinventory event; Museum specimens; Tissue sample at Smithsonian Institution; Gut sample; Metagenomic sequences at CAMERA portal; Genbank sequence; Digital image stored on Morphbank; identification]

[Diagram: example sampling chain starting from an insect specimen]
KEY (relations): subclass of; has specified output; has specified input; instance of; derives from
Ontology terms: BCO:material sampling process; BCO:identification process; BCO:material sample; BCO:taxonomic name; OBI:sequencing assay; OBI:sequence data; rdfs:Class
Other node labels: Insect specimen; Biocode Sampling; Tissue sampling; Tissue sample; DNA extraction; DNA molecules; Sequencing; Genbank sequence B; Identification using key; Identification using BLAST; TaxonID A; TaxonID B

Example data: processes

| Investigation | Study | process ID | process type | has input | has output | date |
|---|---|---|---|---|---|---|
| Moorea Biocode Project | | Moorea Biocode project | planned process | | | |
| Moorea Biocode Project | | Moorea insect inventory | planned process | | | |
| Moorea Biocode Project | Moorea insect inventory | insect collection 01 | material sampling process | | insect | |
| Moorea Biocode Project | Moorea insect inventory | tissue sampling 01 | material sampling process | insect 01 | tissue sample | |
| Moorea Biocode Project | Moorea insect inventory | insect gut sampling 04 | material sampling process | insect 01 | insect gut sample | |
| Moorea Biocode Project | Moorea insect inventory | insect gut sampling 05 | material sampling process | insect 02 | insect gut sample | |
| Moorea Biocode Project | Moorea insect inventory | dna isolation 01 | material sampling process | tissue sample 01 | DNA sample | |
| Moorea Biocode Project | Moorea insect inventory | dna isolation 04 | material sampling process | insect gut sample 04 | DNA sample | |
| Moorea Biocode Project | Moorea insect inventory | insect observation 06 | observing process | insect in situ 06 | image | |
| Moorea Biocode Project | Moorea insect inventory | identification 01.1 | tax. iden. by morph. key | insect 01 | insect taxon | |
| Moorea Biocode Project | Moorea insect inventory | identification 01.2 | tax. iden. using dna barcode | dna isolation 01 | insect taxon | |
| Moorea Biocode Project | Moorea insect inventory | identification 04.1 | tax. iden. using BLAST | dna isolation 04 | microbial taxon | |
| Moorea Biocode Project | Moorea insect inventory | identification 06.1 | morph. tax. identification | image 01 | insect taxon | |
| Moorea Biocode Project | Moorea insect inventory | identification 07.1 | morph. tax. identification | image 02 | insect taxon | |

Example data: material entities and information artifacts

| Individual | Type | Inferred type |
|---|---|---|
| insect 01 | organism or virus or viroid | material sample |
| insect 02 | organism or virus or viroid | material sample |
| tissue sample 01 | organism part | material sample |
| tissue sample 02 | organism part | material sample |
| insect gut sample 04 | material entity | material sample |
| DNA sample 01 | DNA | material sample |
| DNA sample 02 | DNA | material sample |
| insect in situ 06 | organism or virus or viroid | material target of observation |
| image 01 | photographic image | information artifact |
| insect taxon 01 | taxonomic name | information artifact |
| insect taxon 02 | taxonomic name | information artifact |
| microbial taxon 01 | taxonomic name | information artifact |
| microbial taxon 02 | taxonomic name | information artifact |
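The "Inferred type" column can be illustrated with a small Python sketch. It assumes two rules that the slides imply but do not spell out: anything that is the output of a material sampling process is a material sample, and anything that is the input of an observing process is a material target of observation. The `Process` record, the `inferred_types` function, and the numbered output identifiers are hypothetical names chosen for this example, not part of the BCO itself.

```python
from typing import NamedTuple, Optional


class Process(NamedTuple):
    """One row of the example process table (hypothetical structure)."""
    process_id: str
    process_type: str
    has_input: Optional[str]
    has_output: Optional[str]


# A few rows modelled on the example data; output identifiers are illustrative.
PROCESSES = [
    Process("insect collection 01", "material sampling process", None, "insect 01"),
    Process("tissue sampling 01", "material sampling process", "insect 01", "tissue sample 01"),
    Process("dna isolation 01", "material sampling process", "tissue sample 01", "DNA sample 01"),
    Process("insect observation 06", "observing process", "insect in situ 06", "image 01"),
]


def inferred_types(processes):
    """Infer a type for each individual from the role it plays in a process.

    Assumed rules (not stated verbatim in the slides):
    - output of a material sampling process -> material sample
    - input of an observing process -> material target of observation
    """
    inferred = {}
    for p in processes:
        if p.process_type == "material sampling process" and p.has_output:
            inferred[p.has_output] = "material sample"
        if p.process_type == "observing process" and p.has_input:
            inferred[p.has_input] = "material target of observation"
    return inferred


INFERRED = inferred_types(PROCESSES)
```

In an ontology-backed system a reasoner would draw these conclusions from class definitions; the dictionary lookup here only mimics that behaviour for a handful of rows.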

List all processes that took place in 2010 as part of the Moorea insect inventory
Query: BFO:process and BFO:part of occurrent BCO_example:Moorea insect inventory and date=2010

| Study | process ID | process type | date |
|---|---|---|---|
| Moorea insect inventory | insect collection 01 | material sampling process | 2010 |
| Moorea insect inventory | insect collection 02 | material sampling process | 2010 |
| Moorea insect inventory | tissue sampling 01 | material sampling process | 2010 |
| Moorea insect inventory | tissue sampling 02 | material sampling process | 2010 |
| Moorea insect inventory | insect gut sampling 04 | material sampling process | 2010 |
| Moorea insect inventory | insect gut sampling 05 | material sampling process | 2010 |
| Moorea insect inventory | dna isolation 01 | material sampling process | 2010 |
| Moorea insect inventory | dna isolation 02 | material sampling process | 2010 |
| Moorea insect inventory | dna isolation 04 | material sampling process | 2010 |
| Moorea insect inventory | dna isolation 05 | material sampling process | 2010 |
| Moorea insect inventory | insect observation 06 | observing process | 2010 |
| Moorea insect inventory | identification 01.1 | tax. iden. by morph. key | 2010 |
| Moorea insect inventory | identification 02.1 | tax. iden. by morph. key | 2010 |
| Moorea insect inventory | identification 04.1 | tax. iden. using BLAST | 2010 |
| Moorea insect inventory | identification 04.2 | tax. iden. using BLAST | 2010 |
| Moorea insect inventory | identification 04.3 | tax. iden. using BLAST | 2010 |
| Moorea insect inventory | identification 04.4 | tax. iden. using BLAST | 2010 |
| Moorea insect inventory | identification 04.5 | tax. iden. using BLAST | 2010 |
| Moorea insect inventory | identification 06.1 | morph. tax. identification | 2010 |
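The query above amounts to a filter on study membership and date. The following Python sketch shows that logic on plain records rather than on the ontology. `ProcessRecord` and `processes_in` are hypothetical names, and the last two rows are invented solely to show what the filter excludes.

```python
from typing import NamedTuple


class ProcessRecord(NamedTuple):
    """A process with the study it is part of and the year it took place."""
    study: str
    process_id: str
    process_type: str
    year: int


RECORDS = [
    # Rows taken from the result table.
    ProcessRecord("Moorea insect inventory", "insect collection 01", "material sampling process", 2010),
    ProcessRecord("Moorea insect inventory", "tissue sampling 01", "material sampling process", 2010),
    ProcessRecord("Moorea insect inventory", "identification 04.1", "tax. iden. using BLAST", 2010),
    # Hypothetical rows, added only to demonstrate exclusion.
    ProcessRecord("Moorea insect inventory", "insect collection 99", "material sampling process", 2011),
    ProcessRecord("Hypothetical other study", "tissue sampling 99", "material sampling process", 2010),
]


def processes_in(records, study, year):
    """Return the records that are part of `study` and took place in `year`."""
    return [r for r in records if r.study == study and r.year == year]


RESULT_IDS = [r.process_id for r in processes_in(RECORDS, "Moorea insect inventory", 2010)]
```

With an ontology and a reasoner, the "part of" test would also cover processes nested several levels deep; this flat filter only checks the study recorded directly on each row.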

List the output (“has specified output”) of every “taxonomic identification process” that has as input (“has specified input”) the "insect 03".

| Study | process ID | process type | has specified input | has specified output |
|---|---|---|---|---|
| Moorea insect inventory | identification 03.1 | tax. iden. by morph. key | insect 03 | insect taxon 01 |

| Study | process ID | process type | has input | has output |
|---|---|---|---|---|
| Moorea insect inventory | tissue sampling 03 | material sampling process | insect 03 | tissue sample 03 |
| Moorea insect inventory | dna isolation 03 | material sampling process | tissue sample 03 | DNA sample 03 |
| Moorea insect inventory | identification 03.2 | tax. iden. using dna barcode | DNA sample 03 | insect taxon 03 |
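The two tables suggest that an identification can attach to the insect directly or to a sample derived from it through a chain of sampling processes. As a rough sketch of that idea, the Python below follows input/output links breadth-first from "insect 03" and collects every identification it reaches. The function name and the flat set `IDENTIFICATION_TYPES` are assumptions made for the example; in the ontology these would be subclasses of the taxonomic identification process rather than a hard-coded set.

```python
from collections import deque

# (process ID, process type, has input, has output), from the tables above.
PROCESSES = [
    ("identification 03.1", "tax. iden. by morph. key", "insect 03", "insect taxon 01"),
    ("tissue sampling 03", "material sampling process", "insect 03", "tissue sample 03"),
    ("dna isolation 03", "material sampling process", "tissue sample 03", "DNA sample 03"),
    ("identification 03.2", "tax. iden. using dna barcode", "DNA sample 03", "insect taxon 03"),
]

# Assumption: these process types all count as taxonomic identification.
IDENTIFICATION_TYPES = {
    "tax. iden. by morph. key",
    "tax. iden. using dna barcode",
    "tax. iden. using BLAST",
    "morph. tax. identification",
}


def identifications_for(start, processes):
    """Return (process ID, taxon) pairs for `start` and everything sampled from it."""
    results = []
    seen = {start}
    queue = deque([start])
    while queue:
        entity = queue.popleft()
        for process_id, process_type, has_input, has_output in processes:
            if has_input != entity:
                continue
            if process_type in IDENTIFICATION_TYPES:
                # An identification ends the chain: record its output taxon.
                results.append((process_id, has_output))
            elif has_output not in seen:
                # A sampling step: keep following the derived material.
                seen.add(has_output)
                queue.append(has_output)
    return results


TAXA = identifications_for("insect 03", PROCESSES)
```

The direct query in the first table corresponds to the first pair returned; the second pair only appears because the traversal follows the tissue and DNA samples back to the same insect.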

Future directions - technical
SPARQL endpoint with example queries
– Check the BCO wiki
Implement community curation tools such as Quick Term Templates or BioPortal
– Requests can go to the issue tracker now

Future directions - ontological
– Better integration with OBI and other ontologies
– More sophisticated treatment of naming/taxonomy/identification
– Ontological modeling of surveys/inventories
– Mappings to DwC, MIxS, other vocabularies
– Testing with real data sets

Contributors: Steve Baskauf, Vijay Barve, Jim Beach, Reed Beaman, Matthiew Bietz, Stan Blum, Shawn Bowers, Pier Luigi Buttigieg, Neil Davies, Gabi Droege, Dag Endresen, Maria Alejandra Gandolfo, Robert Hanner, Alyssa Janning, Michelle Koo, Kris Krishtalka, John Kunze, Andréa Matsunaga, Peter Midford, Chuck Miller, Norman Morrison, Gil Nelson, OBI Developers, Éamonn O’Tuama, Cynthia Parr, Sujeevan Ratnasingham, Jai Rideout, Robert Robbins, Phillipe Rocca-Serra, Joel Sachs, Inigo San Gil, Herbert Schentz, Mark Schildhauer, Barry Smith, Peter Sterk, Steve Stones-Havas, Brian Stucky, Andrea Thomer, Mellisa Tulig, Dave Vieglais, Brian Wee, Trish Whetzel, Jamie Whitacre, Greg Whitbread, John Wooley

Funding:
– RCN4GSC: Research Coordination Network for Genomic Standards Consortium (DBI )
– IB3 EAGER: An Interoperable Information Infrastructure for Biodiversity Research (IIS )

Questions? #!forum/bco-discuss