1B Publishing Primary Biodiversity Data

Slides:



Advertisements
Similar presentations
Katia Cezón GBIF Spain, Coordination Unit Real Jardín Botánico, Madrid 2014 Mentoring Project 2014 France-Portugal-Spain DATA QUALITY WORKFLOW.
Advertisements

GUID-1 Workshop Welcome and Introduction Donald Hobern GBIF Program Officer for Data Access and Database Interoperability February 2006.
Publish or perish? Linking Scratchpads and the new Biodiversity Data Journal for streamlining publication of botanical data D.N Koureas 1, L. Penev 2 &
BIS TDWG Conference 29 October 2014, Jönköping, Sweden Publishing sample-based data using Darwin Core Archives Éamonn Ó Tuama, Markus Döring, Kyle Braak,
Entomological Collections Network Meeting, Indianapolis, IN 13 December 2009 Darwin Core Ratified in the Year of Darwin Gail E. Kampmeier Illinois Natural.
Scratchpads Publishing biodiversity: The interplay between Scratchpads and the Biodiversity Data Journal Dr Dimitrios Koureas Biodiversity Informatics.
GLOBAL BIODIVERSITY INFORMATION FACILITY David Remsen ECAT Program Officer September G A Darwin-Core Archive solution to publishing and.
BIS TDWG Conference 28 October 2013, Florence Documenting data quality in a global network: the challenge for GBIF Éamonn Ó Tuama, Andrea Hahn, Markus.
EZID (easy-eye-dee) is a service that makes it simple for digital object producers (researchers and others) to obtain and manage long-term identifiers.
II Course on GBIF Node Management Arusha, Tanzania 31 st October and 1 st November 2008 Tim ROBERTSON Systems Architect GBIF Secretariat Data Publishing.
IDigBio is funded by a grant from the National Science Foundation’s Advancing Digitization of Biodiversity Collections Program (Cooperative Agreement EF ).
GLOBAL BIODIVERSITY INFORMATION FACILITY Dr Vishwas Chavan Senior Programme Officer for DIGIT Data Citation Mechanism and.
VOCABULARIES A data management presentation. Data management best practices Inventory of resources/datasets – Database level or series of datasets/collections.
GLOBAL BIODIVERSITY INFORMATION FACILITY The Global Biodiversity Information Facility (GBIF ): The distributed architecture Samy Gaiji Head of Informatics.
11 th GBIF Global NODES Meeting Incentivising and Strategising Publishing of Biodiversity Data Vishwas Chavan Senior Programme Officer for Digitisation.
Eastern Bearded-dragon (Pogona barbata) – Toowoomba, Australia © Arthur D. Chapman Principles of Data Quality Australian Biodiversity Information Services.
Training course on biodiversity data publishing and fitness-for-use in the GBIF Network, 2011 edition How Darwin Core Archives have changed the landscape.
Mid-Term GBIF Committees Meetings eLearning Alberto González Talaván Global Biodiversity Information Facility (GBIF) May 2011.
1 DanBIF Danish Biodiversity Information Facility Arbejdsseminar om GBIF i Norge Norges Forskningsråd, Oslo 25. September 2003 Isabel Calabuig.
[] Where Did Those GBIF Occurrences Come From? Providing Digital Access to NatureServe's Reference Database: Report on a Project in the Early Stages of.
GLOBAL BIODIVERSITY INFORMATION FACILITY TDWG 2009, Montpelier, November 12, 2009 Dag Endresen (NordGen)Samy Gaiji (GBIF) Dag Endresen (NordGen) & Samy.
Standards and tools for publishing biodiversity data Yu-Huang Wang June 25, 2012.
GBIF Publishing Platform May Core publishing focus Primary Biodiversity Data (Specimens & Observations, Ecological Data) - Core data type is an.
Digitization of Natural History Collections (DIGIT) Larry Speers Program Officer Digitization of Natural History Collections Data TDWG Annual Meeting Oct.
GLOBAL BIODIVERSITY INFORMATION FACILITY Éamonn Ó Tuama Senior Programme Officer, IDA 21 June Metadata publishing with the IPT.
Biodiversity Data Journal: mobilization, reuse and integration of small data Lyubomir D. Penev 1,3, Teodor A. Georgiev 3, Pavel E. Stoev 2,3, Jordan Bisserkov.
Resolving the publishing bottleneck and increasing data interoperability in biodiversity science Lyubomir Penev, Teodor Georgiev, Pavel Stoev, David Roberts,
Experts Workshop on the IPT, v. 2, Copenhagen, Denmark The Pathway to the Integrated Publishing Toolkit version 2 Tim Robertson Systems Architect Global.
Definition of an Observation In general, an observation represents the measurement of some attribute, of some thing, at a particular time and place. Observations.
Global Biodiversity Information Facility GLOBAL BIODIVERSITY INFORMATION FACILITY DNA Barcoding in Southern Africa Cape Town 7 April
BIS TDWG Conference, New Orleans 2011 Knowledge Organization Systems Session - Introduction Éamonn Ó Tuama Senior Programme Officer, Inventory, Discovery,
Isabel Calabuig Lotte Endsleff 1 NODES regional MEETING Europe Digitarium,
Global Biodiversity Information Facility GLOBAL BIODIVERSITY INFORMATION FACILITY Meredith A. Lane CODATA/ERPANET Workshop: Scientific Data Selection &
Laura Russell Programmer VertNet Buenos Aires (Argentina) 28 September 2011 Training course on biodiversity data publishing and.
Dag Endresen Knowledge Systems Engineer GBIF New Orleans (Louisiana, USA) 20 October 2011 Biodiversity Information Standards, TDWG.
CBD CoP 11 Special Event National Biodiversity Information Outlook (NBIO) Vishwas Chavan 15 October 2012 Hyderabad.
Don’t make me think Biodiversity Data Publishing Made Easy Laurence Livermore, Vince Smith, Alice Heaton, Simon Rycroft, Ed Baker, Ben Scott & Lyubomir.
1 The National Biological Information Infrastructure and Biodiversity Collections Annette Olson BCI meeting, Washington DC, January 28-29th, 2008.
Scratchpads and the new Biodiversity Data Journal Biodiversity Data Publishing made… easier Dimitris Koureas Natural History Museum London.
Acronym Soup GBIF, TDWG & GUIDs Jerry Cooper. Global Biodiversity Information Facility (GBIF) Established in 2000 through non-binding MOU (25 countries.
IABIN Executive Committee / Coordinating Institution Meeting GBIF and IABIN: status and opportunities in 2011 Juan Bello, Mélianie Raymond & Alberto González-Talaván.
Global Biodiversity Information Facility GLOBAL BIODIVERSITY INFORMATION FACILITY Hannu Saarenmaa EC CHM & GBIF European Regional Nodes Meeting Copenhagen,
CEPDEC-TZ Training course: Digitisation of Biodiversity Information 13th – 17th July 2009 Dar es Salaam, Tanzania GLOBAL BIODIVERSITY INFORMATION FACILITY.
The New GBIF Data Portal Web Services and Tools Donald Hobern GBIF Deputy Director for Informatics October 2006.
Experts Workshop on the GBIF INTEGRATED PUBLISHING TOOLKIT V. 2 IPT Resources Alberto González Talaván Global Biodiversity Information Facility (GBIF)
Laura Russell VertNet Meherzad Romer NatureServe Canada John Wieczorek
GLOBAL BIODIVERSITY INFORMATION FACILITY Vishwas Chavan and Eric Gilman 10 th Meeting of the GBIF Participant Node Managers Committee 3 – 5 October 2009.
GLOBAL BIODIVERSITY INFORMATION FACILITY David Remsen Senior Programme Officer, ECAT 3 Oct th Nodes Meeting.
IPT + Darwin Core OBIS XML Schema OBIS Database Schema Explained Mike Flavell OBIS Data Manager OBIS Nodes Training Course, Oostende, Belgium, 6 May 2014.
GBIF NODES Committee Meeting Copenhagen, Denmark 4 th October 2009 The GBIF Integrated Publishing Toolkit Alberto GONZÁLEZ-TALAVÁN Programme Officer for.
Training course on biodiversity data publishing and fitness-for-use in the GBIF Network, 2011 edition Practical Example of Data Mobilization Planning:
GB22 TRAINING EVENT FOR NODES – 4 OCTOBER 2015 Session 02: 2015 Data Publishing Landscape Laura Russell.
Slides Template for Module 3 Contextual details needed to make data meaningful to others CC BY-NC.
Colombia: Capacity enhancement in Latin America
Session 05: Promoting data publishing
GBIF Implementation Plan Highlights
International Congress of Entomology, Orlando
The IPT user interface and data quality tools
Open access as a means to produce high quality data Anja Gassner Head Research Method Group Sentinel Landscape Coordinator FTA World Agroforestry Centre.
Flanders Marine Institute (VLIZ)
Training course on biodiversity data publishing and fitness-for-use in the GBIF Network, 2011 edition How Darwin Core Archives have changed the landscape.
GBIF Governing Board 20 12th Global Nodes Meeting
Data publishing from the viewpoint of a biodiversity publisher
GLOBAL BIODIVERSITY INFORMATION FACILITY
Citizen Science’s contribution to GEO BON
Open Access to your Research Papers and Data
OBIS Data flows Dave Watts 8 March 2017 Data Centre, O&A.
Bird of Feather Session
HOW (and why?) DO WE DESCRIBE ?
Presentation transcript:

1B Publishing Primary Biodiversity Data Alberto González-Talaván1 Data Sharing, Data Standards, and Demystifying the IPT Gainesville, FL, USA. 13 January 2015 1 GBIF Secretariat

Structure of this session What is biodiversity data? Rationale for biodiversity data publishing Data publishing procedure Data exchange standards The technical infrastructure Data publishing software GBIF Integrated Publishing Toolkit

Structure of this session What is biodiversity data? Rationale for biodiversity data publishing Data publishing procedure Data exchange standards The technical infrastructure Data publishing software GBIF Integrated Publishing Toolkit

What is biodiversity data? Digital text or multimedia data record detailing facts about the instance of occurrence of an organism, i.e. on the what, where, when, how and by whom of the occurrence and the recording. Dictionary picture from Asif Akbar, obtained via freeimages.com (http://www.freeimages.com/photo/1150967).

What is biodiversity data? Specimen labels For many of the participants in this workshop, primary biodiversity data may immediately bring to their minds these kind of images. Images from the biological collections of the Zoological Museum of the University of Copenhagen (Denmark).

What is biodiversity data? Journals Checklists Assessments Urban biodiversity

What is biodiversity data? Citizen science Genetics Camera traps Satellite images

What is biodiversity data? Specimen labels Journals Checklists Assessments Urban biodiversity Citizen science Genetics Different data sources, data types… impose different requirements in the publishing process, standards, software, etc. Camera traps Satellite images …

Structure of this session What is biodiversity data? Rationale for biodiversity data publishing Data publishing procedure Data exchange standards The technical infrastructure Data publishing software GBIF Integrated Publishing Toolkit

Structure of this session What is biodiversity data? Rationale for biodiversity data publishing Data publishing procedure Data exchange standards The technical infrastructure Data publishing software GBIF Integrated Publishing Toolkit

Rationale for Publishing: What is Publishing? “Publishing” refers to making biodiversity datasets publicly accessible and discoverable, in a standardized form, via an access point, typically a web address (a URL). IPT ∞

Rationale for Data Publishing: Use Chapman, A.D., 2005, Uses of Primary Species-Occurrence Data, version 1.0. Report for the Global Biodiversity Information Facility, Copenhagen. 100 pp. ISBN: 87-92020-01-1. http://www-old.gbif.org/orc/?doc_id=1300

Rationale for Data Publishing: Use Taxonomy Agriculture, Forestry, Fisheries and Mining Biogeographic studies Species diversity and populations Health and Public Safety Bioprospecting Life histories and phenologies Forensics Endangered, Migratory and Invasive Species Border Control and Wildlife Trade Impact of Climate Change Education and Public Outreach Ecology, Evolution and Genetics Ecotourism and Recreational Activities Environmental Regionalisation Conservation Planning Society and Politics Natural Resource Management Human Infrastructure Planning

Rationale for Data Publishing: exercise Featured data section in GBIF.org http://www.gbif.org/newsroom/uses GBIF Public Library in Mendeley http://goo.gl/btrzDa (requires Mendeley account) Instead of giving an old-style lecture about uses, I suggest an exercise instead: why don’t you use the ‘featured data use’ section of GBIF.org and GBIF Science Reviews http://www.gbif.org/resources/3094

Rationale for Data Publishing: data quality Verbatim data In the section talking about the metadata, you will notice that it can be produced as an RDF file. Those files can be submitted to different journals and Processed data

Rationale for Data Publishing: citation & usage “Data citation standards can form the basis for increased incentives, recognition, and rewards for scientific data activities. Unfortunately, such standards and good practices are lacking” CODATA Data Citation Task Group “We believe that the lack of incentive similar to the impact factor for scholarly publication remains a major impediment to the provision of free and open access to biodiversity data” GBIF Data Publishing Framework Task Group In the section talking about the metadata, you will notice that it can be produced as an RDF file. Those files can be submitted to different journals and

Rationale for Data Publishing: benefits Data Paper A scholarly publication of searchable metadata document describing a dataset, or a group of datasets Promote and publicize the existence of the data Provide scholarly credit to data publishers through citable journal publications Describe the data in a structured human-readable form In the section talking about the metadata, you will notice that it can be produced as an RDF file. Those files can be submitted to different journals and

Structure of this session What is biodiversity data? Rationale for biodiversity data publishing Data publishing procedure Data exchange standards The technical infrastructure Data publishing software GBIF Integrated Publishing Toolkit

Structure of this session What is biodiversity data? Rationale for biodiversity data publishing Data publishing procedure Data exchange standards The technical infrastructure Data publishing software GBIF Integrated Publishing Toolkit

Data Publishing Procedure Prioritization & planning Capture Curation Export & preparation Publishing The process of data publishing usually implies many more phases that those we are going to see in this workshop.

Data Publishing Procedure GBIF has been working on this matter for more than a decade now and there is plenty of documentation in different languages, and also training opportunities for those interested. Two key resources Frazier, C.K., Wall, J., and S. Grant. 2008. Initiating a Natural History Collection Digitisation Project, version 1.0. Copenhagen: Global Biodiversity Information Facility. 75 pp. Accessible online at http://www.gbif.org/orc/?doc_id=2176 Towards a Global Strategy and Action Plan for Discovery and Publishing of Natural History Collections Data. Biodiversity Informatics, 7, 2010. ISSN: 1546-9735. Accessible online at https://journals.ku.edu/index.php/jbi/issue/view/323 GBIF. 2010. Best practice guide for ‘Data Discovery and Publishing Strategy and Action Plans’ version 1.0. Authored by Chavan, V. S., Sood, R. K., and A. H. Arino. 2010. Copenhagen: Global Biodiversity Information Facility, 29 pp. ISBN: 87-92020-12-7. Accessible online at http://www.gbif.org/orc/?doc_id=2755 More resources are available in the resources area of GBIF.org: http://www.gbif.org/resources/summary.

Structure of this session What is biodiversity data? Rationale for biodiversity data publishing Data publishing procedure Data exchange standards The technical infrastructure Data publishing software GBIF Integrated Publishing Toolkit

Structure of this session What is biodiversity data? Rationale for biodiversity data publishing Data publishing procedure Data exchange standards The technical infrastructure Data publishing software GBIF Integrated Publishing Toolkit

Biodiversity Data Standards ABCD Access to Biological Collection Data DwC Darwin Core DwC-A Darwin Core Archive NCD Natural Collection Descriptions AC Audubon Core … … TDWG is the international body where the standards related to biodiversity data are discussed and agreed. There is a set of procedures that the suggested standards have to go through before they are approved and implemented. www.tdwg.org

Biodiversity Data Standards: DwC higherClassification coordinatePosition specificEpithet geodeticDatum collectionCode taxonConceptID Darwin Core – a glossary of terms DwC is a defined set of terms with their definitions taxonRank collectionCode: The name, acronym, coden, or initialism identifying the collection or data set from which the record was derived. Examples: "Mammals", "Hildebrandt", "eBird".

Biodiversity Data Standards: Simple DwC Flat table Few restrictions http://rs.tdwg.org/dwc/terms/simple/index.htm http://rs.tdwg.org/dwc/terms/simple/index.htm

Biodiversity Data Standards: DwC-A DwC Archive Ext 5 Ext 1 + meta.xml Core http://rs.tdwg.org/dwc/terms/guides/text/index.htm Ext 2 Ext 4 EML.xml Ext 3

Biodiversity Data Standards: DwC-A Ex1 DwC Archive Occurrences Geographical + meta.xml Occurrence Core http://rs.tdwg.org/dwc/terms/guides/text/index.htm Media Germoplasm EML.xml Determination

Biodiversity Data Standards: DwC-A Ex2 DwC Archive Checklist Types Description + Distribution meta.xml Taxon Core http://rs.tdwg.org/dwc/terms/guides/text/index.htm Literature Vernacular EML.xml Occurrences

Biodiversity Data Standards: DwC-A Ex3 DwC Archive Samples Relevé Occurrences + meta.xml Event Core http://rs.tdwg.org/dwc/terms/guides/text/index.htm EML.xml Measurement/Fact

Structure of this session What is biodiversity data? Rationale for biodiversity data publishing Data publishing procedure Data exchange standards The technical infrastructure Data publishing software GBIF Integrated Publishing Toolkit

Structure of this session What is biodiversity data? Rationale for biodiversity data publishing Data publishing procedure Data exchange standards The technical infrastructure Data publishing software GBIF Integrated Publishing Toolkit

The technical infrastructure: Summary

The technical infrastructure: processing Official launch of the new GBIF.org http://vimeo.com/77782067 - from 24:15 to 27:00

Structure of this session What is biodiversity data? Rationale for biodiversity data publishing Data publishing procedure Data exchange standards The technical infrastructure Data publishing software GBIF Integrated Publishing Toolkit

Structure of this session What is biodiversity data? Rationale for biodiversity data publishing Data publishing procedure Data exchange standards The technical infrastructure Data publishing software GBIF Integrated Publishing Toolkit

Data publishing software: some options

Data publishing software: spreadsheets Metadata Primary Biodiversity data Species Checklists

Structure of this session What is biodiversity data? Rationale for biodiversity data publishing Data publishing procedure Data exchange standards The technical infrastructure Data publishing software GBIF Integrated Publishing Toolkit

Structure of this session What is biodiversity data? Rationale for biodiversity data publishing Data publishing procedure Data exchange standards The technical infrastructure Data publishing software GBIF Integrated Publishing Toolkit

The GBIF Integrated Publishing Toolkit

The GBIF Integrated Publishing Toolkit: Vision A single platform allowing the sharing of Primary biodiversity data Species name information Dataset descriptions (metadata) The ability to register with GBIF Technical contact information E.g. Internet URLs Physical contact information E.g. telephone details Institutional affiliations Accurate attribution Connect Databases Upload text files Lower the technical threshold for participation Flexibility to accommodate data extensions Support efficient and simple transfer of content An open source project

Thank you!