GLOBAL BIODIVERSITY INFORMATION FACILITY
David Remsen, Senior Programme Officer, ECAT
3 Oct th Nodes Meeting
Use cases
- Map species data to a controlled list or authority file (or system): Are my names spelled right? Is my taxonomy up to date?
- Provide controlled lists for data entry
- Build new regional or thematic species lists
- Provide integrated/flexible taxonomic browsing of species data
- Support browsing in native/other languages
- Support the use of common names in search/access
- Find species information regardless of misspellings, synonyms, etc.
- Automated processes for identifying, extracting, validating, and linking names in documents, publications, web sites, etc.
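The first and seventh use cases (checking names against an authority file, and finding species despite misspellings and synonyms) can be illustrated with a minimal sketch. It is not a GBIF or ECAT service: the authority list, the synonym map, the `match_name` function, and the 0.85 similarity cutoff are all hypothetical, and the fuzzy step uses Python's standard `difflib` in place of whatever matching a production name service applies.

```python
import difflib

# Hypothetical authority file: accepted names plus a synonym -> accepted name map.
ACCEPTED = ["Puma concolor", "Panthera leo", "Quercus robur"]
SYNONYMS = {"Felis concolor": "Puma concolor"}


def match_name(name, cutoff=0.85):
    """Return (accepted_name, status) for a verbatim name.

    status is one of "exact", "synonym", "fuzzy", or "unmatched".
    """
    cleaned = " ".join(name.split())  # collapse stray whitespace
    if cleaned in ACCEPTED:
        return cleaned, "exact"
    if cleaned in SYNONYMS:
        return SYNONYMS[cleaned], "synonym"
    # Fuzzy step: compare against accepted names and known synonyms alike.
    candidates = ACCEPTED + list(SYNONYMS)
    close = difflib.get_close_matches(cleaned, candidates, n=1, cutoff=cutoff)
    if close:
        hit = close[0]
        # If the closest string is a synonym, resolve it to its accepted name.
        return SYNONYMS.get(hit, hit), "fuzzy"
    return None, "unmatched"


if __name__ == "__main__":
    for verbatim in ["Puma concolor", "Felis concolor", "Panthera loe", "Homo sapiens"]:
        print(verbatim, "->", match_name(verbatim))
```

Returning a status next to the matched name lets a data publisher review fuzzy and synonym matches separately instead of accepting them silently.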
Scope of the Global Names Architecture: a global publication and discovery system for taxon names and concepts
GNA infrastructure extends GBIF infrastructure: common publication framework for GBIF and other networks
ECAT Work Programme
- Define/implement the architectural framework for publishing taxonomic data
- Build a global index of published “checklists” to enable discovery, integration and access
- Build services and tools that use the published “checklist” data
Scope: “Checklists”
- Taxonomic Catalogues
- Monographs, Regional Flora/Faunas, Taxonomic “Aggregates”
- Species Inventories
- Regional species lists
- Thematically defined species lists
- Red Lists, Invasive Species Lists, Medicinal, etc.
- Common Name Lists/Inventories
- Species indices in published content
Status of Architecture
- Robust and extensible data standard in place
- Documentation, examples, code
- Tools and services to support global extension and use of the standard
- Capacity to register, publish, and access data is in place now *
- Simple Name indexes
- Complex Taxonomic Data
* Needs promotion, use, and refinement
Rich, Extensible, Simple Standard
Developing, Detailed documentation: Source Code, Schemas, Tools, Instructions, Documentation. Feedback!!! (please)
Extensible, International, Controlled: Community authoring, Controlled Vocabularies, Extensions, Multi-lingual Thesauri. Nov 2009 release.
Publishing Infrastructure is in place: 30% to 95% of occurrence data linked to taxonomic sources
Goal 2 of ECAT Work: Build and integrate Indexes of the Published Checklist Data
Global Checklist Index: a global name service brokerage to names hosted on taxonomic servers
Global Names Index
- Publicise species data entry by species name
- Facilitates linking URIs + species names
- Web Service
- “Fuzzy” name matching
- Simple DwC standard
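The idea of an index that links species names to URIs can be sketched as follows. This is an illustration under stated assumptions, not the Global Names Index implementation: the `NameIndex` class, the `normalize` function, and the `example.org` URIs are hypothetical. Only the `scientificName` key is taken from Darwin Core (DwC), where it is a standard term.

```python
from collections import defaultdict


def normalize(name):
    """Lower-case a name and collapse whitespace so trivial variants share a key."""
    return " ".join(name.lower().split())


class NameIndex:
    """Hypothetical in-memory index mapping a name string to the URIs that use it."""

    def __init__(self):
        self._uris = defaultdict(set)

    def add(self, record):
        # Each record is a dict with a Darwin Core scientificName and a source URI.
        self._uris[normalize(record["scientificName"])].add(record["uri"])

    def lookup(self, scientific_name):
        # .get() avoids creating empty entries for names that were never indexed.
        return sorted(self._uris.get(normalize(scientific_name), set()))


if __name__ == "__main__":
    index = NameIndex()
    # Hypothetical records from two different data publishers.
    index.add({"scientificName": "Puma concolor", "uri": "http://example.org/a/123"})
    index.add({"scientificName": "Puma  concolor", "uri": "http://example.org/b/456"})
    print(index.lookup("puma concolor"))
```

Keying on a normalized string means that records from different publishers meet at the same entry, which is what allows a single name lookup to return every resource that mentions the name.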
Integration of Indexes within the Data Portal
Goal 3 of ECAT Work: Build Services & Applications that use published Checklists
Using published Checklist Data
Enabling new and extended uses of taxonomic data: New Derived Products, New Processes/Technologies
Taxon Tagger Application (Silverbiology)
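The use case listed earlier, identifying and extracting names in documents, can be sketched in a few lines. This is not how the Taxon Tagger application works, which the slides do not describe. It is a simplified illustration: the regular expression, the `find_name_candidates` function, and the set of known genera are assumptions, and real name-finding tools handle many cases this pattern misses (abbreviated genera, subspecies, authorship strings).

```python
import re

# A capitalised word followed by a lower-case word: a rough shape for a binomial.
BINOMIAL = re.compile(r"\b([A-Z][a-z]+) ([a-z]{3,})\b")


def find_name_candidates(text, known_genera):
    """Return (name, offset) pairs whose first word is a known genus.

    known_genera is a hypothetical set of genus names, e.g. drawn from a checklist.
    """
    hits = []
    for match in BINOMIAL.finditer(text):
        genus, epithet = match.group(1), match.group(2)
        # Filtering on genus discards ordinary phrases such as "The puma".
        if genus in known_genera:
            hits.append((f"{genus} {epithet}", match.start()))
    return hits


if __name__ == "__main__":
    sample = "The puma (Puma concolor) was seen near Quercus robur. However many oaks remain."
    for name, offset in find_name_candidates(sample, {"Puma", "Quercus"}):
        print(offset, name)
```

Keeping the character offset with each hit makes it possible to link or tag the name at its position in the source text.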
Uses that provide benefits to GBIF participants: applied to data access and integration
Questions
- Does this enhanced capacity to publish species-level data, particularly checklists, serve a need with the NODES?
- Is the IPT sufficient for providing the capacity to publish these types of checklists?
- Would NODES like additional options?
- Focus on cataloging as much Checklist content as possible
- LifeDesks and Scratchpads, plus all the known databases that are “out there”
- Develop the widest array of basic web services to enable their use?
- More name processing tools/mapping services?
How to contact GBIF:
Web site:
Data portal: data.gbif.org
GBIF Secretariat, Universitetsparken, Copenhagen, Denmark
Phone:
Fax: