Download presentation
Presentation is loading. Please wait.
Published byDelphia Osborne Modified over 9 years ago
1
GLOBAL BIODIVERSITY INFORMATION FACILITY Vishwas Chavan and Nicholas King February 12, 2008 vchavan@gbif.org WWW.GBIF.ORG GBIF efforts in digitizing and mobilising primary biodiversity data
2
GBIF’s Mission …to make the world’s biodiversity data freely and universally available via the Internet What is biodiversity? GBIF follows the broadly outlined CBD recognition of levels of biological diversity: Molecules / genes Species Ecosystems / ecology
3
l Scientists, experts, consultants l Government officials at all levels l Farmers, foresters, indigenous communities l Education at all levels l NGOs and the general public These needs are highly varied, but can be met by open access to the same datasets The same data can be analysed differently for different uses Who needs primary biodiversity data!
4
But this needs easy access to (digitised) data Screen shot: 26 Oct 2007 As of end 07 GBIF facilitates access to 142 million primary data records
5
GBIF Data Portal: Dispelling Mythes! l Searches l Taxonomic l Geographic (by country or bounding-box) l By dataset l Taxonomic browse navigation using choice of classification l Integration of data: DiGIR-Darwin Core & BioCASe-ABCD (new versions), TAPIR, tab-delimited, TCS, SDD l Search and download by one to many species, geography, dataset (or combination) l Web services Distributed, Decentralised, Data Discovery and Access through network of heterogenous and multicultural partners is possible!
6
Countries are organised alphabetically on the lhs, and show numbers of national records on the rhs. Here we can see that there are more than 3.2 million records available for South Africa (2,8 million with coordinates), referring to nearly 41.500 species
7
Example of a country summary page. This map provides an overview of the density of records currently available.
8
Sample of records available for South Africa at September 2007. The GBIF portal offers a range of options for further use of the data…
9
It is also possible to get the full list of organisations providing data collected in a specific country or region In this case 68 collections from all over the world are making available data for South Africa through GBIF – a good exemplar of data repatriation activities promoted and facilitated by GBIF
10
South African institutions are also providing data relevant to other countries and regions in the world, as demonstrated in this example from the Shark Collection at the Iziko South African Museum
11
The GBIF data portal also allows for more detailed views of regions, datasets, taxonomic groups, etc. Here it is possible to see nearly 100 000 records from the Linefish dataset collected in 1989 by the Marine and Coastal Management (MCM) at the Department of Environmental Affairs and Tourism in South Africa
12
Exporting data from the GBIF data portal to other applications such as Google Earth is a matter of a click!
13
Coverage for Africa l >5m records currently for Africa l > 1m from EU country institutions l Estimated >100m not yet digitised
19
Within Google Earth overlays it is also possible to go down to the level of individual primary records, getting back to the original data provider
20
With the filter functionality it is possible to perform complex queries on the data. In this example we are looking for all records on Lepidoptera (butterflies) collected or observed in South Africa from 1950 to 2000.
21
Proteaceae in the Cape Floral Kingdom Range changes due to Climate Change
22
Leucospermum tomentosum: range centres in 10 year time slices
23
But, this is just a beginning....... We need to cover much beyond imagination, and much much faster than we think?
24
Biological Data Domain - challenges Persistent digital and physical data stores, moderately accessible Migration of legacy data, metadata generation, taxonomy (species) 80% ? digital Ecological & Ecosystem Data Persistent physical data stores, accessible with difficulty Digitisation, migration of legacy data, indexing <5% digital Species- & Specimen Data Persistent digital, universally accessible data stores Data migration, cleansing, vouchering, taxonomy (gene & species) 95% digital Molecular Sequence & Gene/Genome Data Sub-domain Digital Status Greatest Informatics Problems Data Status
25
Primary Biodiversity Data
26
Observations / Monitoring Multimedia Resources Biological Collections NAMESNAMES
27
Growth rate of GBIF data sharing
28
1 Billion Record by 2008 – We need to expedite! l Many specimens remain to have their data digitised l Many records are already digital... l … but are not yet being shared * data useful in analyses that contribute to sustainable management of biodiversity
29
GBIF is all about our shared vision and partnership l 28 Voting Country Participants l 15 Associate country Participants l 35 International Organisations and Economies
30
GBIF Working Principles not l Collaboration and sharing — not compilation or entirely l Ownership of data (specimens or names) remains entirely with providers l Standardised schemata for data sharing — software free to providers l Worldwide network of collaborating institutions that share data (data providers) l GBIF’s Participants’ Nodes promote and coordinate activities of data providers
31
GBIF Working Principles l Procedures for interoperability and data integration l Web services (mostly for machines, but for people too) l Global registry for advertisement of shared data l Vision and coordination l GBIF has a unique global mandate in both Informatics and Content l GBIF is a multi-purpose, open-ended cyber- infrastructure that facilitates biologists serving biodiversity and society in new ways
32
GBIF Strategic Areas 2007 – 2011 l Informatics l Data portal powerful and friendly l Consolidated infrastructure and standards l Tools and support for Nodes and providers l Content l Data quantity and richness in priority areas l Data integration and discovery l Documented data quality l Participation l Nodes' expertise shared across the network l Guidance on setting up and maintaining Nodes
33
In a database, the data have no actual quality or value; they only have potential value. That value is realized only when someone uses the data to do something useful (English 1999). The quality of data cannot be assessed independently of the uses of that data (Strong et al. 1997). Data are of high quality if they are fit for their intended use in operations, decision-making, and planning (Juran 1964). Data: Fitness for Use
34
Data standards / protocols used by GBIF l Darwin Core (TDWG data standard) l Simple XML data model to represent taxon occurrence records (only core attributes) l Extensions to handle e.g. curation details, geospatial data, microbial specimens l ABCD - Access to Biological Collection Data (TDWG data standard) l More complex XML data model to represent collection or observation data l Detailed document structure including features for different communities l Taxon Concept Schema (TDWG data standard) l XML data model for exchange of nomenclatural/taxonomic data l Will be supported in new GBIF data portal l Tab-delimited links to species information l Lists of scientific names, URLs and key words l Will be supported in order to establish links to external resources from the new GBIF data portal
35
l DiGIR / BioCASe / TAPIR (TDWG access protocols) l XML protocols for searching remote data resources l Suitable for use with a wide range of different data models l TAPIR (latest version) supports flexible views and simple URLs l SPICE protocol (Species 2000 access protocol) l Web service interfaces for exploring taxonomic data (hierarchies, synonymy, common names) l Will be supported for connecting data resources to new GBIF data portal l LSIDs – Life Science Identifiers (TDWG-adopted GUID mechanism) l Globally unique identifiers to simplify tracking data records l Include protocol for resolving data for any LSID Data standards / protocols used by GBIF
36
Examples of resources provided by GBIF free
37
GBIF Training Manual 1: Digitisation of Natural History Collections CONTENTS l Introduction l The Uses of Primary Species Occurrence Data l Initiating a Natural History Collection Digitisation Project l Principles of Data Quality l Principles and Methods of Data Cleaning l BioGeomancer Guide to Best Practices for Georeferencing l Guide to Best Practices for Generalizing l Glossary and Acronym Expansion l To be released by end February 2007.
38
Observational Data Task Force l Quantum of observational data is unprecedented l Over 60% of GBIF mediated data is observational l Observational Data Task Group Recommend GBIF on mobilisation of observational data Criteria for Observational Data Sharing Infrastructure Metadata Schema for Observational Schema Protocols / Standards for observational data exchange / sharing Best Practices Guide for observational data management Encourage participation of potential data providers l Report by September 2008
39
l Broader range of supported import formats and protocols l Occurrence data l Darwin Core (original v1.2, MaNIS, OBIS, new v2.0 with extensions) l ABCD (v1.20, v2.06) l Taxonomic data l Catalogue of Life CD-ROM (moving to dynamic checklist) l Nomenclators via tab-delimited lists of LSIDs (work under way) l Data from ECAT projects (models and tools under way) l Other resources l Discussions under way with other resources (GenBank, BOLD, ARKive) l General support for handling XML and tab-delimited formats Enhanced support for data providers
40
l Validation and annotation of data during indexing l Presence of required fields l Consistency between country name and coordinates l Reports for data providers l Clear separation between “raw” and “processed” index data l Scientific name string versus interpreted taxon l Country name string versus interpreted country l “Home page” for each data resource Enhanced support for data providers
41
Training, Capacity Building, Mentoring l Training programs on how to share data l Training on Ecological Niche Modeling l Mentoring to developing countries l Help Desk services
42
Call for Action! With GBIFs’ decentralised approach of NBIFs, RBIFs, and ThBIFs Africa has lots to contribute..... Individual, institutional, national, regional and global level! With GBIFs’ decentralised approach of NBIFs, RBIFs, and ThBIFs Africa has lots to contribute..... Individual, institutional, national, regional and global level!
43
How to contact GBIF: Web site: www.gbif.orgwww.gbif.org Data portal: www.gbif.netwww.gbif.net GBIF Secretariat Universitetsparken 15 2100 Copenhagen Denmark E-mail: info @ gbif.orginfo @ gbif.org Phone: +45 3532 1470 Fax: +45 3532 1480 GBIF Secretariat building, supported by a grant from the Aage V. Jensens Fonde
44
Merci beau coup / Thank you Questions?
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.