Training course on biodiversity data publishing and fitness-for-use in the GBIF Network, 2011 edition Tools and Resources to Assess and Enhance Fitness-For-Use Meherzad Romer Senior Data Manager NatureServe Canada ) September 30, 2011
Overview 1.Tools that help: o Metadata o Spatial Data o Tabular Data 2.Other resources o Data sources o Documents
Tools - Metadata
Tools - Metadata Metadata Capabilities of IPT
Tools- Metadata Metadata Capabilities of the IPT Geographic coverage metadata in IPT
Tools - Metadata Darwin Core Archive Validator
Tools - Metadata Darwin Core Archive Validator
Tools - Spatial Data
Tools – Spatial Data BioGeomancer Convert text into coordinates Assign radius of uncertainty Single or Batch process
Tools – Spatial Data BioGeomancer Web based only Shows all possibilities
Tools – Spatial Data GeoLocate Convert text to coordinates Assign radius of uncertainty Single or batch process Correct by moving location place mark or drawing polygon Online, stand alone and collaborative versions
Tools – Spatial Data InfoXY Coordinates entered in lat/long Can batch process Results precise to the administrative boundaries Output: html or MS Excel spreadsheet
Tools – Spatial Data SpOutlier Enter lat/long (altitude) Map Better with more locations Uses weighted centre Checks on water/land
Tools – Spatial Data SpOutlier
Tools – Spatial Data Georeferencing Calculator Input: Coordinates, offsets and headings, uncertainty sources Output: Final coordinates Estimate of error
Tools – Spatial Data Diva-GIS Basic GIS software Free Biologically oriented Vector/Raster Compatible with many formats
Tools – Spatial Data Quantum GIS More complex Free Compatible with Spatial Geodatabases and geographical web services Vector/Raster Community development
Tools – Spatial Data The R-project Environment and language for statistical computing. Allows analysis and drawing. Plays nicely with others: GIS, scripting languages
Tools – Tabular Data
Tools – Tabular Data Name Parser Atomize Sci. Names 3 parted name Ignores varieties for subspecies
Tools – Tabular Data Name Parser
Tools – Tabular Data Name Finder Find scientific names Inputs: File Url Free text
Tools – Tabular Data Name Finder Returns: Count Names as found Recognized form
Tools – Tabular Data Taxon Tagger Input: Url PDF file
Tools – Tabular Data Taxon Tagger Output: Scientific Names Higher Taxonomy Hightlights names
Tools – Tabular Data Checklist Bank (Development Prototype) Input: Scientific Names Output: Higher taxonomy
Tools – Tabular Data Darwin Test GBIF France Validate and check records Darwin Core Archive format Coordinate conversions Check names against databases Detect encoding issues Generalize sensitive data (coords) Access based
Tools – Tabular Data Scripting/dynamic languages This is geocoding With 10 more lines, you can achieve very specific, complex and custom results
Tools – Tabular Data Google Refine DEMO
Other Resources - Data Sources
Data sources OpenStreetMap Data quality? Free Editable Downloadable Community based
Data sources Thesauri Thematic checklists: o Fish : FishbaseFishbase o Animals in general: Index to Organism Name (ION)Index to Organism Name (ION) o Mammals: Mammal Species of the World (MSW)Mammal Species of the World (MSW) o Bacteria: List of Bacteria with Standing in Nomenclature (LBSN)List of Bacteria with Standing in Nomenclature (LBSN) Country codes o ISO or ISO3166-2, available for example in Access format (
Other Resources - Documents
Documents GBIF Position Paper on Future Directions and Recommendations for Enhancing Fitness-for-Use Across the GBIF Network
Documents GBIF Spain BDQ Inventory
Documents GBIF Online Resource Centre Browse and download Fitness for use Best practices Training Manuals
Thank you. Questions?