John Wieczorek (for BGWG) Museum of Vertebrate Zoology University of California, Berkeley BioGeomancer: Collaboration to Automation
Georeferencing Collaborations Automation
Georeferencing Collaborations Automation
What is a georeference?
A numerical description of a place that can be mapped. What is a georeference?
A numerical description of a place that can be mapped. What is a georeference? In other words…
IDSpeciesLocality 1Lynx rufusDawson Rd. N Whitehorse 2Pudu pudacerca de Valdivia 3Canis lupus20 mi NW Duluth 9Ursus arctosBear Flat, Haines Junction 4Felis concolorPichi Trafúl 5Lama alpacanear Cuzco 6Panthera leoSan Diego Zoo 7Sorex lyelliLyell Canyon, Yosemite 8Orcinus orca1 mi W San Juan Island What we have: Specimens with Textual Localities
What we want: Mapped, Georeferenced Localities
What is a georeference? A numerical description of a place that can be mapped.
“Davis, Yolo County, California” “point method” Coordinates: Horizontal Geodetic Datum: NAD27
What is an acceptable georeference? A numerical description of a place that can be mapped and that describes the spatial extent of a locality and its associated uncertainties.
“Davis, Yolo County, California” “bounding-box method” Coordinates: Horizontal Geodetic Datum: NAD27
“Davis, Yolo County, California” “point-radius method” Coordinates: Horizontal Geodetic Datum: NAD27 Maximum Uncertainty: 8325 m
What is an ideal georeference? A numerical description of a place that can be mapped and that describes the spatial extent of a locality and its associated uncertainties as well as possible.
“Davis, Yolo County, California” “shape method”
pointeasy to produce no data quality bounding-boxsimple spatial queries difficult quality assessment point-radiuseasy quality assessment difficult spatial queries shapecomplete representation complex Method Comparison
Georeferencing Collaborations Automation
Collaborative Distributed Databases for Vertebrates
Collaborations
MaNIS Localities Georeferenced n = 326k localities (1.4M specimens) r = 14 localities/hr (point-radius method) t = 3 yrs (~40 georeferencers)
Scope of the Problem for Natural History Collections
Scope of the Problem for Natural History Collections ~2.5 Giga-records
Scope of the Problem for Natural History Collections ~2.5 Giga-records ~6 records per locality* ~14 localities per hour* * based on the MaNIS Project
Scope of the Problem for Natural History Collections ~2.5 Giga-records ~6 records per locality* ~14 localities per hour* ~15,500 years * based on the MaNIS Project
Georeferencing Collaborations Automation
Combining the Best in Georeferencing GeoLocate DIVA-GIS MaNIS Georeferencing Calculator BioGeomancer Classic
Major Activities (expected Sep 2006): 1)Develop spatial data standards for biodiversity
Major Activities (expected Sep 2006): 1)Develop spatial data standards for biodiversity 2)Assemble relevant spatial data
Major Activities (expected Sep 2006): 1)Develop spatial data standards for biodiversity 2)Assemble relevant spatial data 3) Further develop existing desktop applications
Major Activities (expected Sep 2006): 1)Develop spatial data standards for biodiversity 2)Assemble relevant spatial data 3) Further develop existing desktop applications 4) Create workbench for automated georeferencing, editing, and validating
BioGeomancer Workbench Automated Georeferencing Visual Georeference Editing Georeference Validation Outlier Detection
BioGeomancer Workbench Automated Georeferencing Visual Georeference Editing Georeference Validation Outlier Detection
BioGeomancer Workbench: Automated Georeferencing Text Interpretation Feature Lookup Shape Generation
BioGeomancer Workbench: Automated Georeferencing Text Interpretation Feature Lookup Shape Generation
Automated Georeferencing: Text Interpretation “Between Albuquerque and Moriarity” Locality Type: BF (Between Features) Feature1: Albuquerque Feature2: Moriarity
BioGeomancer Workbench: Automated Georeferencing Text Interpretation Feature Lookup Shape Generation
Automated Georeferencing: Feature Lookup
BioGeomancer Workbench: Automated Georeferencing Text Interpretation Feature Lookup Shape Generation
Automated Georeferencing: Shape Generation Bounding-boxPoint-radius Shape
BioGeomancer Workbench Automated Georeferencing Visual Georeference Editing Georeference Validation Outlier Detection
BioGeomancer Workbench: Visual Georeference Editing
BioGeomancer Workbench Automated Georeferencing Visual Georeference Editing Georeference Validation Outlier Detection
BioGeomancer Workbench: Georeference Validation Elevation Geography Species Distribution
BioGeomancer Workbench Automated Georeferencing Visual Georeference Editing Georeference Validation Outlier Detection
BioGeomancer Workbench: Outlier Detection Elevation Environment Species Distribution Model
Beyond 1 September 2006 Seek Industry Support for Data Layers Develop Workbench into Collaboratories Develop Locality Interpretation in Additional Languages
Why do all of this? So that those who generate and care for the data can realize and share its potential.