


The VegBank Data Model

Biodiversity data structure. [Diagram] Databases: Taxonomic database; Plot/Inventory database; Occurrence database; Vegetation type database. Entities: Plot; Observation/Collection Event; Specimen or Object; Bio-Taxon; Locality; Vegetation Type.

Core elements of VegBank: Plot Observation; Taxon Observation; Taxon Interpretation; Plot Interpretation; Taxon Assignment; Plot Assignment.

VegBank consists of three integrated databases: 1. The Plot Database 2. The Plant Database 3. The Community Database

Taxonomic database challenge: Standardizing organisms and communities. The problem: Integration of data potentially representing different times, places, investigators and taxonomic standards. The traditional solution: A standard checklist of organisms.

Standard checklists for taxa. Representative examples for higher plants in North America / US: USDA PLANTS; ITIS; NatureServe; BONAP; Flora North America. These are intended to be checklists wherein the taxa recognized perfectly partition all plants. The lists can be dynamic.

Most taxon checklists fail to allow effective dataset integration. The reasons include: the user cannot reconstruct the database as viewed at an arbitrary time in the past; taxonomic concepts are not defined (just lists); multiple party perspectives on taxonomic concepts and names cannot be supported or reconciled.

Multiple concepts of Rhynchospora plumosa s.l. [Figure: treatments by Elliott 1816, Gray 1834, Chapman 1860, Kral 2003, and Peet 2004?; names shown: R. plumosa; R. plumosa v. intermedia; R. plumosa v. plumosa; R. intermedia; R. plumosa v. interrupta; R. pineticola; R. sp. 1; R. plumosa v. pineticola]

Taxonomic theory. [Diagram: Name, Reference, Concept] A taxon concept represents a unique combination of a name and a reference. “Taxon concept” is roughly equivalent to “Potential taxon” and “assertion”.
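The name-plus-reference idea can be sketched in a few lines of Python. This is only an illustration, not VegBank's schema: the class name `TaxonConcept`, its fields, and the `label` helper are assumptions made for the example; the two concepts are the shagbark hickory treatments named elsewhere in this talk.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TaxonConcept:
    """A taxon concept: one name as used in one reference (hypothetical class)."""
    name: str       # scientific name string
    reference: str  # publication that circumscribes the taxon

    def label(self) -> str:
        # "sec." (secundum) reads as "in the sense of"
        return f"{self.name} sec. {self.reference}"


gleason = TaxonConcept("Carya ovata", "Gleason 1952")
radford = TaxonConcept("Carya ovata", "Radford 1968")

# Same name, different references: two distinct concepts.
print(gleason.label())
print(gleason == radford)
```

Because equality covers both fields, two records with the same name but different references never collapse into one, which is the point of labeling organisms by concept rather than by name alone.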

Usage. [Diagram: Name, Concept, Usage] A usage represents an association of a concept with a name. Usage does not appear in the IOPI model, but instead is a special case of concept. Usage can be used to apply multiple name systems to a concept. Desirable for stability in recognized concepts.
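A usage can be pictured as a small link record between a concept and a name. The sketch below is hypothetical: the class names, the `names_for` helper, and the name-system labels "scientific" and "common" are assumptions for illustration; the concept and names are taken from the shagbark hickory slides in this talk.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Concept:
    name: str
    reference: str


@dataclass(frozen=True)
class Usage:
    """Associates one concept with one name under one name system."""
    concept: Concept
    name: str
    name_system: str  # label values here are assumptions, not VegBank vocabulary


def names_for(concept, usages, name_system):
    """Return every name applied to `concept` under the given name system."""
    return [u.name for u in usages
            if u.concept == concept and u.name_system == name_system]


southern = Concept("Carya ovata v. australis", "FNA 1997")
usages = [
    Usage(southern, "Carya carolinae-septentrionalis", "scientific"),
    Usage(southern, "Southern shagbark", "common"),
]

print(names_for(southern, usages, "scientific"))
print(names_for(southern, usages, "common"))
```

The concept record itself never changes when a party prefers a different name for it; only the usage rows do.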

Three concepts of shagbark hickory. sec. Gleason 1952: Carya ovata (Miller) K. Koch. sec. Radford et al.: Carya ovata (Miller) K. Koch; Carya carolinae-septentrionalis (Ashe) Engler & Graebner. Splitting one species into two illustrates the ambiguity often associated with scientific names.

Six shagbark hickory concepts (possible synonyms are listed together).
Names: Carya ovata; Carya carolinae-septentrionalis; Carya ovata v. ovata; Carya ovata v. australis.
Concepts:
One shagbark: C. ovata sec Gleason ’52; C. ovata sec FNA ’97.
Southern shagbark: C. carolinae-s. sec Radford ’68; C. ovata v. australis sec FNA ’97.
Northern shagbark: C. ovata sec Radford ’68; C. ovata (v. ovata) sec FNA ’97.
References: Gleason, Britton & Brown; Radford et al., Flora Carolinas; Stone, Flora North America.

VegBank taxonomic data model: single party, dynamic perspective. [Diagram of data relationships: Name; Concept (Start, Stop, ConceptStatus, Level, Parent); Usage (Start, Stop, NameStatus, Name system); Reference]

Party Perspective. The Party Perspective on a concept includes: Status (Standard, Nonstandard, Undetermined); Correlation with other concepts (Equal, Greater, Lesser, Overlap, Undetermined); Lineage (Predecessor and Successor concepts); Start & Stop dates for tracking changes.
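The elements of a party perspective map naturally onto a few small record types. The following is a minimal sketch under stated assumptions: the class and field names are invented for illustration, dates are reduced to years, the stop year is treated as exclusive, and lineage is left out for brevity. The 1996 and 2000 years follow the ITIS example in this talk; the "Greater" correlation between the Gleason and Radford concepts is an illustrative assumption, not a recorded ITIS judgment.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class ConceptStatus(Enum):
    STANDARD = "Standard"
    NONSTANDARD = "Nonstandard"
    UNDETERMINED = "Undetermined"


class Correlation(Enum):
    EQUAL = "Equal"
    GREATER = "Greater"
    LESSER = "Lesser"
    OVERLAP = "Overlap"
    UNDETERMINED = "Undetermined"


@dataclass(frozen=True)
class PartyStatus:
    """One party's status for one concept over a time interval (hypothetical)."""
    party: str
    concept: str
    status: ConceptStatus
    start: int                  # year the view took effect
    stop: Optional[int] = None  # None means the view is still current

    def in_force(self, year: int) -> bool:
        # Assumption: stop is exclusive, so a view stopped in 2000 no longer holds in 2000.
        return self.start <= year and (self.stop is None or year < self.stop)


@dataclass(frozen=True)
class PartyCorrelation:
    """One party's statement of how `concept` relates to `other` (hypothetical)."""
    party: str
    concept: str
    other: str
    correlation: Correlation


itis_view = PartyStatus("ITIS", "Carya ovata sec Radford 1968",
                        ConceptStatus.STANDARD, start=1996, stop=2000)
# Illustrative assumption: the one-species concept is broader than the split one.
link = PartyCorrelation("ITIS", "Carya ovata sec Gleason 1952",
                        "Carya ovata sec Radford 1968", Correlation.GREATER)

print(itis_view.in_force(1998), itis_view.in_force(2000))
```

Closing a view with a stop date instead of overwriting it is what keeps older perspectives recoverable.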

Status and usage: application of Party Perspective.
Parties: ITIS; FNA Committee; NatureServe.
Concepts: Carya ovata sec Gleason 1952; Carya ovata sec FNA 1997; Carya ovata sec Radford 1968; Carya carolinae sec Radford 1968; Carya ovata (ovata) sec FNA 1997; Carya ovata australis sec FNA 1997.

Party  Concept             PartyConceptStatus  Start  Usage: SciName
ITIS   ovata – G52         NS                  1996
ITIS   ovata – R68         St                  1996   C. ovata
ITIS   carolinae-s – R68   St                  1996   C. carolinae-sept.
ITIS   carolinae-s – R68   NS                  2000
ITIS   ovata aust – FNA    St                  2000   C. carolinae-sept.
ITIS   ovata – R68         NS                  2000
ITIS   ovata ovata – FNA   St                  2000   C. ovata
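The status rows on this slide are enough to reconstruct a party's view as of any year, which is the capability plain checklists lack. The sketch below is not VegBank code: the function name and row layout are hypothetical, "St" and "NS" are read as Standard and Nonstandard, and a later row for the same concept is assumed to supersede an earlier one.

```python
# (party, concept, status, start year) rows transcribed from the slide's table.
ROWS = [
    ("ITIS", "ovata - G52", "NS", 1996),
    ("ITIS", "ovata - R68", "St", 1996),
    ("ITIS", "carolinae-s - R68", "St", 1996),
    ("ITIS", "carolinae-s - R68", "NS", 2000),
    ("ITIS", "ovata aust - FNA", "St", 2000),
    ("ITIS", "ovata - R68", "NS", 2000),
    ("ITIS", "ovata ovata - FNA", "St", 2000),
]


def standard_concepts(rows, party, year):
    """Concepts the party treated as standard in `year` (hypothetical helper)."""
    latest = {}
    # Walk rows oldest first so that later statuses overwrite earlier ones.
    for row_party, concept, status, start in sorted(rows, key=lambda r: r[3]):
        if row_party == party and start <= year:
            latest[concept] = status
    return sorted(c for c, s in latest.items() if s == "St")


print(standard_concepts(ROWS, "ITIS", 1998))
print(standard_concepts(ROWS, "ITIS", 2001))
```

Under these assumptions the 1998 view contains the two Radford concepts and the 2001 view contains the two FNA varieties, with no row ever deleted.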

VegBank taxonomic data model: multiple parties, dynamic perspectives. [Diagram of data relationships: Name; Concept; Party; Reference; Usage (Start, Stop, NameStatus, Name system); Status (Start, Stop, ConceptStatus, Level, Parent)]

VegBank taxonomic data model: with party correlations and lineages. [Diagram of data relationships: Name; Concept; Party; Reference; Usage (Start, Stop, NameStatus, Name system); Status (Start, Stop, ConceptStatus, Level, Parent); Correlation; Lineage]

Intended functionality: organisms are labeled by reference to concept (name-reference combination); party perspectives on concepts and names can be dynamic, but remain perfectly archived; the user can select which party perspective to follow; different name systems are supported; enhanced stability in recognized concepts by separating name assignment and rank from concept.

Plant Taxa Name (Reference) Concept Status Correlation Lineage Usage Party

State of Taxon Concept Development: 1. TDWG, IOPI, & SEEK 2. VegBank 3. Collaborators: NatureServe Biotics4; USDA PLANTS & ITIS

VegBank taxon data content. Prototype populated with USDA PLANTS lists and synonyms = weak concepts. Contract with NatureServe and John Kartesz: Develop reference-based concepts for by July 2004 of the ~32000 vascular plant taxa at species level and below; list of unambiguous taxa (~6000?); treatment of most ambiguous taxa; demonstration mapping to FNA; a few demonstration groups in depth.

Concept workbench. A concept workbench for both plant concepts and community concepts is planned.

The VegBank ERD Available at Click tables for data dictionary and constrained vocabulary

The data dictionary provides critical information such as field types, field definitions, and constrained vocabularies.

Example plot metadata Project attributes Plot parties Observation date Cover & stratum methods Plot selection Plot layout Site data Geographic data

Plot Place Named Place

Observation Project Disturbance Obs Soil Obs Graphic Observation Synonym Cover method

Taxon Observation Importance values Author name Taxon Interpretation Which taxon Who decided and why Stem or collective Voucher information

Stems & Strata Stratum method Stratum type Stratum Stratum comp. Taxon observ. Stem count Stem location

Interpretation continued Plants Taxon Interpretation Taxon Alt Communities Class Interpretation

Problematic taxa of ecological datasets: Carex sp.; Crustose lichen; Hairy sedge #6; Sporobolus sp. #1; Picea glauca – engelmannii complex; Potentilla simplex or P. canadensis; Carya ovata sec. Gleason 1952.

Party Project Contr. Obs Contr. Role Address Telephone


Utilities User defined Notes Revisions

Intellectual Property issues: Rare species; Private lands; Working datasets – not yet complete; Ongoing research; Citation; Annotation.

Connectivity & Collaboration: Loaders for popular plot databases; Data exchange standards for plots; Data exchange standards for taxa; Refresh activities among VegBank, Biotics, and ITIS/PLANTS; Distributed VegBank systems; Deep links into VegBank.

Possible VegBank nodes: US – ESA; New Zealand; Canada; Amazon collaboration; Europe; South Africa.

Tools for semantic mediation & data discovery: Science Environment for Ecological Knowledge. To improve how researchers can 1) gain global access to ecological data and information, 2) rapidly locate and utilize distributed computational services, and 3) capture, reproduce, and extend the analysis process itself.

The SEEK project: Standard data structures. Public data archives (deposit, withdraw, cite). Standard exchange formats. Standard protocols. Tools for semantic mediation & data discovery.