A Provisional Observational Data Standard to Facilitate Data Sharing and Aggregation Lynn Kutner, Bruce Stein, and Donna Reynolds TDWG Annual Meeting,

Slides:



Advertisements
Similar presentations
GEOSS Data Sharing Principles. GEOSS 10-Year Implementation Plan 5.4 Data Sharing The societal benefits of Earth observations cannot be achieved without.
Advertisements

V Alyssa Rosemartin 1, Lee Marsh 1, Ellen Denny 1, Bruce Wilson USA National Phenology Network, Tucson, AZ; 2 - Oak Ridge National Laboratory, Oak.
GUID-1 Workshop Welcome and Introduction Donald Hobern GBIF Program Officer for Data Access and Database Interoperability February 2006.
C6 Databases.
Monarch Larva Monitoring Project Goals and Roles.
Spatial Data Infrastructure: Concepts and Components Geog 458: Map Sources and Errors March 6, 2006.
Presentation at WebEx Meeting June 15,  Context  Challenge  Anticipated Outcomes  Framework  Timeline & Guidance  Comment and Questions.
Entomological Collections Network Meeting, Indianapolis, IN 13 December 2009 Darwin Core Ratified in the Year of Darwin Gail E. Kampmeier Illinois Natural.
Maximizing Data and Data Services Monday, October 14, 2013 Location: Denver CO© 2013 Child Care Aware ® of America.
Overview of OASIS SOA Reference Architecture Foundation (SOA-RAF)
NBII and the FGDC CAP Grant: Building Metadata Partnerships Vivian Hutchison Metadata Program Coordinator NBII/USGS.
Meta Dater Metadata Management and Production System for surveys in Empirical Socio-economic Research A Project funded by EU under the 5 th Framework Programme.
Lecture Fourteen Methodology - Conceptual Database Design
Data Sources & Using VIVO Data Visualizing Scholarship VIVO provides network analysis and visualization tools to maximize the benefits afforded by the.
Methodology Conceptual Database Design
BIS310: Week 7 BIS310: Structured Analysis and Design Data Modeling and Database Design.
NOAA Metadata Update Ted Habermann. NOAA EDMC Documentation Directive This Procedural Directive establishes 1) a metadata content standard (International.
Collection Level Description: The RASCAL Experience A Case Study Clare McVeigh RASCAL Project Manager 8 February 2002.
MEDIN Data Guidelines. Data Guidelines Documents with tables and Excel versions of tables which are organised on a thematic basis which consider the actual.
Observational Data Standard, v 1.0 Lynn Kutner, Donna Reynolds, Jennifer Nichols, Kristin Barker, Kristin Snow April 2007 A Provisional Standard to Facilitate.
Controlled Vocabularies (Term Lists). Controlled Vocabs Literally - A list of terms to choose from Aim is to promote the use of common vocabularies so.
Jake F. Weltzin United States Geological Survey Mark D. Schwartz University of Wisconsin - Milwaukee The RCN & the USA-NPN Founding & Current.
Introduction to OBIS-USA Biological Data, Applications, & Relationships March 14, 2011.
A Metadata Catalog Service for Data Intensive Applications Presented by Chin-Yi Tsai.
Introduction to XML. XML - Connectivity is Key Need for customized page layout – e.g. filter to display only recent data Downloadable product comparisons.
Methodology - Conceptual Database Design Transparencies
Methodology Conceptual Databases Design
1 Chapter 15 Methodology Conceptual Databases Design Transparencies Last Updated: April 2011 By M. Arief
The National Park Service's Information Management Strategy, Infrastructure, and Software Applications.
High Level Architecture Overview and Rules Thanks to: Dr. Judith Dahmann, and others from: Defense Modeling and Simulation Office phone: (703)
[] Where Did Those GBIF Occurrences Come From? Providing Digital Access to NatureServe's Reference Database: Report on a Project in the Early Stages of.
Metadata and Geographical Information Systems Adrian Moss KINDS project, Manchester Metropolitan University, UK
12/04/07FGDC Vegetation Subcommittee Briefing Federal Geographic Data Committee Vegetation Subcommittee Briefing for the FGDC Coordination Group December.
1 Department of Medical Assistance Services Stakeholder Advisory Committee June 25, 2014 Gerald A. Craver, PhD
Inventory and Monitoring Terrestrial Fauna Inventory and Monitoring Terrestrial Fauna Linking Field Activities to Budget Processes.
Metadata Lessons Learned Katy Ginger Digital Learning Sciences University Corporation for Atmospheric Research (UCAR)
Mobile Observation System for Handheld Field Data Collection Lori Scott, Dave Hauver TDWG Annual Conference September 27, 2010 Woods Hole, MA ] [
Digitization of Natural History Collections (DIGIT) Larry Speers Program Officer Digitization of Natural History Collections Data TDWG Annual Meeting Oct.
Methodology - Conceptual Database Design. 2 Design Methodology u Structured approach that uses procedures, techniques, tools, and documentation aids to.
Characterization, Inventory and Monitoring of trends in indigenous livestock Dr. E. D. Ilatsia D. N. Kamiti 23-Oct-15Animal Breeding and Genomics Group1.
C6 Databases. 2 Traditional file environment Data Redundancy and Inconsistency: –Data redundancy: The presence of duplicate data in multiple data files.
Methodology - Conceptual Database Design
Definition of an Observation In general, an observation represents the measurement of some attribute, of some thing, at a particular time and place. Observations.
Joint Canada-Mexico-USA (North American*) Carbon Program Planning Meeting January 25–26, 2007 *By North America we mean the North American land, adjacent.
Pilot Projects on Strengthening Inventory Development and Risk Management-Decision Making for Mercury: A Contribution to the Global Mercury Partnership.
TAPIR 1.0 Renato De Giovanni, Markus Döring, Javier de la Torre October 2006.
IFS310: Module 6 3/1/2007 Data Modeling and Entity-Relationship Diagrams.
Shaping a Health Statistics Vision for the 21 st Century 2002 NCHS Data Users Conference 16 July 2002 Daniel J. Friedman, PhD Massachusetts Department.
Jake F. Weltzin United States Geological Survey USA National Phenology Network Integrating phenology data across spatial and temporal scales.
1 Understanding Cataloging with DLESE Metadata Karon Kelly Katy Ginger Holly Devaul
1 NOAA Priorities for an Ecosystem Approach to Management A Presentation to the NOAA Science Advisory Board John H. Dunnigan NOAA Ecosystem Goal Team Lead.
Metadata and Meta tag. What is metadata? What does metadata do? Metadata schemes What is meta tag? Meta tag example Table of Content.
NATIONAL TREASURES DATA PRESERVATION WITH METADATA Sharon Shin Metadata Coordinator Federal Geographic Data Committee Secretariat ASPRS-Reno 2006.
Efforts in Science and Technology to Address Data Management Needs A Network Connecting Science With Conservation January 31, 2006 Jennifer Swenson, Carmen.
Riccardi: DIALOGUE Workshop August 1, 2005 Supported by NSF BDI 1 Representing and Using Phylogenetic Characters in Morphbank Greg Riccardi, David Gaitros,
Statistical Data and Metadata Exchange SDMX Metadata Common Vocabulary Status of project and issues ( ) Marco Pellegrino Eurostat
Global Change Master Directory (GCMD) Mission “To assist the scientific community in the discovery of Earth science data, related services, and ancillary.
Middle Fork Project Relicensing Process Plan April 25, 2006.
Documenting (Driving) Progress in Immunization Injection Safety Philippe Duclos, WHO Presented at the SIGN annual meeting, Cairo, Egypt, Oct th,
NVS New Zealand National Vegetation Survey. What is NVS? NVS (National Vegetation Survey) – New Zealand’s largest archive facility for plot-based vegetation.
Session 2: Developing a Comprehensive M&E Work Plan.
IPT + Darwin Core OBIS XML Schema OBIS Database Schema Explained Mike Flavell OBIS Data Manager OBIS Nodes Training Course, Oostende, Belgium, 6 May 2014.
ESA UNCLASSIFIED – For Official Use INSPIRE Orthoimagery TWG Status Report Antonio Romeo ESRIN 15/02/2012.
A Framework for Assessing Needs Across Multiple States, Stakeholders, and Topic Areas Stephanie Wilkerson & Mary Styers REL Appalachia American Evaluation.
Lecture 5 Data Model Design Jeffery S. Horsburgh Hydroinformatics Fall 2012 This work was funded by National Science Foundation Grant EPS
Understanding the Value and Importance of Proper Data Documentation 5-1 At the conclusion of this module the participant will be able to List the seven.
RCN Development of an Online Database to Enhance the Conservation of SGCN Invertebrates in the Northeastern Region James W. Fetzner Jr. & John.
Template library tool and Kestrel training
Challenge Grant Update
Palestinian Central Bureau of Statistics
Presentation transcript:

A Provisional Observational Data Standard to Facilitate Data Sharing and Aggregation Lynn Kutner, Bruce Stein, and Donna Reynolds TDWG Annual Meeting, St. Louis, Missouri, USA October 2006

Topics  Context and definitions  Development of the standard  challenges and goals  process  requirements  Overview of the standard  conceptual data model  required and core fields

Who We Are:  A nonprofit organization providing the scientific information and technology to guide effective conservation action  A network of 80 natural heritage programs in the United States, Canada, and Latin America, supported by NatureServe’s staff of science, information technology, and conservation professionals. What We Do:  Collect, manage, and disseminate detailed local information on species and ecosystems, and develop products, tools, and conservation services to help meet conservation needs at all geographic levels NatureServe

Data collection and recording Data management and reporting Conservation expertise and analysis Information access and interoperability Decision support Scientific standards and methods Conservation Decisions Standards: The Foundation NatureServe is a cosponsor of the TDWG Observational Data Subgroup.

Newest team member: Sam, with mom Lynn

Context and Definitions

Characteristics of an Observation  Fundamental unit of scientific inventory  Can be documented by various types of evidence, including voucher specimens, photographs, recordings, or sight records  Ancillary data may include abundance, distribution, reproductive status, phenology ecological associations, environmental conditions, and much more  Can be linked to other observations by common factors such as time, place, and protocol – the framework for monitoring

What is an Observation? An observation characterizes evidence for the presence or absence of an organism or set of organisms through a data collection event at a location.

What is Monitoring? Monitoring is an activity that results in a set of observational data gathered during repeated visits over a period of time using consistent or comparable methodology, for the purpose of detecting and documenting changes and trends.

Development of the Standard

Primary Challenge The Observational Data Standard must encompasses data sets that differ widely in content and quality  discipline-specific data  historical data  positive and negative (absence) data  data on single species as well as ecological communities

Goal and Purpose  Goal: Collaboratively identify a core set of concepts that are broadly applicable regardless of data or survey type  Purpose: Facilitate data sharing and aggregation within the conservation community, including data discovery through global search portals.

Process  Project began in mid-2005 with funding from Gordon and Betty Moore Foundation  Comparison of attributes among 16 observation databases to identify breadth and commonalities  Observation Working Group—multi-institutional composition; formed in March 2005 and held two-day workshop in June 2005  Identified and debated key issues  Drafted set of core concepts and requirements  Production of drafts for two rounds of review (April and June 2006), with subsequent incorporation of comments  Version 1.0 presented September 2006

Core Observation Concepts  What – attributes that identify the observed element  Where - location attributes  When - observation / collection event date / time  Who - observer / collector of the information  How - survey methods / effort / protocol  Evidence – documentation attributes  Biology - associated biological attributes  Environment - associated environmental attributes  Data management attributes – data sensitivity, quality control, etc.

Requirements The Observational Data Standard shall:  Focus on “core” attributes, including a subset of required fields.  Incorporate and be compatible with existing standards.  Apply to data from a variety of survey protocols.  Accommodate monitoring data.  Allow documentation of locations where a species was not observed (absence data).  Accommodate species and community data.  Provide ability to add to core attributes for specific groups of users (extensibility).  Allow users to record a list of specify on a plot.  Include attributes indicating data quality and validation.  Be independent of a particular software implementation.

Content of the Observational Data Standard

Major Entities  Observation – Contains the required information that defines an observation and optional biological and environmental attributes  Observation Grouping - A set of observations grouped according to some common criteria  Protocol – The plan or procedure used to collect data; may link to internal or external references.  Survey - Coordinated effort to gather information; multiple observations  Project - May link related surveys  Search Area – May link to single observation or survey  External Documentation – Pointer to information that is managed in a different system

Major Entity Relationships

Observation Cardinalities  An Observation may be related to zero or one Survey.  An Observation may be related to zero or one Protocol.  An Observation may be related to zero or one GIS Shape.  An Observation may be related to zero, one, or many External Documentations.  An Observation may be related to zero or one Element List.  An Observation may be related to zero, one, or many Observation Grouping.  An Observation may be related to zero or one Search Area.

Minimum Requirements for an Observation Record  Formal name (scientific name) Calypte anna  Where found (GIS shape, coordinates, or text) New Mexico: Eddy County, 3 mi S of Artesia along Rt. 285  Date (to year(s), at least) 30 November 1978  Observer(s) / collector(s), or “Unknown” D.J. Davis

Ranking of Attributes  Required – cannot be null, although data may be incomplete  Core – important information but not required (may not be available or applicable)  Priority Core – a subset of core attributes that should always be filled in if data are available  Additional – supplementary data

Identification Attributes

Observation Grouping  Name or label for observation grouping  Criteria - the common characteristic or other criteria used to group the observations (same element, same location, or any other criteria)  Owner - the person who created the observation grouping  Monitoring comments – changes over time and trends within the grouping

Species List  For community plot data  List of the species found and data such as stratum and abundance (e.g., percent cover or number)  Do not have to make an observation record for each species  May make an observation record for each species

Documentation  Every entity can link to internal or external reference  Link to materials via IDs in another data system  Specimens  Protocol descriptions  Project details

Next Steps  Standard will be utilized in current and future software development projects at NatureServe:  Kestrel Project – a web-enabled implementation of observations database being developed for Parks Canada; includes proof of concept of discipline- specific extensions  Handheld field data collection tool - funding received from NSF  Development of XML schema and mapping to existing TDWG standards

Questions / Discussion