BioData a new bioassessment database for the USGS Briefing for the CDI 2011.06.08

Slides:



Advertisements
Similar presentations
Step 1: Valley Segment Classification Our first step will be to assign environmental parameters to stream valley segments using a series of GIS tools developed.
Advertisements

Virtualizing Entomology Collection Student: Di Wang (Alan) Sponsors: John Marris: Curator, Entomology Research Museum Stuart Charters: Department of Applied.
Lec 12: Rapid Bioassessment Protocols (RBP’s)
Final stuff: n Lab practical –Coleoptera, Hemiptera n Final exam: Fri May 2:15 –Assessment with Invertebrates n Lecture material (IDEM protocol) n.
1 Web Services USGS/EPA Collaboration November 27, 2007 Dwane Young, U.S. EPA Nate Booth, USGS.
Managing Data & Information with the SECN Decision Support System GWS Meeting New Orleans, LA March 14-18, 2011.
Watershed Watch Network NJ Department of Environmental Protection Danielle Donkersloot Volunteer Monitoring Coordinator.
U.S. Department of the Interior U.S. Geological Survey CDI Webinar Series 2013 Mobile Framework Lorna Schmid, AEI Tim Kern, Fort Collins Science Center.
U.S. Department of the Interior U.S. Geological Survey U.S. National Water Census “Cyber – Platform” Update Progress and challenges to overcome in realizing.
Data for Water Resource Management Module 14, part A – Data types and sources.
REDCap Overview Institute for Clinical and Translational Science Heath Davis Fred McClurg Brian Finley.
This presentation will guide you though the initial stages of installation, through to producing your first report Click your mouse to advance the presentation.
NWIS Data Pulls for National- and Regional-Scale Applications Kathy Smith (Crustal Geophysics and Geochemistry) and Nancy Bauch (Colorado Water Science.
What’s Important Is Information … and We Have Specimens, Too! Neftali Camacho and Darolyn Striley Natural History Museum of Los Angeles County We use databases.
U.S. Department of the Interior U.S. Geological Survey CDI Webinar Sept. 5, 2012 David Maltby and Andrea Ostroff September 5, 2012 Fish Passage.
U.S. Department of the Interior U.S. Geological Survey Biodiversity Information Serving Our Nation (BISON): A National Resource for Species Occurrence.
U.S. Department of the Interior U.S. Geological Survey BioData
© 2013 National Ecological Observatory Network, Inc. ALL RIGHTS RESERVED. THE NEON APPROACH TO DATA INGEST, CURATION, AND SHARING Christine Laney (Data.
The Natural Resources Digital Library Needs, Partners, and Challenges Bonnie Avery, Janine Salwasser, & Janet Webster Oregon State University.
More than you probably wanted to know about NWIS and NWISWeb U.S. Department of the Interior, U.S. Geological Survey Kenneth J. Lanfear USGS.
Roger Miller, Arkansas Department of Environmental Quality Barry Jackson, USGS Arkansas Water Science Center ARKANSAS EXCHANGE NETWORK FOR GROUNDWATER-QUALITY.
Managing Monitoring Data from Many Sources A New Hampshire Experience Deb Soule Watershed Management Bureau New Hampshire Department of Environmental Services.
Introduction to OBIS-USA Biological Data, Applications, & Relationships March 14, 2011.
TECHNICAL DOCUMENTATIONPARTNERS DOWNLOAD DATA Download water quality data in MS Excel, CSV, TSV, and KML formats. Learn how to use the portal and data.
M ETADATA OF NATIONAL STATISTICAL OFFICES B ELARUS, R USSIA AND K AZAKHSTAN Miroslava Brchanova, Moscow, October, 2014.
Scott Ruddick Director, Integrated Support Services MEDA.
SWWG PROJECT OVERVIEW Semantic Technologies for Integrating USGS Data.
U.S. Department of the Interior U.S. Geological Survey CDI Webinar Series 2013 Metadata Wizard: An Easy-to-Use Tool for Creating FGDC-CSDGM Metadata in.
U.S. Department of the Interior U.S. Geological Survey CDI Webinar Sept. 5, 2012 Kevin T. Gallagher and Linda C. Gundersen September 5, 2012 CDI Science.
Introduction to Apache OODT Yang Li Mar 9, What is OODT Object Oriented Data Technology Science data management Archiving Systems that span scientific.
U.S. Department of the Interior U.S. Geological Survey Management of Oceanographic time-series data at the Woods Hole Coastal and Marine Science Center.
Water Quality Data, Maps, and Graphs Over the Web · Chemical concentrations in water, sediment, and aquatic organism tissues.
Address Maps and Apps for State and Local Governments
The SharePoint Shepherd’s Course for End Users Based on the book by Robert L. Bogue Copyright 2011 AvailTek LLC All Rights Reserved.
EASI a free web database application for collecting and managing monitoring records.
U.S. Department of the Interior U.S. Geological Survey CDI Webinar Sept. 5, 2012 Janice Gordon September 5, 2012 Semantic Technologies for Integrating.
Data Management BIRN supports data intensive activities including: – Imaging, Microscopy, Genomics, Time Series, Analytics and more… BIRN utilities scale:
U.S. Department of the Interior U.S. Geological Survey North American Encyclopedia of Life Web-based resource to enable federal data usage, integration,
ArcGIS Data Reviewer: An Introduction
Final stuff: n Lab practical: Apr 29 n Final exam: due Fri May 2:15.
Using STORET Data to Characterize Your Watershed 1 Webcast on June 21, 2007 Randy E. Hill IT Project Manager, EPA Monitoring Branch Dwane Young IT Specialist,
IPortal Bringing your company and your business partners together through customized WEB-based portal software. SanSueB Software Presents iPortal.
U.S. Department of the Interior U.S. Geological Survey USGS Water Data Exchange Services USGS Office of Water Information June 2009 Nate Booth, Dave Briar.
U.S. Department of the Interior U.S. Geological Survey CDI Webinar Series 2013 Data Management at the National Climate Change and Wildlife Science Center.
REDCap Overview Institute for Clinical and Translational Science Heath Davis Fred McClurg Brian Finley.
Presented by Scientific Annotation Middleware Software infrastructure to support rich scientific records and the processes that produce them Jens Schwidder.
The Global Land Cover Facility is sponsored by NASA and the University of Maryland.The GLCF is a founding member of the Federation of Earth Science Information.
An introduction to data exchange protocols in TDWG Renato De Giovanni TDWG 2008.
U.S. Department of the Interior U.S. Geological Survey The Biological Data Profile Extending the FGDC Metadata Standard Kirsten Larsen.
26/05/2005 Research Infrastructures - 'eInfrastructure: Grid initiatives‘ FP INFRASTRUCTURES-71 DIMMI Project a DI gital M ulti M edia I nfrastructure.
U.S. Department of the Interior U.S. Geological Survey The National Map West Virginia Geographic Names in May 12, 2004.
Assembling Biological Inventories for Analysis Robert J. Meese, Ph.D. University of California, Davis (530) Presented by Andrea.
U.S. Department of the Interior U.S. Geological Survey Decision Support Tools and USGS Data Management Best Practices Cassandra Ladino USGS Chesapeake.
NATIONAL TREASURES DATA PRESERVATION WITH METADATA Sharon Shin Metadata Coordinator Federal Geographic Data Committee Secretariat ASPRS-Reno 2006.
U.S. Environmental Protection Agency Central Data Exchange Pilot Project Promoting Geospatial Data Exchange Between EPA and State Partners. April 25, 2007.
The SharePoint Shepherd’s Course for End Users Based on the book by Robert L. Bogue Copyright 2011 AvailTek LLC All Rights Reserved.
Aquatic GAP program in Kansas Keith Gido, Walter Dodds, Chris Guy, Jessica Kemp, and Bob Oakes Kansas State University The Gap Analysis Program
SUSTAINING ENVIRONMENTAL CAPITAL (SEC) INITIATIVE Providing resources for applying ecosystem services in public land & water management.
The Earth Information Exchange. Portal Structure Portal Functions/Capabilities Portal Content ESIP Portal and Geospatial One-Stop ESIP Portal and NOAA.
U.S. Department of the Interior U.S. Geological Survey Manage and Provide Information: Examples from fish health, contaminants, and water quality data.
U.S. Department of the Interior U.S. Geological Survey Stewardship of the National Hydrography Dataset Elizabeth McCartney National Geospatial Technical.
1 Web Services USGS/EPA Collaboration February 21, 2008 Dwane Young, U.S. EPA; Jon Scott, USGS; Dorinda Gellenbeck, USGS; Nate Booth, USGS.
Water-Use Open Forum Please put your phone on mute until the end of the presentation.
Cyberinfrastructure Overview of Demos Townsville, AU 28 – 31 March 2006 CREON/GLEON.
The Virtual Observatory and Ecological Informatics System (VOEIS): Using RESTful architecture and an extensible data model to provide a unique data management.
6/13/2016 U.S. Environmental Protection Agency 1 Starting a Facilities Flow Lee David
Google Apps for Education Account Overview for Staff.
Developing our Metadata: Technical Considerations & Approach Ray Plante NIST 4/14/16 NMI Registry Workshop BIPM, Paris 1 …don’t worry ;-) or How we concentrate.
Flanders Marine Institute (VLIZ)
Electronic Data Exchange and Evaluation System
Presentation transcript:

BioData a new bioassessment database for the USGS Briefing for the CDI

Today  What is BioData?  Why Did We Build It?  Current Capabilities  Future Possibilities  Data Integration/Interoperability Challenges

What is BioData? – in a nutshell A data management, storage, and distribution system for aquatic bioassessment data. data capture data curation data publication

Why We Built It - A Brief History  1992 – National Water-Quality Assessment Program (NAWQA) began collecting bioassessment data (macroinvert, fish, algae, stream habitat)

NAWQA Study Units

Why We Built It - A Brief History  1992 – National Water-Quality Assessment Program (NAWQA) begins collecting bioassessment data (macroinvert, fish, algae, stream habitat)  1992 – 1999: Local data management and national data aggregations  1999 – NAWQA national bioassessment database – (BioTDB)

WRD Needs Assessment (2006)  Surveyed WRD Science Centers to find out:  How much aquatic ecology data is being collected outside the NAWQA Program?  What kinds?  What methods?  Where and how are data being stored?

What We Discovered  Water collaborative projects with other agencies, states, localities, and partners are producing as much data as the NAWQA Program  80 % of WSC’s reported projects collecting aquatic ecology data  120 projects had a macroinvertebrate, fish, algae, or habitat component (2000 – 2005)  Approximately 15,000 samples  The majority of samples are being collected using NAWQA and USEPA national stream bioassessment protocols  Samples are being sent to a variety of taxonomic labs

What We Discovered  The data are stored electronically, but are very difficult to discover, access, and integrate  47% in Excel  13% are in EPA databases  19% in home-grown relational databases 79%

U.S. Department of the Interior U.S. Geological Survey BioData a new bioassessment database for the USGS briefing for the USGS GCMRC 5/9/2011

What Should We Do? 1. Do nothing? 2. Implement a federated system? 3. Incrementally refurbish existing NAWQA database? 4. Redesign and “re-build” using modern, web- enabled, extensible architecture? (BioData)

Biodata - Version 1 Objective A data storage, retrieval, and distribution system for aquatic bioassessment data most commonly produced by USGS WRD projects.

“Most Commonly Produced”  Project Objectives  Setting  Types of Data  Sampling Protocols  Bioassessment and monitoring  Streams and rivers  Macroinvertebrates  Fish  Algae  Study reach habitat  NAWQA  USEPA

Additional Characteristics  An internet application  Available to any USGS ecologist.  Designed to be adapted and extended  Support scientific workflow  Serve as an online data archive  Curate taxonomic nomenclature - map it forward and harmonize it across all the data  Support biologist lab data exchange  Readily add web data services

BioData Retrieval (DWH) project data management BioData Input data distribution field datalab data field data input data exchange with labs data review external data NAWQA legacy data public web site web data services application- specific output

Data Retrieval Features  Real-time feedback on how many samples your query will return  Save the query to your desktop – then to friends for them to run  Variety of file formats  Multiple data sets downloaded in one step

Data Retrieval Demo 

BioData Retrieval (DWH) project data management BioData Input data distribution field datalab data field data input data exchange with labs data review external data NAWQA legacy data public web site web data services application- specific output

Data Input/Management Features  Retrieve restricted (unreleased) data  Manage and organize data by project  Project control over rights to enter and edit data  Built in help and data validation checks  Auto-saving  Data entry screens tailored to field sheets  Send electronic orders to labs

Data Input/Mgt Demo

Data integration – touchpoints  First challenge – find the data  Second challenge - compatible methods?

Data integration – touchpoints  First challenge – find the data  Second challenge - compatible methods?  Third challenge – get the data  We need to pick a data exchange standard

Data integration – touchpoints  First challenge – find the data  Second challenge - compatible methods?  Third challenge – get the data  Fourth challenge – harmonize taxonomy  Does “Thienemannimyia group” = “Thienemannimyia gr.” ??  Does ITIS solve this?

ITIS

 Only handles published names  We have to handle unpublished names  Provisional = new taxon claimed but not “officially” published  Conditional = uncertain or indeterminate identification, e.g. “Thienemannimyia group”  ITIS is not complete for all groups  Fish – good, we can integrate tightly with it  Macroinvertebrates – doable  Algae – ITIS not ready yet

Data integration – touchpoints  First challenge – find the data  Second challenge - compatible methods?  Third challenge – get the data  Fourth challenge – harmonize taxonomy  Does “Thienemannimyia group” = “Thienemannimyia gr.” ??  Fifth challenge – integrate with physio- chemical and ancillary data  Common geospatial framework would help

NHD  Which NHD?  NHD “snap to” service with API’s that developers could use in their application(s)?  Service to translate NHD address to other versions of NHD (and future)

BioData For more information contact: Pete Ruhl

U.S. Department of the Interior U.S. Geological Survey BioData a new bioassessment database for the USGS briefing for the USGS GCMRC 5/9/2011

NAWQA BioTDB Database  NAWQA data from present  2,294 sites  21,689 samples  6,715 macroinvertebrate community samples  2,819 fish community samples  8,749 algae community samples  2,819 reach habitat assessments  > 1,200,000 specimen records