U.S. Department of the Interior U.S. Geological Survey CDI Data Management Working Group December 12, 2011 Sally Holl, USGS Texas Water Science Center
EIM 2011: Take-home Points
EIM is a forum for exchanging ideas about the management of environmental data
Discussion topics parallel the issues addressed by the USGS Community for Data Integration
USGS data were prominently featured in some of the presentations and posters
USGS should play both leadership and participant roles in future EIM conferences by submitting abstracts and attending
EIM 2011: Overview
A forum for information managers, scientists, and informatics researchers to present and discuss advances in environmental information management, analysis, and modeling.
EIM 2011 invited original, short papers (6 pages) on environmental information management.
EIM 2011: Who, When & Where
Sponsors:
National Center for Ecological Analysis and Synthesis (NCEAS), University of California, Santa Barbara
Long-term Ecological Research Network (LTER), University of Wisconsin
DataONE
September 28-29, 2011, Santa Barbara, CA
Proceedings: proceedings-2011
EIM 2011: Submission Topics
Sensor networking and deployment
Data acquisition, field-computing devices
Quality control processing
Data archival for long-term persistence
Metadata generation, storage, and management
Metadata-driven data analysis services
Data and metadata semantics, controlled vocabularies, and ontologies
Analysis and modeling frameworks
Data and metadata versioning and lineage
Visualization tools
Geospatial tools, applications, and standards
Sociology of collaboration and data sharing
EIM 2011: Data Interoperability for Synthesis Science
Session Topics:
Sensors and Workflows
Semantics and Data Management
Discovery, Visualization, and Analysis
CyberInfrastructure Systems
25 talks, 2 keynotes, a poster session, 4 BOFs, and a concluding group discussion
Plenary (group) discussion: Community Standards and Practices Development
Birds of a Feather (BOF) Topics
Internet mapping: What are the options?
Using web tools and methods to support earth science collaborations
Geospatial data management for ecological research organizations
Automating data processing and quality control using workflow software: Converting sensor data to usable environmental information
Keynotes
Patricia Cruse, Founding Director, UC Curation Center (UC3) at the California Digital Library (CDL), “Building Communities, Partnerships, Tools, and Services in Order to Thrive in a Dynamic Information Landscape”
Ned Gardiner, Visualization Manager, NOAA Climate Program Office, “The Future is Unwritten: Data and Information for a Transforming World”; Digital Universe Atlas
EZID: Long-term identifiers made easy
USGS has signed on to the UC3 EZID system
Sensors and Workflows
Kepler: free, open-source scientific workflow application
project.org/
Semantics & Data Management Deborah L. McGuinness, RPI, “A Semantically-Enabled Provenance-Aware Water Quality Portal”
Semantics & Data Management
Carly Strasser, DCXL Project Manager, UC3 CDL, “DataONE: Promoting Data Stewardship Through Best Practices”
Digital Curation for Excel (DCXL) Project: will build an add-in for Microsoft Excel, intended to be open-source, that will assist scientists in preparing their Excel data for sharing
Discovery, Visualization and Analysis
Kepler, Metacat, EML
Atlas of Living Australia
Lifemapper: Infrastructure and Services for Biodiversity Science
Interactive Visualization of Spatial and Temporal Patterns of Diversity
Digital Universe Atlas
The Atlas of Living Australia
Lee Belbin, web programmer
$64.7M project that presents continental-scale environmental data with a biodiversity focus
CyberInfrastructure Systems
CUAHSI – Data Interoperability in the Hydrologic Sciences
Critical Zone Observatory (CZO) – Initial design of data sharing infrastructure, Zaslavsky, UCSD Supercomputer Center
CZO is a group of NSF-funded observatories that investigate earth processes in the critical zone, the region between bedrock and the atmospheric boundary layer
CZO uses the CUAHSI data structure
Poster Session
Wade Sheldon, Georgia Coastal Ecosystems LTER, “GCE Data Toolbox: Metadata-driven Software for Data Acquisition, Quality Control and Synthesis”
Speeds up the turnaround from sensor data acquisition to bringing data online by automating processing, documentation, and QC steps
Data can be retrieved for any ClimDB, USGS NWIS, or NOAA NWS station across the country
MATLAB tools: http://gce-svn.marsci.uga.edu/trac/GCE_Toolbox
“PyGIS” library will be here: https://gce-svn.marsci.uga.edu/trac/Python-GIS/wiki
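Illustration: metadata-driven quality control
The idea of automating QC from metadata, as described on the poster slide, can be sketched in a few lines. This is a minimal Python sketch, not the GCE Data Toolbox itself (which is MATLAB software) and not its API. The ColumnMeta class, the flag_values function, the flag codes "M" and "Q", and the temperature limits are all hypothetical names and values chosen for illustration.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class ColumnMeta:
    """Hypothetical per-column metadata: name, units, and valid range."""
    name: str
    units: str
    valid_min: float
    valid_max: float


def flag_values(values: List[Optional[float]], meta: ColumnMeta) -> List[str]:
    """Return one QC flag per value, driven only by the column metadata.

    '' = passes checks, 'M' = missing, 'Q' = questionable (out of range).
    The flag codes are assumptions made for this sketch.
    """
    flags = []
    for v in values:
        if v is None:
            flags.append("M")
        elif v < meta.valid_min or v > meta.valid_max:
            flags.append("Q")
        else:
            flags.append("")
    return flags


# Hypothetical sensor column and readings (None marks a missing record).
water_temp = ColumnMeta("water_temp", "degC", valid_min=-5.0, valid_max=45.0)
readings = [18.2, None, 99.9, 17.8]
flags = flag_values(readings, water_temp)

for value, flag in zip(readings, flags):
    print(value, repr(flag))
```

Because the limits live in the metadata rather than in the code, the same function could be reused for any sensor column by supplying a different ColumnMeta record.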
EIM 2011: Take-home Points
EIM is a forum for exchanging ideas about the management of environmental data
Discussion topics parallel the issues addressed by the USGS Community for Data Integration
USGS data were prominently featured in some of the presentations and posters
USGS should play both leadership and participant roles in future EIM conferences by submitting abstracts and attending