Summary Report from Thursday, 3 March 2011 Pine Room Data Integration Breakout Group Geo-Data Informatics (GDI) Workshop: Exploring the Life Cycle, Citation.

Slides:



Advertisements
Similar presentations
Rolling Deck to Repository: Transforming the United States Academic Fleet Into an Integrated Global Observing System Suzanne M. Carbotte, Robert Arko,
Advertisements

Maines Sustainability Solutions Initiative (SSI) Focuses on research of the coupled dynamics of social- ecological systems (SES) and the translation of.
Visualizing Fitness for Purpose Bob Groman and Dicky Allison Biological and Chemical Oceanography Data Management Office Woods Hole Oceanographic Institution.
Richard Lane, Natural History Museum, London Scientific Collections International (SciColl) An international coordinating mechanism OECD GSF Krakow Oct.
Ocean Data Interoperability Platform EU-US-Australia collaborative project Grant Number: Call: FP7-INFRASTRUCTURES INFSO Activity: INFRA :
Data Citation Breakout Report Pine Group March 3, 2011.
GeoData 2011 Workshop Data Life Cycle Break Out #3 Wednesday, 2 March 2011 Moderator: Mohan Ramamurthy, Unidata.
1 CYBERINFRASTRUCTURE FOR THE GEOSCIENCES Global Earth Observation Grid Workshop, Bangkok, Thailand, March Integration Platform.
Biological and Chemical Oceanography Data Management Office 1 of 12 An Introduction to the Biological and Chemical Oceanography Data Management Office.
State Geological Survey Contributions to the National Geothermal Data System.
Field Project Planning, Operations and Data Services Jim Moore, EOL Field Project Services (FPS) Mike Daniels, EOL Computing, Data and Software (CDS) Facility.
Data Management Practices: BCO-DMO’s Successes and Challenges Bob Groman BCO-DMO Woods Hole Oceanographic Institution NERACOOS/NeCODP Data Management Workshop.
GeoData 2011 Data Life Cycle: Breakout Session #4 (Pine) Breakout Moderator: Joanne Luciano Tetherless World Constellation Rensselaer Polytechnic Institute.
Metadata Guides for Smarties Marine Metadata Initiative URL:
The Case for Data Stewardship: Preserving the Scientific Record Matthew Mayernik National Center for Atmospheric Research Version 2.0 [Review Date]
The Marine Metadata Interoperability Project A Model for Community Collaboration September 23, 2010 Nan Galbraith WHOI.
Helen Glaves (NERC- BGS), Dick Schaap (MARIS), Robert Arko (LDEO) and Roger Proctor (IMOS)
Introduction to OBIS-USA Biological Data, Applications, & Relationships March 14, 2011.
ENEON first workshop Observing Europe: Networking the Earth Observation Networks in Europe September, Paris Summary on data availability, sharing,
Concept Award PI Meeting Outcomes: Virtual Presentation to the EarthCube Community September 13, 2012.
U.S. Department of the Interior U.S. Geological Survey CDI Webinar Sept. 5, 2012 Kevin T. Gallagher and Linda C. Gundersen September 5, 2012 CDI Science.
Sharing Research Data Globally Alan Blatecky National Science Foundation Board on Research Data and Information.
U.S. Department of the Interior U.S. Geological Survey A vision for a global community Linda Gundersen Director Science Quality and Integrity US Geological.
ESIP Federation: Connecting Communities for Advancing Data, Systems, Human & Organizational Interoperability November 22, 2013 Carol Meyer Executive Director.
ESIP Federation 101 Federation of Earth Science Information Partners July 17, 2012.
Local global disambiguation of terms and concepts The BCO-DMO metadata database uses controlled vocabularies to record many of the important pieces of.
Introduction GeoData 2011 Workshop March 2-4, 2011, Broomfield, CO Peter Fox (RPI) Tetherless World Constellation
EarthCube Building Block for Integrating Discrete and Continuous Data (DisConBB) David Maidment, University of Texas at Austin (Lead PI) Alva Couch, Tufts.
Introduction GeoData 2014 Workshop #geodata2014 June 17-19, 2014,NCAR, Boulder, CO Peter Fox (RPI)
CODATA-CRIA Inter-American Workshop on Access to Scientific Data GSDI: Vision, Goals and Progress Santiago Borrero, Secretary General Panamerican Institute.
WGISS Working Group on Information Systems and Services Richard MORENO CNES WGISS report, Agenda Item 14 Tromsø, Norway October 2014.
VERTIGO data OCB database status update Cyndy Chandler Ocean Carbon and Biogeochemistry Data Management Office Cyndy Chandler Ocean Carbon and Biogeochemistry.
NOAA National Geophysical Data Center & collocated World Data Centers, Boulder CO USA World Data Center for Marine Geology and Geophysics, Boulder, CO.
INTO THE NEW YEAR January 3, Objectives Reaffirm principles –China’s interest in exploring ESIP structure prompted review of ESIP evolution (more.
Biological and Chemical Oceanography Data Management Office slide 1 of 19 CAMEO Data Management Bob Groman Biological and Chemical Oceanography Data Management.
The Role of Academic Libraries in the Digital Data Universe Break-Out Session: New Partnership Models Bob Hanisch and Brian Schottlaender Co-Leaders ARL.
The CF Conventions: Options for Sustained Support Involving Unidata Russ Rew Unidata Policy Committee May 12, 2008.
Breakout # 1 – Data Collecting and Making It Available Data definition “ Any information that [environmental] researchers need to accomplish their tasks”
GEOSCIENCE NEEDS & CHALLENGES Dogan Seber San Diego Supercomputer Center University of California, San Diego, USA.
Science Data in the Science Mission Directorate (SMD) Jeffrey J.E. Hayes Program Executive for MO & DA, Heliophysics Division August 17, 2011.
November 16, 2009 Page 1 of 28 Data and Data Management: Introduction to the BCO-DMO Presented to Professor Keiichi Uchida November 16, 2009 Robert C.
Towards an European Network of Earth Observation Networks (ENEON): Addressing Challenges and Facilitating Collaboration for non-space based Earth Observations.
IPY International Polar Year Progress report to STG 2.
ARL Workshop on New Collaborative Relationships: The Role of Academic Libraries in the Digital Data Universe September 26-27, 2006 ARL Prue.
NATIONAL TREASURES DATA PRESERVATION WITH METADATA Sharon Shin Metadata Coordinator Federal Geographic Data Committee Secretariat ASPRS-Reno 2006.
Fire Emissions Network Sept. 4, 2002 A white paper for the development of a NSF Digital Government Program proposal Stefan Falke Washington University.
CUAHSI HIS: Science Challenges Linking small integrated research sites (
Convergence And Trust in Earth and Space Science Data Systems Ted Habermann, NOAA National Geophysical Data Center Documentation: It’s not just discovery...
Providing access to your data: Determining your audience Robert R. Downs, PhD NASA Socioeconomic Data and Applications Center (SEDAC) Center for International.
4 th WCRP Observations and Assimilation Panel Meeting Hamburg, Germany, March 29-31, Workshop on Ensuring Access and Trustworthiness of Climate.
Semantic Concepts in Expedition Metadata Semantic Concepts in Expedition Metadata Bob Arko Lamont-Doherty Earth Observatory OOSSI Workshop Nov. 18, 2008.
ISWG / SIF / GEOSS OOSSIW - November, 2008 GEOSS “Interoperability” Steven F. Browdy (ISWG, SIF, SCC)
Helen Glaves 1 and Dick Schaap 2 1 British Geological Survey, United Kingdom 2 MARIS, The Netherlands.
Biological and Chemical Oceanography Data Management Office slide 1 of 10 U.S. GEOTRACES Data Management Cyndy Chandler BCO-DMO ~ WHOI 23 September 2008.
Biological and Chemical Oceanography Data Management Office slide 1 of 22 Introduction to Data Management for Ocean Science Research Cyndy Chandler Biological.
ISWG / SIF / GEOSS OOS - August, 2008 GEOSS Interoperability Steven F. Browdy (ISWG, SIF, SCC)
The Case for Data Stewardship: Preserving the Scientific Record Matthew Mayernik National Center for Atmospheric Research Section: The Case for Data Stewardship.
A Shared Commitment to Digital Preservation and Access.
Biological and Chemical Oceanography Data Management Office slide 1 of 10 The Biological and Chemical Oceanography Data Management Office (BCO-DMO) Cyndy.
CEOS Working Group on Information System and Services (WGISS) Data Access Infrastructure and Interoperability Standards Andrew Mitchell - NASA Goddard.
Training Course on Data Management for Information Professionals and In-Depth Digitization Practicum September 2011, Oostende, Belgium Concepts.
Informatics for Scientific Data Bio-informatics and Medical Informatics Week 9 Lecture notes INF 380E: Perspectives on Information.
Data Citation Breakouts GeoData 2011 Workshop March 3, 2011, Broomfield, CO Peter Fox (RPI) Tetherless.
JCU Australian Marine Science Data Network.
Federation of Earth Science Information Partners EGIDA Workshop May 9-11, 2011, Bonn, Germany.
Helmholtz Open Science Webinars on Research Data Webinar 34 – 6 / 11 April 2016 Dr. Birgit Schmidt Niedersächsische Staats- und Universitätsbibliothek.
Training Course on Data Management for Information Professionals and In-Depth Digitization Practicum September 2011, Oostende, Belgium Concepts.
Three Uses for a Technology Roadmap
Bird of Feather Session
  1-A) How would Arctic science benefit from an improved GIS?
Presentation transcript:

Summary Report from Thursday, 3 March 2011 Pine Room Data Integration Breakout Group Geo-Data Informatics (GDI) Workshop: Exploring the Life Cycle, Citation and Integration of Geo-Data

Discussion Prompt In your view/experience what parts of data integration implementations/applications or frameworks are well established (or not) in your discipline(s) and what are the common gaps? Moderator: Cyndy Chandler (WHOI, BCO-DMO) Rapporteur: Chris Mattmann (NASA JPL, USC) Discussion notes kept at TWC hosted titanpad site

Participants Bob Arko (Lamont-Doherty Earth Observatory) Joanne Luciano (TWC, RPI) Anna Milan (National Geophysical Data Center) Bob Simons (NOAA) Brian Wee (NEON, Inc.) Leslie Hsu (LDEO) Roland Viger (USGS) James Wilson (James Madison University) Tom Narock (NASA/GSFC) Cathy Constable (SIO, UCSD) Ruth Duerr (NSIDC) Yoori Choi (CUAHSI) Lee Allison, Arizona Geological Survey Erin Robinson (ESIP) Kavitha Chandrasekar, Indiana University Bob Detrick (NSF) Clifford Jacobs (NSF) Leonard Jonson (NSF)

Data Integration What does that mean? Combining more than one data source into a single data object. Different from display of multiple data sources in a single view. Example: a database join Time series data sets made up of a variety of sources of data often require data integration. Data aggregation and interoperability are related concepts. Group did not come to consensus.

Geo Disciplines Represented Geology Hydrology Oceanography Geophysics Geography Marine geology and geophysics Space science Air quality Computational neuroscience Multi-disciplinary or discipline-agnostic: data management, computer science and archive

Geo-Data Integration What aspects are well established or not? Identify common gaps?

For many projects, two common themes emerged as being associated with some level of success in ability to do data integration: – ‘long-term’ commitment of funding support – Active engagement of funding managers Examples: Unidata (Atmospheric Sciences) CUASHI (Hydrography) IRIS (Earthquake) US JGOFS, US GLOBEC, US WOCE (Ocean Sciences) ODP (Ocean Drilling) NEON

Support for Data Integration Development of community of practice Infrastructure to foster communication (workshops) Mentoring of students and early career PIs Development of tools (e.g. Unidata developed NetCDF which has been adopted by many communities) Education and training The persistence and recognition of a ‘named’ community can enable funds to flow from some agencies to researchers

Support for Data Integration Some communities agreed on common data formats that facilitated data integration Pressures from funding agencies or community needs resulted in common software tools Some communities identified ‘primary’ or ‘core’ variables (e.g. common, essential measurements)

Summary ‘Long-term’ funding support enables development of a community-of-practice that fosters communication, education and training, development and adoption of common tools and identification of core measurements. Communities-of-Practice can divide up the labor and work collaboratively to address shared challenges (economy of scale).

Additional Observations Tension between local and global (single PI to coordinated project to national to international). An awareness of global use of data could help with subsequent data integration. Early planning/specs for data management are important but traditionally difficult to obtain funding.

Gaps Lack of awareness/understanding that keeping data ‘alive’ (usable) is not free Many people think data stewardship and data preservation are "solved problems” (not). "bit level preservation" has been solved, but what is the useful lifespan of those files? What effort is required to make the archived data compatible with all the latest tools and technology. Ability to use a dataset declines over time, without continuing and ongoing attention to ensure that it's still meeting the current access requirements.

Gaps Historical or legacy data (originating PI is no longer active in the research community) no national policy for scientific preservation different disciplines have different interpretations of features in a dataset Lack of guidelines for best practices regarding metadata required to document model results * software, methodology, inputs, outputs, etc

Gaps Misconception that you create metadata one time, and it's forever good – not a true statement – somehow the metadata needs to be updated – systems and the infrastructure need to support this – metadata needs to evolve over time

Suggestion Group agreed that ESIP would be an appropriate community in which to continue these discussions and start to do some much needed planning and cross-disciplinary solutions needed to address the gaps and improve infrastructure for geo-data integration.

Additional Comments NRC study done 7-8 years ago about the loss of data and samples in the geosciences: Geoscience Data and Collections: NATIONAL RESOURCES IN PERIL

Additional Comments Marine Metadata Interoperability (MMI) Collection of ‘Guides’ on topics including Semantic Web technologies, controlled vocabularies, ontologies, standards, metadata best practices, and much more. MMI Ontology Registry and Repository (ORR) is a web application through which you can create, update, access, and map ontologies and their terms.

Additional CUASHI: Hydrologic Ontology System (funded by NSF) "Data Management Plan" template available from CUAHSI (February 2011). It is available at and includes data inventory, data and metadata standards, data management life cycle, etc.

Additional Comments EXILIR xir.aspx European life science infrastructure for biological information. Its Mission: To construct and operate a sustainable infrastructure for biological information in Europe to support life science research and its translation to medicine and the environment, the bio-industries and society.