Data Life Cycle GeoData 2011 Workshop March 2, 2011, Broomfield, CO Peter Fox (RPI) Tetherless World Constellation

Presentation transcript:

Data Life Cycle GeoData 2011 Workshop March 2, 2011, Broomfield, CO Peter Fox (RPI) Tetherless World Constellation

Motivation, temptation
A world of challenges – as if Tim did not motivate you enough
Data and people at the heart of it
Researchers and their data are valuable (as ever)
But not enough attention, focus

Working premise
Scientists – actually ANYONE – should be able to access and use a global, distributed knowledge base of scientific data that:
– appears to be integrated
– appears to be locally available
But… data and information are obtained by multiple means (instruments, models, analysis) using various (often opaque) protocols, in differing vocabularies, using (sometimes unstated) assumptions, with inconsistent (or non-existent) metadata. They may be inconsistent, incomplete, evolving, and distributed AND created in a form that facilitates generation, not use (except by accident)
And… significant levels of semantic heterogeneity, large-scale data, complex data types, legacy systems, inflexible and unsustainable implementation technology…
Uh-oh

Definitions
Data - are encodings that represent the qualitative or quantitative attributes of a variable or set of variables.
Data (plural of "datum", which is seldom used) - are typically the results of measurements and can be the basis of graphs, images, or observations of a set of variables, but now also come from models, etc.
Data - are often viewed as the lowest level of abstraction from which information and knowledge are derived

Definitions ctd.
Information – representations (of facts? data?) in a form that lends itself to human use
Knowledge – check out Wikipedia… meaning
Metadata – data about data
Metainformation – information about information
Data documentation – integrated collection of information and metadata intended to support all aspects of data (find, access, use…)

Examples
Rock sample:
– Data – weight, composition, shape, size
– Information – images of the rock as collected
– Knowledge – evidence of geologic activity
– Metadata – location and time of collection
– Documentation – published lab report …
Weather:
– Data – wind speed and direction, temperature, …
– Information – weather map with contours and features
– Knowledge – high pressure system, stable weather
– Metadata – type of radar, sensor, use of model
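The rock-sample example can be sketched as a small data structure that keeps the layers apart. This is only an illustration: the class name `DataRecord`, its field names, the `missing_metadata` helper, and every value below are hypothetical and do not come from the talk.

```python
from dataclasses import dataclass, field
from typing import Any, Dict, List


@dataclass
class DataRecord:
    """Hypothetical container that keeps the layers from the slide separate."""
    data: Dict[str, Any]        # measured attributes (weight, composition, ...)
    information: List[str]      # human-usable representations, e.g. image files
    knowledge: List[str]        # interpretations drawn from the data
    metadata: Dict[str, Any]    # data about the data (where, when, ...)
    documentation: List[str] = field(default_factory=list)

    def missing_metadata(self, required: List[str]) -> List[str]:
        """Return the required metadata keys that are absent, in order."""
        return [key for key in required if key not in self.metadata]


# All values are made-up placeholders for illustration only.
rock = DataRecord(
    data={"weight_g": 412.0, "composition": "basalt", "shape": "angular"},
    information=["rock_as_collected.jpg"],
    knowledge=["evidence of geologic activity"],
    metadata={"location": "example outcrop", "collected_at": "2011-03-02"},
    documentation=["published lab report"],
)

# Ask which of the metadata items a reuser might need are not recorded.
gaps = rock.missing_metadata(["location", "collected_at", "collector"])
print(gaps)
```

Keeping metadata in its own field makes it easy to ask what is missing before the data are shared, which is the "inconsistent (or non-existent) metadata" problem raised in the working premise.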

Fields vs. objects (Cox/2005 AGU Spring)
“Feature” viewpoint (classic geology):
– complex data
– database insertion
– complete feature interpretations
– XML documents
“Coverage” viewpoint (classic geophysics):
– simple data structures
– collated/gridded
– ready for analysis
– netCDF, HDF-EOS

Definitions ctd.
Data life-cycle elements (simple 3-level)
– Acquisition: process of recording or generating a concrete artefact from the concept (see transduction)
– Curation: the activity of managing the use of data from its point of creation to ensure it is available for discovery and re-use in the future (see curator)
– Preservation: process of retaining usability of data in some source form for intended and unintended use
– Stewardship: process of maintaining integrity across acquisition, curation and preservation
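One narrow reading of "maintaining integrity across acquisition, curation and preservation" is a fixity check: record a checksum at acquisition and verify it at each later stage. The sketch below assumes that reading. `Stage`, `fingerprint` and `StewardshipLog` are hypothetical names, and stewardship in the sense of the talk covers much more than bit-level integrity.

```python
import hashlib
from enum import Enum
from typing import List, Tuple


class Stage(Enum):
    """The three life-cycle levels named on the slide."""
    ACQUISITION = "acquisition"
    CURATION = "curation"
    PRESERVATION = "preservation"


def fingerprint(payload: bytes) -> str:
    """SHA-256 digest used here as a simple fixity value."""
    return hashlib.sha256(payload).hexdigest()


class StewardshipLog:
    """Hypothetical log: remembers the digest taken at acquisition
    and records whether later stages still hold the same bytes."""

    def __init__(self, payload: bytes) -> None:
        self.expected = fingerprint(payload)
        self.history: List[Tuple[Stage, bool]] = [(Stage.ACQUISITION, True)]

    def check(self, stage: Stage, payload: bytes) -> bool:
        ok = fingerprint(payload) == self.expected
        self.history.append((stage, ok))
        return ok


log = StewardshipLog(b"temperature,salinity\n12.1,35.0\n")
curated_ok = log.check(Stage.CURATION, b"temperature,salinity\n12.1,35.0\n")
preserved_ok = log.check(Stage.PRESERVATION, b"temperature,salinity\n12.1,35.1\n")
print(curated_ok, preserved_ok)
```

The second check fails because one byte of the payload changed after acquisition, which is the kind of silent drift a stewardship process is meant to catch.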

Definitions ctd.
Stewardship -> Management: process of arranging for discovery, access and use of data, information and all related elements. Also oversees or effects control of processes for acquisition, curation, preservation and stewardship. Involves fiscal and intellectual responsibility.
Not explicitly the focus of this workshop.

Data has Lots of Audiences
[Figure: audiences arranged from More Strategic to Less Strategic. Science too!]
From “Why EPO?”, a NASA internal report on science education, 2005

Too many diagrams

Curation stages
People!
(Fox VSTO et al.)

On to Life Cycle…
Life Cycle, lifecycle, life-cycle …
By now I hope you know I know it’s about a mix of factors
Research data and researchers

Digital Curation Center model

MIT DDA Alliance model

It does not go on forever…

Business or software model?

Physical quantity versus measured as quantity
Value and units? Reference frame? Reference units? Value and units?
Courtesy Krishna Sinha (VT)
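The questions on this slide can be turned into a check on a measurement record: a bare number is not usable until its units and reference frame are stated. The sketch below is an assumption-laden illustration. The `Measurement` class, its fields, and the elevation value are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Measurement:
    """Hypothetical record of a measured quantity."""
    quantity: str
    value: float
    units: Optional[str] = None            # e.g. "m"
    reference_frame: Optional[str] = None  # e.g. a named vertical datum

    def unanswered(self) -> List[str]:
        """List which of the slide's questions this record leaves open."""
        questions = []
        if self.units is None:
            questions.append("units")
        if self.reference_frame is None:
            questions.append("reference frame")
        return questions


# Placeholder value: an elevation with units but no stated datum.
elevation = Measurement("elevation", 1640.0, units="m")
open_questions = elevation.unanswered()
print(open_questions)
```

Elevation is a convenient example because the same point has different values under different vertical datums, so the value and units alone do not identify the physical quantity.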

Local in-situ Networks and Systems
Air pollution measurement station: Emden, Germany
Local and national air pollution networks: Venice, Italy, and Indonesia
21 September 2015 © GEO Secretariat

Global in situ Networks and Systems
Global Seismic Network: signal from the Indian Ocean Earthquake, 26 December 2004
Global Argo Float Array: measuring ocean temperature and salinity

Satellite Observation Systems
ENVISAT RA-2 observing the Gulf Stream current velocity

Can we really fulfil futures with diagrams?

Modeling the Climate as a System
Transformative Science, Data Infrastructures and the IPCC Experience
Lawrence Buja, National Center for Atmospheric Research, Boulder, Colorado
CAM T341 - Jim Hack

Briefing on Results: USGS Science Strategy to Support U.S. Fish & Wildlife Service Polar Bear Listing Decision: a 6-month effort
U.S. Department of the Interior, U.S. Geological Survey

E.g. Solar Irradiance

One composite, one assumption

Another composite, different assumption

Temptation
To run screaming from the room?
– Wait – there are cookies (and a reception)!
To really focus on what you are DOING (less what you WANT to do) and NOT DOING, but need to – near term (next week)
Talk about it… argue it… listen to others
To focus on value – the real and immediate value to you and the people you work with and institution/communities you work for/with!

Questions?