A Science Community Perspective

Slides:



Advertisements
Similar presentations
Meteorological Observatory Lindenberg – Richard Assmann Observatory The GCOS Reference Upper Air Network.
Advertisements

The Central Role of Data ‘Capturing and Sharing Chemistry Research Data’ Simon Coles School of Chemistry, University of Southampton, U.K.
Basic Measurement Concepts ISAT 253 Spring Dr. Ken Lewis Mod. 2 Measurement Concepts So far… In the Design of Experiments In the Design of Experiments.
name___________________________ World of Physical Science
How old are you. How tall are you
Meteorological Observatory Lindenberg – Richard Assmann Observatory The GCOS Reference Upper Air Network.
Michigan High School Science Meap Test Constructing.
19/09/20151 Climate Change Data Analysis, Risks Assessments On agric/Water Resources and Adaptation Strategies In Some AAP-Countries Seyni Salack (UNOPS-IRTSC,
David T. Marx Physics Department 10 th Annual University-Wide Symposium on Teaching and Learning Wednesday, January 6, 2010.
Preserving the Scientific Record: Preserving a Record of Environmental Change Matthew Mayernik National Center for Atmospheric Research Version 1.0 [Review.
COSTOC Olivier MestreMétéo-FranceFrance Ingebor AuerZAMGAustria Enric AguilarU. Rovirat i VirgiliSpain Paul Della-MartaMeteoSwissSwitzerland Vesselin.
Phys211C1 p1 Physical Quantities and Measurement What is Physics? Natural Philosophy science of matter and energy fundamental principles of engineering.
OFCM Special Session on Research Needs 19 Jun 03 John Pace Operational Applications Division Technology Development Directorate Defense Threat Reduction.
Data-Model Assimilation in Ecology History, present, and future Yiqi Luo University of Oklahoma.
Semantically-Enabled Science Data Integration (SESDI) and The Virtual Solar-Terrestrial Observatory (VSTO) Semantically-enabled (large-scale) Scientific.
Modifying TC Energy Metrics: Using Wind Radii to Better Estimate TC Induced Heating Philippe Papin.
Phys211C1 p1 Physical Quantities and Measurement What is Physics? Natural Philosophy science of matter and energy fundamental principles of engineering.
AN ENHANCED SST COMPOSITE FOR WEATHER FORECASTING AND REGIONAL CLIMATE STUDIES Gary Jedlovec 1, Jorge Vazquez 2, and Ed Armstrong 2 1NASA/MSFC Earth Science.
South Africa in the global knowledge arena: implications for academic libraries Andrew M. KANIKI Executive Director: Knowledge Management and Strategy.
Deepcarbon.net Xiaogang (Marshall) Ma, Yu Chen, Han Wang, John Erickson, Patrick West, Peter Fox Tetherless World Constellation Rensselaer Polytechnic.
2nd GODAE Observing System Evaluation Workshop - June Ocean state estimates from the observations Contributions and complementarities of Argo,
The Long Tail of Sample-based Data in the Next Decade FROM DARKNESS TO LIGHT Kerstin Lehnert
Universiteit Antwerpen Conference "New Frontiers in Evaluation", Vienna, April 24th-25th Reliability and Comparability of Peer Review Results Nadine.
Instrumental Surface Temperature Record Current Weather Data Sources Land vs. Ocean Patterns Instrument Siting Concerns Return Exam II For Next Class:
Meteorological Observatory Lindenberg Results of the Measurement Strategy of the GCOS Reference Upper Air Network (GRUAN) Holger Vömel, GRUAN.
Chapter 3: Organizing Data. Raw data is useless to us unless we can meaningfully organize and summarize it (descriptive statistics). Organization techniques.
CE 401 Climate Change Science and Engineering evolution of climate change since the industrial revolution 9 February 2012
Data Quality A Science Community Perspective 17/13/11K. Lehnert, ESIP Panel on Data Quality Kerstin Lehnert Lamont-Doherty Earth Observatory Columbia University.
Project number: ENVRI and the Grid Wouter Los 20/02/20161.
SCIENCE SKILLS Chapter What is Science I. Science from Curiosity A. Involves asking questions about nature and finding solutions. B. Begins with.
Big Data: Every Word Managing Data Data Mining TerminologyData Collection CrowdsourcingSecurity & Validation Universal Translation Monolingual Dictionaries.
Informatics for Scientific Data Bio-informatics and Medical Informatics Week 9 Lecture notes INF 380E: Perspectives on Information.
Introduction to Power and Effect Size  More to life than statistical significance  Reporting effect size  Assessing power.
ESA Climate Change Initiative Sea-level-CCI project A.Cazenave (Science Leader), G.Larnicol /Y.Faugere(Project Leader), M.Ablain (EO) MARCDAT-III meeting.
Helmholtz Open Science Webinars on Research Data Webinar 34 – 6 / 11 April 2016 Dr. Birgit Schmidt Niedersächsische Staats- und Universitätsbibliothek.
3.1 Using and Expressing Measurements > 1 Copyright © Pearson Education, Inc., or its affiliates. All Rights Reserved. Chapter 3 Scientific Measurement.
Literature Review Dr. Mozaherul Hoque Abul Hasanat.
Amplify Science.
How old are you. How tall are you
Reliability & Validity
Data Ingestion in ENES and collaboration with RDA
Persistent Identifiers Implementation in EOSDIS
The Systems Engineering Context
Purpose of Research Research may be broadly classified into two areas; basic and applied research. The primary purpose of basic research (as opposed to.
Statistical Methods for Model Evaluation – Moving Beyond the Comparison of Matched Observations and Output for Model Grid Cells Kristen M. Foley1, Jenise.
Instrumental Surface Temperature Record
Publishing software and data
The art of weather forecasting
Value proposition for the app
Citizen Science’s contribution to GEO BON
Overview of Temperature Measurement ME 115
Chapter 3 Scientific Measurement 3.1 Using and Expressing Measurements
Tools of Software Development
Chapter 1.3 Notes Name: How old are you? How tall are you? The answers to these questions are measurements.
Outline of the Scientific Method
Instrumental Surface Temperature Record
Systems Engineering for Mission-Driven Modeling
Please take a notes packet and put your name on it.
Measurement in Chemistry
Bird of Feather Session
School of Information Studies, Syracuse University, Syracuse, NY, USA
Instrumental Surface Temperature Record
GEO - Define an Architecture Integrated Solutions
  1-A) How would Arctic science benefit from an improved GIS?
©Ian Sommerville 2004Software Engineering, 7th edition. Chapter 8 Slide 1 Tools of Software Development l 2 types of tools used by software engineers:
7th Grade Science Mrs. Gallagher
Designing Experimental Investigations
Table 1. Conceptual Framework Learning Outcomes
Table 3. Standardized Factor Loadings of EFA
Presentation transcript:

A Science Community Perspective Kerstin Lehnert Lamont-Doherty Earth Observatory Columbia University lehnert@ldeo.columbia.edu Thanks for helpful comments: Mark Ghiorso Ken Ferrier Al Hofmann Alexey Kaplan Roger Nielsen Mohan Ramamoorthy Tom Whittaker Data Quality A Science Community Perspective K. Lehnert, ESIP Panel on Data Quality 7/13/11

DQ & Science Science Technology Standards Norms Tools DQ is a shared norm which against the work of scholars is measured. Scientist produce data and use data that are the foundation of theories, hypothesis, and models. The quality of the data impacts the quality of the Before we can start thinking about the development and implementation of metadata schemas and protocols, science communities need to understand the norms Attacking large-scale problems in the Earth Sciences from climate change to convection of the Earth mantle requires scientists to use large datasets , many of which need to be compiled K. Lehnert, ESIP Panel on Data Quality 7/13/11

The Social Side of DQ “The reliability of knowledge about climate change depends on the commensurability of data in space and time.” From Paul N. Edwards: "A Vast Machine": Standards as Social Technology Science, vol. 304, 2004 DOI: 10.1126/science.1099290 Yet even these changes have not eliminated the importance of disciplined human beings for the successful implementation of standards. For example, in the late 1980s, the U.S. Weather Service replaced liquid-in-glass thermometers with digital electronic ones at thousands of stations in its Cooperative Station Network. The new thermometers displayed Fahrenheit in units of 0.1°. Observers were meant to round off the readings to the nearest degree, but about 10% of observers simply entered the entire figure. Many who did round off probably did so incorrectly (6). Furthermore, the new, more accurate instruments did not correlate exactly with the old ones. Network-wide, the new instrumentation altered the mean daily temperature range by −0.7°C and the average daily temperature by −0.1°C compared with the previous system (7). This example illustrates the complex combination of social and technical problems that affect the implementation of standards. The consequences for the detection of climatic change can be profound: The biases discovered in the U.S. Cooperative Station Network, although correctable, “are of the same magnitude as the changes of global and United States mean temperatures since the turn of the 20th century” (6). Matthew Maury's 1858 diagram of the global atmospheric circulation. K. Lehnert, ESIP Panel on Data Quality 7/13/11

Earth Science Data Diversity of data, many disciplines, mostly observational data and model outputs Big data from sensors and sensor networks, standardized data acquisition, scientist is not the producer of raw data, but synthesizes and generates derived products Small data that are generated in the lab or in the field, sometimes not even numerical, acquired by a wide range of methods, often personalized K. Lehnert, ESIP Panel on Data Quality 7/13/11

Error Budgets http://www.ssterrorbudget.org/ISSTST/White_Paper.html 7/13/11 K. Lehnert, ESIP Panel on Data Quality Error Budgets Diagram from White Paper on the SST Error Budget, produced by the U.S. SST Science Team http://www.ssterrorbudget.org/ISSTST/White_Paper.html

DQ: Instrument Errors “Most of the rapid decrease in globally integrated upper (0–750 m) ocean heat content anomalies (OHCA) between 2003 and 2005 reported by Lyman et al. [2006] appears to be an artifact resulting from the combination of two different instrument biases recently discovered in the in situ profile data.” K. Lehnert, ESIP Panel on Data Quality 7/13/11

DQ: Precision “Mantle Myths, Reservoirs, and Databases” Presentation by A. Hofmann at the Goldschmidt Conference 2008 K. Lehnert, ESIP Panel on Data Quality 7/13/11

What Defines DQ? “Knowing that I can trust the numbers.” “Data having an uncertainty that actually corresponds to the uncertainty stated in the the source.” “In one word, ‘completeness’.” (allows others to assess the validity of data, because then you can check for standards used, techniques, reproducibility, etc. Reproducibility, precision, … K. Lehnert, ESIP Panel on Data Quality 7/13/11

How Do You Evaluate DQ? ‘Analytical completeness’, including uncertainties, and metadata. Statistical tests, internal consistency. Rely on reputation of the investigator, either directly or by association. “Well, usually I don't, because that's a lot of work.” K. Lehnert, ESIP Panel on Data Quality 7/13/11

DQ Needs Carrots & Sticks Tools for DQ metadata management, e.g. capture during data acquisition Software for using DQ metadata in data analysis, synthesis, modeling Policies for and enforcement of data & metadata reporting Peer-review of data K. Lehnert, ESIP Panel on Data Quality 7/13/11

Data Publication Publication of data in repositories QC/QA at repository (completeness, consistency) Open Access Long-term archiving Link to scientific articles via unique identifiers Support for investigators to comply with agency policies K. Lehnert, ESIP Panel on Data Quality 7/13/11

Conclusions (I): Science Community Needs to define the disciplinary norms for DQ measures Needs to drive the implementation of disciplinary standards Policies for data reporting & publication Recommendations for data acquisition K. Lehnert, ESIP Panel on Data Quality 7/13/11

Conclusions (II): Technology Needs to translate disciplinary standards to technical standards Needs to provide software tools that facilitate DQ management (capture, communication, & assessment) K. Lehnert, ESIP Panel on Data Quality 7/13/11

Conclusion (III) Science and technology need to work closely to develop meaningful solutions for DQ management. The process needs to take into account the diversity of Earth Science disciplines and data types. K. Lehnert, ESIP Panel on Data Quality 7/13/11