CODATA 2006 Beijing - E-Science Session The Role of Scientific Data in e-Science: How Do We Preserve All Necessary Data So They are Useful John Rumble.

Slides:



Advertisements
Similar presentations
Ferdinand de Saussure Cours de Linguistique Generale.
Advertisements

Copyright © Allyn & Bacon (2007) Research Methodology: An Evolving Discipline Graziano and Raulin Research Methods: Chapter 15 This multimedia product.
Yeast Lab!. What makes something living? Consider the following questions… How big/complex must something be? What must it be able to do? Where must it.
Basic Test Taking Tips 40 questions – 35 minutes.
Social and behavioral scientists building cyberinfrastructure David W. Lightfoot Assistant Director, National Science Foundation Social, Behavior & Economic.
CSCI 3 Introduction to Computer Science. CSCI 3 Course Description: –An overview of the fundamentals of computer science. Topics covered include number.
Genetic Algorithms Learning Machines for knowledge discovery.
Science and Engineering Practices
Biology Chapter 1 The Science of Biology
A Semantic Workflow Mechanism to Realise Experimental Goals and Constraints Edoardo Pignotti, Peter Edwards, Alun Preece, Nick Gotts and Gary Polhill School.
Framework for K-12 Science Education
On Roles of Models in Information Systems (Arne Sølvberg) Gustavo Carvalho 26 de Agosto de 2010.
The Science of Life Biology unifies much of natural science
Data Science and Scientific Discovery New Approaches to Nature’s Complexity Dr. John Rumble President R&R Data Services Gaithersburg MD
Crosscutting Concepts and Disciplinary Core Ideas February24, 2012 Heidi Schweingruber Deputy Director, Board on Science Education, NRC/NAS.
SCI Scientific Inquiry The Big Picture: Science, Technology, Engineering, etc.
Chapter 1: Introduction to Chemistry
Introduction to Biology
KEY CONCEPT Biology is the study of all forms of life.
KEY CONCEPT Science is a way of thinking, questioning, and gathering evidence.
Biology I.  Biology offers a framework to pose and answer questions about the natural world.  What do Biologists study?  Questions about how living.
Chapter 1.  Length: Measured in Meters, Centimeters, and Millimeters  Mass: Measured in Grams and Kilograms  Volume: Measured in Liters and Milliliters.
Big Idea 1: The Practice of Science Description A: Scientific inquiry is a multifaceted activity; the processes of science include the formulation of scientifically.
Yeast Lab!. What makes something living? Consider the following questions… How big/complex must something be? What must it be able to do? Where must it.
Unit 1 The Basics of Biology. Goals of All Science Investigate and Understand the natural world Explain what happens in the natural world Predict what.
Nature of Science. NOS Card Exchange Step 1: Obtain 8 cards (that are different from one another). Step 2: Trade cards with classmates in order to amass.
What is a Business Analyst? A Business Analyst is someone who works as a liaison among stakeholders in order to elicit, analyze, communicate and validate.
Linguistics Introduction.
Modeling and simulation of systems Model building Slovak University of Technology Faculty of Material Science and Technology in Trnava.
Studying Life Vodcast 1.3 Unit 1: Introduction to Biology.
Chapter 1 The Science of Biology. (What is science?) The Nature of Science.
WHAT IS SCIENCE? WHAT IS SCIENCE? An organized way of gathering and analyzing evidence about the natural world.
Artificial Intelligence By Michelle Witcofsky And Evan Flanagan.
WHAT IS SCIENCE? WHAT IS SCIENCE? An organized way of gathering and analyzing evidence about the natural world.
Roadmap Activity 2a: A GEOSS citation standard : Hans-Peter Plag IEEE University of Nevada, Reno, Nevada, USA;
Chapter 1: The Science of Life. The Science of Life Chapter 1 Table of Contents Section 1 The World of BiologySection 1 The World of Biology –What is.
Russ Hobby Program Manager Internet2 Cyberinfrastructure Architect UC Davis.
IjCSCL invited symposium : “ productive tensions in CSCL” Jürgen Buder Ulrike Cress Friedrich W. Hesse Timothy Koschmann Peter Reimann Gerry Stahl Daniel.
Chapter 1 The Science of Biology. Section 1 – What is Science? The goal of science is to investigate and understand nature, to explain events in nature,
BAA - Big Mechanism using SIRA Technology Chuck Rehberg CTO at Trigent Software and Chief Scientist at Semantic Insights™
WHAT IS THE NATURE OF SCIENCE?. SCIENTIFIC WORLD VIEW 1.The Universe Is Understandable. 2.The Universe Is a Vast Single System In Which the Basic Rules.
I Robot.
CHAPTER 1 – THE SCIENCE OF BIOLOGY What Is Science? (A) Organized way of using evidence to learn about the natural world. (B) Collection of knowledge that.
Question? How does science affect you in your daily life?
1.3 Scientific Thinking and Processes KEY CONCEPT Science is a way of thinking, questioning, and gathering evidence.
AL-MAAREFA COLLEGE FOR SCIENCE AND TECHNOLOGY INFO 232: DATABASE SYSTEMS CHAPTER 1 DATABASE SYSTEMS Instructor Ms. Arwa Binsaleh.
 Many different methodologies are used to study cognitive science. As the field is highly interdisciplinary, research often cuts across multiple areas.
The Science of Biology Remember to wait for the vocabulary word to pop up! Can you guess it before it comes up? **Indicates Bonus (and important) Vocabulary!
Breakout # 1 – Data Collecting and Making It Available Data definition “ Any information that [environmental] researchers need to accomplish their tasks”
Click on a lesson name to select. The Study of Life Section 1: Introduction to Biology Section 2: The Nature of Science Section 3: Methods of Science.
Theme 2: Data & Models One of the central processes of science is the interplay between models and data Data informs model generation and selection Models.
Resources and Reflections: Using Data in Undergraduate Geosciences Cathy Manduca SERC Carleton College DLESE Annual Meeting 2003.
Computational Science & Engineering meeting national needs Steven F. Ashby SIAG-CSE Chair March 24, 2003.
Lecture №1 Role of science in modern society. Role of science in modern society.
What is science? an organized way of investigating and using evidence to learn about the natural world.
Scientific Method 1.Observe 2.Ask a question 3.Form a hypothesis 4.Test hypothesis (experiment) 5.Record and analyze data 6.Form a conclusion 7.Repeat.
Essential Questions What is biology? What are possible benefits of studying biology? What are the characteristics of living things? Introduction to Biology.
Introduction to Biology. Section 1  Biology and Society Biology  The study of life.
INTRODUCTION TO COGNITIVE SCIENCE NURSING INFORMATICS CHAPTER 3 1.
Bar Graphs Used to compare amounts, or quantities The bars provide a visual display for a quick comparison between different categories.
1 This Changes Everything: Accelerating Scientific Discovery through High Performance Digital Infrastructure CANARIE’s Research Software.
LEARNING OBJECTIVES 8 TH Grade Blue Team Science.
WHAT IS THE NATURE OF SCIENCE?
The Science of Biology Chapter 1.
Chapter 1 The Science of Biology.
What Is Science? Read the lesson title aloud to students.
Qualitative Observation
Biology and You.
Ch. 1 Review A test which determines the effect of a single variable by changing it while keeping all other variables the same. Controlled Experiment.
The Nature of Science Ideas are supported by testable and verifiable data or observation Science is a process or way to investigate nature. Scientific.
Presentation transcript:

CODATA 2006 Beijing - E-Science Session The Role of Scientific Data in e-Science: How Do We Preserve All Necessary Data So They are Useful John Rumble Technical Director Information International Associates Oak Ridge TN USA

CODATA 2006 Beijing - E-Science Session Data and e-Science Data express the quantitative results of scientific research Experiments on nature –Testing an isolated and controlled part of the natural world Observations of nature –Making measurements on the natural world as it is found but not controlled Calculations on models of nature –Creating a virtual world containing some, but not all, factors that control natural phenomenon Today fostered and facilitated by e-Science

CODATA 2006 Beijing - E-Science Session Data and e-Science Virtually all data are generated, collected and preserved digitally by scientists in an e-Science environment Preserved data collections allow others to reuse past measurements rather than generate new measurements to develop new ideas and knowledge Today’s large scale data collections are a new source of scientific discoveries Both an extension of traditional scientific method and a new discovery mechanism

CODATA 2006 Beijing - E-Science Session Challenges to the Preservation and Reuse of Data in e-Science The gap between an ideal experiment, observation and calculation and reality Large number of independent variables The evolution of scientific knowledge and language Changing scientific language The multi-center nature of scientific research Science is a team effort – across disciplines, places and time

CODATA 2006 Beijing - E-Science Session The gap between an ideal experiment, observation and calculation and reality Real systems are very complex Large number of independent variables The Challenge Real experiments, observations and calculations do not control, capture or record all independent variables The reporting of independent variables changes over time as scientific knowledge increases

CODATA 2006 Beijing - E-Science Session Time and Independent Variables Independent variables are the quantitative mechanism for expressing our knowledge about how and why a phenomenon occurs Capturing complete knowledge of independent variables requires a large or (perhaps) even an impossible amount of data One goal of research is to understand which variables are important and why And which variables to report! Our knowledge clearly evolves over time

CODATA 2006 Beijing - E-Science Session Time and Independent Variables Major challenge of data standards is to capture evolution of knowledge of independent variables The set of variables we must report today is not adequate tomorrow Standards must allow for growth of knowledge Yet must also enable compatibility of data generated at different times Let’s work through a quick example of the complexity

CODATA 2006 Beijing - E-Science Session Time and Independent Variables Brain imaging Recording techniques evolve and improve over time –X-ray, CT, MRI, PET, next? Each technology individually evolves, as do the types of signals collected, their association with brain activity and region Monitoring reactions to stimulus: pain, visual, auditory, tactile, etc. Details of independent variables must be defined and recorded

CODATA 2006 Beijing - E-Science Session Time and Independent Variables Consider brain history If we imagine the details necessary to describe this, the number of independent variables expands rapidly –Stimuli history, physiological history, developmental history, environmental exposures, education, more As with the development of unifying theories of the large- scale physical world – motion, evolution, chemistry, genetics - the details are necessary to find the dominant factors What are the most important independent variables for recording brain history? Still an open question and will change over time!

CODATA 2006 Beijing - E-Science Session Time and Scientific Language How do languages evolve? John McWhorter – The Power of Babel Contractions of words Reordering sentences Borrowing words Dropping and adding of word beginnings and endings Differentiation of concepts Evolution of concepts These are powerful change factors that cannot be ignored in preserving data

CODATA 2006 Beijing - E-Science Session Time and Scientific Language Data preservation efforts must recognize evolution of scientific language Not just independent variables and metadata – the scientific language itself As concepts change, the definition of the word(s) defining the concept change As different disciplines begin to overlap, slightly different concepts merge into a unified concept, different from each original, often using the same word

CODATA 2006 Beijing - E-Science Session Time and Scientific Language Examples of evolving concepts and language Atomic nucleus Nonsense DNA Chemical bonding Dam The tools to locate and determine the equivalency of scientific and technical words and language over time are just beginning to be developed

CODATA 2006 Beijing - E-Science Session Data and e-Science Yesterday Collections managed by a small number of people Collections readable by one scientist Collections interpretable by one person Discoveries made by thinking, with analysis by one person Today and the Future èCollections managed by groups èCollections not readable by any individual èCollections interpretable only with aid of software èDiscoveries made by computers, with verification by people

CODATA 2006 Beijing - E-Science Session Data in e-Science One goal of e-Science is to create large data sets consisting of all measurements relevant to a system, phenomena, filed of scientific discourse This can be done only be combining data generated over time, by different groups, looking at different features (independent variables) and using different language To support discovery, the aggregation of these different data sets must be legitimate The role of data is central to e-Science and these challenges must be met