Large Scientific Databases. Large scientific datasets are those which are systematically collected and organized and which stretch the technical capabilities.

Presentation transcript:

Large Scientific Databases

Large scientific datasets are those that are systematically collected and organized and that stretch the technical capabilities of our species to store, manipulate, and distribute data for scientific investigation, thereby limiting that investigation.

What is a “small” dataset? “Only a few hundred gigabytes.” -Alex Szalay

What about non-scientific databases? Why not Google?

Fields producing these datasets
Observational data
– Earth and space sciences: Astronomy and Astrophysics, Space Physics, Atmospheric Science, Geoscience, Ocean Science
Experimental Laboratory Data
– CERN
[From Preserving Scientific Data on Our Physical Universe (Washington: National Academy Press, 1995)]

Observations
The datasets being collected are huge and will keep growing.
These datasets stretch the technical capabilities of what our species can do with computer applications and hardware, thus limiting what we can learn.
There are bottlenecks in storage, manipulation, and distribution.
There is not enough bandwidth for scientific use of datasets at the sizes that now exist.
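To make the bandwidth bottleneck concrete, here is a minimal back-of-the-envelope sketch. It assumes an idealized link with no protocol overhead, and the function name, dataset sizes, and link speeds are hypothetical values chosen only for illustration; none of them come from the presentation apart from the "few hundred gigabytes" scale.

```python
def transfer_time_hours(size_gigabytes: float, link_megabits_per_s: float) -> float:
    """Idealized time to move a dataset over a link, ignoring protocol overhead."""
    size_bits = size_gigabytes * 1e9 * 8          # decimal gigabytes -> bits
    link_bits_per_s = link_megabits_per_s * 1e6   # megabits/s -> bits/s
    return size_bits / link_bits_per_s / 3600     # seconds -> hours


if __name__ == "__main__":
    # Hypothetical dataset sizes (GB) and link speeds (Mbit/s), for illustration only.
    for size_gb in (300, 30_000):
        for link_mbps in (10, 100, 1000):
            hours = transfer_time_hours(size_gb, link_mbps)
            print(f"{size_gb:>6} GB over {link_mbps:>4} Mbit/s: {hours:8.1f} h")
```

Under these assumptions, transfer time grows linearly with dataset size, so a dataset a hundred times larger needs a link a hundred times faster just to keep distribution time constant.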

More observations
Other disciplines may already have solutions to some of the problems that scientists working with large datasets are wrestling with:
– Library & Information Science
– Graphics
– Hardware and software vendors
They shouldn't all have to reinvent everything separately.

Is there a field?
Connections between scientists working on large datasets appear to be informal.
Bringing together scientists who work with large datasets would be useful, because different groups may already have solved different problems or may have useful insights to share.
There is an extensive literature, but it is technical and largely not self-aware.

Is there a field? (2)
On a broader scale, if in 10 years these datasets can be put on a desktop computer, there will be scientists out gathering even bigger datasets. It is what humans do.
Can principles be derived from current experience that will help deal with those future, larger limits?
Can we focus on this aspect of science?

Ancillary issues
– Policy
– Characteristics of the data
– etc.

What next?
Conference?
– Gather the scientists, vendors, and disciplines that might help the scientists
Literature review