Silvia Nittel University of California, Los Angeles Scientific Data Mining in ESP2Net.

Slides:



Advertisements
Similar presentations
NCAS-Climate: Carries out research into climate change and variability, motivated by the need to understand how the climate system will evolve over the.
Advertisements

List of Nominations Connecting User Needs with Weather Research and Forecasts Rebecca E. Morss National Center for Atmospheric Research Boulder, Colorado,
Climate Change: Science and Modeling John Paul Gonzales Project GUTS Teacher PD 6 January 2011.
Draft Essential Principles with Fundamental Concepts By Marlene Kaplan & David Herring NOAA & NASA.
1 Cyberinfrastructure Framework for 21st Century Science & Engineering (CIF21) NSF-wide Cyberinfrastructure Vision People, Sustainability, Innovation,
Weather and Water Unit Unit Portfolio Presentation Facilitator: Mary Trent Sixth Grade Science.
April “ Despite the increasing importance of mathematics to the progress of our economy and society, enrollment in mathematics programs has been.
Weather & Climate. As a class, brainstorm the meanings of the words weather and climate and some examples of both. Write down your responses in the space.
Report on Intrusion Detection and Data Fusion By Ganesh Godavari.
Global Climate Change Sara Parr Sigrid Smith Kellogg Biological Station.
Data Mining – Intro.
Data mining By Aung Oo.
Advanced Database Applications Database Indexing and Data Mining CS591-G1 -- Fall 2001 George Kollios Boston University.
Brenda Woods John Williams Daniel Bailey Breia Stamper.
Chapter 1 Overview of Databases and Transaction Processing.
1 Building National Cyberinfrastructure Alan Blatecky Office of Cyberinfrastructure EPSCoR Meeting May 21,
V. Chandrasekar (CSU), Mike Daniels (NCAR), Sara Graves (UAH), Branko Kerkez (Michigan), Frank Vernon (USCD) Integrating Real-time Data into the EarthCube.
Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence From Data Mining To Knowledge.
CLIMATE CHANGE THE GREAT DEBATE Session 10. CLIMATE CHANGE? If we have learnt anything from this course, it is that climate is not constant It is, and.
Lecture 6: The Hydrologic Cycle EarthsClimate_Web_Chapter.pdfEarthsClimate_Web_Chapter.pdf, p. 10, 16-17, 21, 31-32, 34.
1 Peter Fox Data Science – ITEC/CSCI/ERTH Week 6, October 5, 2010 Introduction to Data Mining.
Weather and Climate Part II
Report on March Crystal City Workshop to Identify Grand Challenges in Climate Change Science By its cochair- Robert Dickinson For the 5 Sept
IPlant cyberifrastructure to support ecological modeling Presented at the Species Distribution Modeling Group at the American Museum of Natural History.
Report on Intrusion Detection and Data Fusion By Ganesh Godavari.
INTERACTIVE ANALYSIS OF COMPUTER CRIMES PRESENTED FOR CS-689 ON 10/12/2000 BY NAGAKALYANA ESKALA.
Data Analysis. What is unique about phenology? Data is sparse Definition of many phenological events is fuzzy More dependence on visual interpretation.
World Climate Research Programme Climate Information for Decision Making Ghassem R. Asrar Director, WCRP.
Lecture 5 The Climate System and the Biosphere. One significant way the ocean can influence climate is through formation of sea ice. Sea ice is much more.
Data Mining – Intro. Course Overview Spatial Databases Temporal and Spatio-Temporal Databases Multimedia Databases Data Mining.
* The relative measure of the amount of water vapor in the air * Psychrometer – measures the humidity * Water vapor affects the density of the air. * Cold.
Geosciences - Observations (Bob Wilhelmson) The geosciences in NSF’s world consists of atmospheric science, ocean science, and earth science Many of the.
The Carbon Cycle Upwelling Ocean Currents Abrupt Climate Change
Modern Era Retrospective-analysis for Research and Applications: Introduction to NASA’s Modern Era Retrospective-analysis for Research and Applications:
Lesson #8 Climate & Weather Patterns Earth & Space Science.
Science Weather Review
Data Mining BY JEMINI ISLAM. Data Mining Outline: What is data mining? Why use data mining? How does data mining work The process of data mining Tools.
Mining Weather Data for Decision Support Roy George Army High Performance Computing Research Center Clark Atlanta University Atlanta, GA
Comparing Climate vs. Weather. Climate Change Video #1: Climate Change Video #2:
Unit 4 Lesson 2 pg Big Idea How Do the Oceans and the Water Cycle Affect Weather?
Foundations of Business Intelligence: Databases and Information Management.
9/03 Data Mining – Introduction G Dong (WSU)1 CS499/ Data Mining Fall 2003 Professor Guozhu Dong Computer Science & Engineering WSU.
Trends in Tropical Water Vapor ( ): Satellite and GCM Comparison Satellite Observed ---- Model Simulated __ Held and Soden 2006: Robust Responses.
1 Proposal for a Climate-Weather Hydromet Test Bed “Where America’s Climate and Weather Services Begin” Louis W. Uccellini Director, NCEP NAME Forecaster.
WEATHER AND CLIMATE Weather is a great metaphor for life - sometimes it's good, sometimes it's bad, and there's nothing much you can do about it but carry.
Lesson 3: Air Masses. What is an Air Mass? Air masses are large areas of air with similar temperature, humidity, and pressure.
World Geography CHAPTER 3 NOTES.  A. What is the main cause of the earth’s seasons/weather?  Tilt of the Earth and the revolution of the Earth I. SEASONS.
Vision of an Integrated Global Observing System Gregory W. Withee Assistant Administrator for Satellite and Information Services National Oceanic and Atmospheric.
1. What is a thin blanket of air that surrounds the Earth?
The Role of Solar Energy (continued)
Text Information Management ChengXiang Zhai, Tao Tao, Xuehua Shen, Hui Fang, Azadeh Shakery, Jing Jiang.
ERP and Related Technologies
NASA Earth Exchange (NEX) A collaborative supercomputing environment for global change science Earth Science Division/NASA Advanced Supercomputing (NAS)
How Convection Currents Affect Weather and Climate.
Global Warming The heat is on!. What do you know about global warming? Did you know: Did you know: –the earth on average has warmed up? –some places have.
Chapter 1 Overview of Databases and Transaction Processing.
Introduction to Business Analytics
Lesson 67: Hurricane! Extreme Physical Change.
Knowledge Discovery in a DBMS Data Mining Computing models and finding patterns in large databases current major challenge in database systems & large.
Factors that Affect Climate What is Climate? Weather conditions of an area including any variations from the norm. Exchange of energy and moisture.
Climate Change Spring 2016 Kyle Imhoff. Let’s start with the big picture (climate forcings)…
Chapter 3: Physical Geography Climate and Vegetation
Tools and Services Workshop
Joslynn Lee – Data Science Educator
What Causes Different Climates?
Geospatial Technology in Climate Change
8th Grade Matter and Energy in Organisms and Ecosystems
WATER IN THE ATMOSPHERE
Big DATA.
What is the difference between climate and weather
Presentation transcript:

Silvia Nittel University of California, Los Angeles Scientific Data Mining in ESP2Net

Silvia NittelGeoSKI Februar 2000 Overview Motivation What is scientific data mining ? Examples of scientific data mining at UCLA CS interests in scientific data mining –Tools –Collaboration paradigms –Interoperability

Silvia NittelGeoSKI Februar 2000 Motivation The advent of the computer has brought with it the ability to generate and store huge amounts of data. Business data (DBs) Scientific Data What is it ? The process of extracting useful information has become more formalized and the term Data Mining has been coined for it.

Silvia NittelGeoSKI Februar 2000 What is data mining ? Definition: Data mining is the process of extracting previously unknown, comprehensible, valid and actionable information from large data stores (and using it to make crucial business decisions). There are two approaches: –verification driven, whose aim is to validate a hypothesis postulated by a user, or –discovery driven, which is the automatic discovery of information by the use of appropriate tools. The discovery driven approach depends on a more sophisticated and structured search of the data for associations, patterns, rules or functions, and then having the analyst review them for value.

Silvia NittelGeoSKI Februar 2000 Process

Silvia NittelGeoSKI Februar 2000

Silvia NittelGeoSKI Februar 2000 What is scientific data mining ? Data mining started with “simple info” (business data) like in DBMS; this is called OLAP (online analytical processing). Scientific data mining: –Data is more complex. –Data is much larger. –Often discovery-oriented approach used. Medicine, Biology, Physics, Weather… Principles of a science method: –observation-hypothesis-experiment cycle Data mining for science: –“observation-hypothesis” supported by discovery driven mining –“hypothesis-experiment” supported by verification driven mining

Silvia NittelGeoSKI Februar 2000 Example: Farming Environment Goal: –optimization of crop yield while minimizing the resources supplied. –How: identify what factors affect the crop yield, One analysis looked at over 64 separate items measured over a number of years to extract the items that were significant. Initially analysis: discovery driven mining –To attempt to find what parameters were significant, either by themselves or in conjunction with others. –Use of statistical methods to determine the parameters that are significant and their relative influence. –Result: derive equation of interdependence Later on: verify equation via verification driven mining against new datasets.

Silvia NittelGeoSKI Februar 2000 Example: Global Climate Change Often a verification driven mining approach. –Climate data has been collected for many centuries. –It is extended into the more distant past through such activities as analysis of ice core samples from the Antarctic. –At the same time, a number of different predictive models have been proposed for future climatic conditions. Use predicitive model: –Use sample data from the past –Verify the predictive models by Using them on historical data then compared the results with the sample data. –From this, the models can then be refined further and used for another round of verification driven mining.

Silvia NittelGeoSKI Februar 2000 Scientific Data Mining at UCLA Project scope: –ESP2Net: Earth Science Partners’ Private Network Computer science: UCLA, HRL, Earth science: JPL, Scripps, U Arizona Scientific data mining: –Verification driven approach –Large amounts of raster satellite data

Silvia NittelGeoSKI Februar “Warm pool” develops in tropical Pacific ocean 3 Vigorous convection produces very high cold clouds 4 Storm systems push “moisture flare” Eastward 2 Warm moist air rapidly rises 5 Heavy rainfall over Southwest U.S. VPN Hypothesis: Coastal rainfall correlated with remote convective events in tropical Pacific ISCCP DX, CL UA Cluster operators Matching operators JPL TOVS, NVAP, MLS Tracking operators Statistical operators Scripps Precipitation Correlation operators GLINT operators Scientific Data Mining at UCLA

Silvia NittelGeoSKI Februar 2000 Visualization Convective cloud cluster motion –ISCCP CL, March (UA) Water vapor motion in the atmosphere –NVAP, March (Scripps) Different perspective reveals new info –NVAP stacking and slicing (JPL) Cloud movie Water vapor movie

Silvia NittelGeoSKI Februar 2000 Challenges of Scientific Data Mining Challenges : Distributed collaboration –share results (passive) –share analysis processes (active) Leverage partners expertise and efforts Re-use core analysis tools (operators) Large datasets, decadal time spans (> ½ TB data) Project goal: Build a flexible and extensible framework for scientific investigations which are Distributed and internet-based, provide reusable, extensible, efficient tools, address interoperability and collaboration

Silvia NittelGeoSKI Februar 2000 UCLA Support of Scientific Data Mining Re-usable Tools: –Conquest (CONcurrent Queries in Space and Time) Collaboration Support: –Scientific Markup Language (SEML): XML-based Scientific Experiment Logbook –Conquest (Distributed Queries) –Secure Collaboration (Virtual Private Networks) Interoperability –OpenGIS standard to represent data –CORBA –Java

Silvia NittelGeoSKI Februar 2000 Summary Scientific data mining is a relatively new research area (first conference in 1994, KKD) Science (hypothesis) Statistics (methods) Computer Science (visualization, animation)