

1 Foundations VI: Discovery, Access and Semantic Integration Data Mining and Knowledge Discovery - Continued Deborah McGuinness and Joanne Luciano with Peter Fox and Li Ding CSCI Week 13, November 29, 2010

Extra

Knowledge Discovery
Has a broad meaning
–Finding ontologies
–Creating new knowledge from
  Previous knowledge
  New sources (data, information)
  Modeling
We’ll look at a mining approach as an example

Mining
We will start with data, but the ideas apply to information and knowledge bases as well
Definition
History
Our interest

SAM: Smart Assistant for Earth Science Data Mining PI: Rahul Ramachandran Co-I: Peter Fox, Chris Lynnes, Robert Wolf, U.S. Nair

Science Motivation
Study the impact of natural iron fertilization processes, such as dust storms, on plankton growth and subsequent DMS production
–Plankton plays an important role in the carbon cycle
–Plankton growth is strongly influenced by nutrient availability (Fe/Ph)
–Dust deposition is an important source of Fe over the ocean
–Satellite data is an effective tool for monitoring the effects of dust fertilization
Analysis entails
–Mining MODIS L1B data for dust storm events and identifying the swath of area influenced by the passage of the dust storms
–Examining correlations between fertilization, plankton growth and DMS production

Current Analysis Process
MODIS aerosol products don’t provide speciation
Locate and download all the data to a local machine
Write code to classify and detect dust accurately [3-4 month effort]
Write code to classify and detect other dust aerosols [3-4 month effort]
Write code to segment the detected region in order to account for advection effects and the correlation coefficient [2 month effort]
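To give a sense of what the "segment the detected region" step involves, here is a minimal sketch, assuming the dust detection step has already produced a binary mask (1 = dust pixel, 0 = clear). It labels 4-connected regions with a breadth-first flood fill. The function name `label_regions` and the toy mask are illustrative assumptions, not part of the original analysis code, and real MODIS swaths would need a far more careful treatment.

```python
from collections import deque


def label_regions(mask):
    """Label 4-connected regions of truthy cells in a 2D list.

    Returns (labels, count), where labels has the same shape as mask,
    0 marks background, and regions are numbered 1..count.
    """
    rows = len(mask)
    cols = len(mask[0]) if rows else 0
    labels = [[0] * cols for _ in range(rows)]
    count = 0
    for r in range(rows):
        for c in range(cols):
            if not mask[r][c] or labels[r][c]:
                continue  # background, or already assigned to a region
            count += 1
            labels[r][c] = count
            queue = deque([(r, c)])
            while queue:
                y, x = queue.popleft()
                # Visit the four direct neighbours of the current pixel.
                for dy, dx in ((1, 0), (-1, 0), (0, 1), (0, -1)):
                    ny, nx = y + dy, x + dx
                    if (0 <= ny < rows and 0 <= nx < cols
                            and mask[ny][nx] and not labels[ny][nx]):
                        labels[ny][nx] = count
                        queue.append((ny, nx))
    return labels, count


# Toy dust mask (hypothetical values) with two separate dust regions.
dust_mask = [
    [1, 1, 0, 0],
    [0, 1, 0, 1],
    [0, 0, 0, 1],
]
labels, count = label_regions(dust_mask)
```

Each labeled region could then be averaged separately before correlating it with chlorophyll, which is one simple way to keep distinct dust plumes from being mixed together.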

Analysis with SAM
Create a workflow to perform classification using many different state-of-the-art classifiers on distributed data
Create a workflow to segment detected regions using image processing services on distributed data
Bottom line: The scientist does not have to write all the code to perform the analysis
–Can compose workflows that utilize distributed data/services
–Can share the workflow with others to collaborate, reuse and modify

Conducting Science using Internet as the Primary Computer

Mash-ups Example: Yahoo Pipes

Data Mining in the ‘new’ Distributed Data/Services Paradigm

Too many choices!! And that’s only part of the toolkit
The ADaM-IVICS toolkit has 100+ algorithms

SAM Objectives
Improve the usability of Earth Science data by existing data mining services for research by incorporating semantics into the workflow composition process.
–Semantic search capable of mapping a conceptual task
–Assistance in mining workflow composition
–Verification that services are connected in a semantically correct fashion
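The verification objective can be illustrated with a small sketch. It assumes each workflow step declares the kind of data it consumes and produces, and it flags any adjacent pair where the two do not match. Plain string labels stand in here for ontology concepts; the step names, type labels and the `verify_workflow` function are hypothetical and are not SAM's actual interface.

```python
def verify_workflow(steps):
    """Check that each step's output type matches the next step's input type.

    steps is a list of (name, input_type, output_type) tuples.
    Returns a list of human-readable problems; empty means the chain is consistent.
    """
    problems = []
    for (a_name, _, a_out), (b_name, b_in, _) in zip(steps, steps[1:]):
        if a_out != b_in:
            problems.append(
                f"{a_name} -> {b_name}: produces {a_out!r}, expects {b_in!r}"
            )
    return problems


# Hypothetical three-step mining workflow.
workflow = [
    ("hdf_reader", "HDF-EOS", "radiance_grid"),
    ("dust_classifier", "radiance_grid", "dust_mask"),
    ("region_segmenter", "dust_mask", "region_labels"),
]
ok_problems = verify_workflow(workflow)

# Skipping the classifier breaks the chain: the segmenter expects a dust mask.
broken_problems = verify_workflow([workflow[0], workflow[2]])
```

An ontology-backed checker could go further than exact label equality, for example by accepting a subclass of the expected input concept, but the pairwise check over adjacent steps would stay the same.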

Ontology Use

Semi-automated Workflow Composition Filtering services based on data format

Semi-automated Workflow Composition Filtering service options based on both data format and task selected
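The two filtering steps above can be sketched as follows, assuming a small in-memory catalog in which every service records its task and the data format it accepts. The catalog entries, the `Service` fields and `filter_services` are illustrative assumptions and do not reflect the real ADaM-IVICS service descriptions.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Service:
    name: str
    task: str
    input_format: str
    output_format: str


# Hypothetical catalog; names and formats are placeholders.
CATALOG = [
    Service("kmeans_classifier", "classification", "ARFF", "ARFF"),
    Service("bayes_classifier", "classification", "ARFF", "ARFF"),
    Service("arff_normalizer", "preprocessing", "ARFF", "ARFF"),
    Service("region_grower", "segmentation", "binary_image", "binary_image"),
    Service("hdf_to_arff", "conversion", "HDF-EOS", "ARFF"),
]


def filter_services(catalog, data_format, task=None):
    """Keep services that accept data_format; optionally also require a task."""
    matches = [s for s in catalog if s.input_format == data_format]
    if task is not None:
        matches = [s for s in matches if s.task == task]
    return matches


# Step 1: filter on data format only.
by_format = [s.name for s in filter_services(CATALOG, "ARFF")]
# Step 2: filter on both data format and the selected task.
by_format_and_task = [
    s.name for s in filter_services(CATALOG, "ARFF", task="classification")
]
```

Narrowing by format first and task second mirrors the order of the slides, and each added constraint shrinks the list of options the scientist has to consider.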

Semi-automated Workflow Composition Final Workflow

Science Motivation
Study the impact of natural iron fertilization processes, such as dust storms, on plankton growth and subsequent DMS production
–Plankton plays an important role in the carbon cycle
–Plankton growth is strongly influenced by nutrient availability (Fe/Ph)
–Dust deposition is an important source of Fe over the ocean
–Satellite data is an effective tool for monitoring the effects of dust fertilization

Hypothesis
In remote ocean locations there is a positive correlation between the area-averaged atmospheric aerosol loading and oceanic chlorophyll concentration
There is a time lag between oceanic dust deposition and the photosynthetic activity
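One way to test both parts of this hypothesis is a lagged correlation: correlate the aerosol series at time t with the chlorophyll series at time t + lag and see which lag gives the strongest positive value. The sketch below assumes two equal-length, regularly sampled, area-averaged series. The function names and the synthetic numbers are illustrative assumptions, not the study's data or code.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two equal-length sequences with at least 2 points")
    mx = sum(x) / n
    my = sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    if vx == 0 or vy == 0:
        raise ValueError("correlation is undefined for a constant series")
    return cov / sqrt(vx * vy)


def lagged_correlation(aot, chl, max_lag):
    """Correlate aot[t] with chl[t + lag] for each lag in 0..max_lag."""
    result = {}
    for lag in range(max_lag + 1):
        x = aot[: len(aot) - lag]  # drop the last `lag` aerosol samples
        y = chl[lag:]              # drop the first `lag` chlorophyll samples
        result[lag] = pearson(x, y)
    return result


def best_lag(aot, chl, max_lag):
    """Return the lag with the highest correlation."""
    corr = lagged_correlation(aot, chl, max_lag)
    return max(corr, key=corr.get)


# Synthetic example: chlorophyll responds linearly to AOT two steps later.
aot = [0.1, 0.5, 0.2, 0.1, 0.6, 0.3, 0.1, 0.2]
chl = [0.2, 0.2] + [2 * a + 0.1 for a in aot[:-2]]
corr = lagged_correlation(aot, chl, max_lag=3)
lag = best_lag(aot, chl, max_lag=3)
```

With real satellite series the peak would be much less sharp, and a negative peak is also possible, so the full table of correlations by lag is usually more informative than the single best lag.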

Primary sources of ocean nutrients
Wind-blown dust (Sahara)
Sediments from rivers
Ocean upwelling

Factors modulating dust-ocean photosynthetic effect
[Diagram labels: Sahara dust, SST, clouds, nutrients, chlorophyll]

Objectives
Use satellite data to determine whether atmospheric dust loading and phytoplankton photosynthetic activity are correlated
Determine the physical processes responsible for the observed relationship

Preliminary Results

Data and Method
Data sets obtained from SeaWiFS and MODIS during 2000 – 2006 are employed
MODIS-derived AOT

The areas of study
1- Tropical North Atlantic Ocean
2- West coast of Central Africa
3- Patagonia
4- South Atlantic Ocean
5- South Coast of Australia
6- Middle East
7- Coast of China
8- Arctic Ocean
*Figure: annual SeaWiFS chlorophyll image for 2001

Tropical North Atlantic Ocean → dust from Sahara Desert
[Figure panels: Chlorophyll, AOT]

Arabian Sea → dust from Middle East
[Figure panels: Chlorophyll, AOT]

Summary and future work
Dust impacts ocean photosynthetic activity: positive correlations in some areas, NEGATIVE correlations in other areas, especially in the Saharan basin
Hypothesis for explaining observations of negative correlation: in areas that are not nutrient limited, dust reduces photosynthetic activity
But we also need to consider the effect of clouds and ocean currents
Also need to isolate the effects of dust: the MODIS AOT product includes contributions from dust, DMS, biomass burning, etc.

Case for SAM
MODIS aerosol products don’t provide speciation
Why is performing this data analysis hard?
–Need to classify and detect dust accurately
–Need to classify and detect other aerosols (e.g., DMS) accurately
–Need to segment the detected region in order to account for advection effects and the correlation coefficient
What will SAM provide?
–Capability to create a workflow to perform classification
–Capability to create a workflow to segment detected regions
Bottom line: The scientist does not have to write all the code to perform the analysis
–Can compose workflows that utilize distributed data/services
–Can share the workflow with others to collaborate, reuse and modify