Interoperability of Data and Processing Services: Experience Using the DataFed System Rudolf B. Husar, Stefan R. Falke and Kari Hoijarvi Washington University,

Slides:



Advertisements
Similar presentations
WCS Data Exchange at the DataFed Server/Client Center for Air Pollution Impact and Trend Analysis (CAPITA) Washington University, St. Louis, MO OGC TC.
Advertisements

1 NASA CEOP Status & Demo CEOS WGISS-25 Sanya, China February 27, 2008 Yonsook Enloe.
Web Services Implementation Case Study: DataFed Air Quality Data & Services Project Coordinators: Software Architecture: R. Husar Software Implementation:
OGC Demo at IGARSS06 July 30 - Denver, CO Telecon 11 July 2006 Liping Di, George Mason University Rudolf Husar, Washington University.
Virtual Product of Air Quality Demo Scripts Liping Di, Peisheng Zhao Center for Spatial Information Science and Systems (CSISS) George Mason University.
Web 2.0 in Air Quality Analysis and Management Rudolf B. Husar and Erin M. Robinson Washington University, St. Louis, MO 2007 National Environmental Information.
Talk Outline Air Quality Information System Challenges (5min) –Real-time monitoring and data delivery (1 slide) –Characterization of pollutant in.
DRAFT June 6, 2005 ESIP AQ Cluster, Air Quality Cluster Air Quality Cluster TechTrack Earth Science Information Partners Partners NASA.
DRAFT April 28, 2005 ESIP AQ Cluster, The data life cycle consists of the acquisition and the usage parts Usage ActivitiesData Acquisition.
Instrument Builders Information Specialists (ESIP) Scientists Curriculum Developers Teachers Decision Analysts Decision Makers Reports From Kim Kastens.
REASoN REASoN Project to link NASA's data, modeling and systems to users in research, applications and education Application of NASA ESE Data and Tools.
Architecture and Technologies for an Agile, User-Oriented Air Quality Data System Rudolf B. Husar Washington University, St. Louis Presented at the workshop.
WCS Data Exchange at the DataFed Server/Client Center for Air Pollution Impact and Trend Analysis (CAPITA) Washington University, St. Louis, MO OGC TC.
Federated Network for Sharing Air Quality Data and Processing Services Center for Air Pollution Impact and Trend Analysis (CAPITA) Washington University,
1 International Standards for Data Interoperability GALEON Geo-interface for Air, Environment, Land, Ocean NetCDF Ben Domenico Unidata Program Center*
Architecture and Technologies for an Agile, User-Oriented Air Quality Data System Rudolf B. Husar Washington University, St. Louis Presented at the workshop.
Interoperable Information System of Systems for HTAP Rudolf B. Husar and Rich Scheffe With Erin Robinson Presented at HTAP Workshop, WMO, Geneva, January.
Work Group Meeting on HTAP-Relevant IT Techniques, Tools and Philosophies: DataFed Experience and Perspectives Rudolf B. Husar CAPITA, Washington University,
REASoN REASoN Project to link NASA's data, modeling and systems to users in research, education and applications Application of NASA ESE Data and Tools.
Integration of Multi-Sensory Earth Observations for Characterization of Air Quality Events E. M. Robinson Advisor, R. B. Husar 2010 Masters of Science.
Spatio-Temporal Data Sharing using XML Web Services Presented at the Workgroup Meeting on Web-based Environmental Information System for Global Emission.
DataFed Challenge. Value-Adding Processes Integrated DataDatasets Std. Interface Data Views Std. Interface Data Control Reports Obs. & ModelsDecision.
Current Air Quality Information ‘Ecosystem’ (Draft for Feedback) AQ information includes emissions, ambient & satellite data and model outputs The distributed.
REASoN REASoN Project to link NASA's data, modeling and systems to users in research, education and applications Application of NASA ESE Data and Tools.
Application of ESE Data and Tools to Particulate Air Quality Management The CAPITA REASoN Project August 15, 2003 Stefan Falke and Rudolf Husar Center.
The theme set for this ESIP meeting is: Matching Opportunities in Application Areas with Technical Solutions Topic of this demo: Air.
Air Quality Cluster Air Quality Cluster TechTrack Earth Science Information Partners Partners(?) NASA NOAA EPA USGS DOE NSF Industry… Data Flow Technologies.
Emerging Approaches to the Integration of Autonomous Heterogeneous Air Quality Information Systems Invited Paper IN13E-01 American Geophysical Union, San.
Accessing and Using Fire-Related Data with the CAPITA DataFed.net* Services Framework Stefan Falke Rudolf Husar Kari Hoijarvi Washington University in.
Air Quality Data Services: Application of OGC specifications Air Quality Data: Multi-dimensional, multi-source, multi-format Point observations are collected.
GEOSS Air Quality Community Infrastructure ESIP, 23 July 2010 Knoxville, TN E.M. Robinson, R.B. Husar, S.R. Falke, E.T. Habermann, A. Warnock, M. Hogeweg,
NASA Air Quality Applications Program and the ESIP Air Quality Cluster The goal of the NASA Air Quality Management program is to: Enable partners’ beneficial.
Interoperable Information System of Systems for HTAP Rudolf B. Husar and Rich Scheffe With Erin Robinson Presented at HTAP Workshop, WMO, Geneva, January.
Why so many data systems? Dickerson – ppt. Information as a Resource Shared not exchanged …
OGC Demo at IGARSS06 July 30 - Denver, CO Telecon 11 July 2006 Liping Di, George Mason University Rudolf Husar, Washington University.
Stefan Falke and Rudolf Husar Center for Air Pollution Impact and Trend Analysis Washington University in St. Louis A NSF Digital Government Pilot Project.
REASoN REASoN Project to link NASA's data, modeling and systems to users in research, education and applications Application of NASA ESE Data and Tools.
Architectures and Technologies Enabling the Diffusion of Atmospheric Science Information Rudolf B. Husar and Erin Robinson Washington University, St. Louis.
Interoperability of Data and Processing Services: Experience Using the DataFed System Rudolf B. Husar, Stefan R. Falke and Kari Hoijarvi Washington University,
Air Quality Demonstration Prepared for the 18 th ESIP Federation Meeting January 3, Air Quality Community Support through New Information Technologies.
Part I DataFed An Agile Distributed Air Quality Data System Rudolf B. Husar Washington University, St. Louis Seminar Presented at University of Alabama,
COMMUNITY. Data Acquisition and Usage Value Chain.
NASA REASoN Project SHAirED: S ervices for H elping the Air -quality Community use E SE D ata Stefan Falke, Kari Höijärvi and Rudolf Husar, Washington.
NASA REASoN Project SHAirED: S ervices for H elping the Air -quality Community use E SE D ata Stefan Falke, Kari Höijärvi and Rudolf Husar, Washington.
Processes of the Information Value Chain Informing Knowledge ActionProductive Knowledge Information Organizing Grouping Classifying Formatting Geo-referencing.
Part I DataFed An Agile Distributed Air Quality Data System Rudolf B. Husar Washington University, St. Louis Seminar Presented at University of Alabama,
Architecture and Technologies for an Agile, User-Oriented Air Quality Data System Rudolf B. Husar Washington University, St. Louis Presented at the workshop.
Architecture and Technologies for an Agile, User-Oriented Air Quality Data System Rudolf B. Husar Washington University, St. Louis Presented at the workshop.
Web Services-Based Mediator of Distributed Data Flow and Processing Project Coordinators: Software Architecture: R. Husar Software Implementation: K. Höijärvi.
Application of NASA ESE Data and Tools to Air Quality Management Stefan Falke and Rudolf Husar (Co-PIs) Washington University in St. Louis Project Period:
1 Using the GEOSS Common Infrastructure in the Air Quality & Health SBA: Wildfire & Smoke Assessment Prepared by the GEOSS AIP-2 Air Quality & Health Working.
Integration of Multi-Sensory Earth Observations for Characterization of Air Quality Events using Service Oriented Architecture E. M. Robinson Advisor,
ESIP Air Quality Jan Air Quality Cluster Air Quality Cluster Technology Track Earth Science Information Partners Partners NASA NOAA EPA (?) USGS.
: Data Sharing/Processing Infrastructure Data Catalog and Access Dozens of datasets on aerosols, emissions, fire, meteorology,
The Federated Data System DataFed: Experiences in Data Homogenization and Networking R.B. Husar, K. Hoijarvi, S. R. Falke, E. M. Robinson, Washington University,
Architecture and Technologies for an Agile, User-Oriented Air Quality Data System Rudolf B. Husar Washington University, St. Louis, MO, USA Outline Highlight.
DRAFT June 6, 2005 ESIP AQ Cluster, Contact R. Husar Air Quality Cluster Air Quality Cluster TechTrack Earth Science Information Partners.
Application of NASA ESE Data and Tools to Particulate Air Quality Management A proposal to NASA Earth Science REASoN Solicitation CAN-02-OES-01 REASoN:
The Federated Data System DataFed R. Husar, K. Hoijarvi, S. Falke, DaFed Community EPA Data Summit, Feb. 12, 2008, RTP Non-intrusive data integration infrastructure.
Standards-based Access to Air Quality Data: Application of OGC WMS and WCS Protocols Client Server Std. Interface GetCapabilities GetData Capabilities,
Topic Suggestions Scheffe GEOSS Support to Regional Air Quality (see next slide) –Data. Services –Sharing/Harvesting Infrastructure –Intellectual Resources.
Federated Network for Sharing Air Quality Data and Processing Services Center for Air Pollution Impact and Trend Analysis (CAPITA) Washington University,
Interoperability Day Introduction Standards-based Web Services Interfaces to Existing Atmospheric/Oceanographic Data Systems Ben Domenico Unidata Program.
Fire, Smoke & Air Quality: Tools for Data Exploration & Analysis : Data Sharing/Processing Infrastructure This project integrates.
OGC Live Demo at GEOSS Workshop July 30 - Denver, CO Demo Scripts Liping Di, Peisheng Zhao, George Mason University Rudolf Husar,
NATIONAL AERONAUTICS AND SPACE ADMINISTRATION ESDS Reuse Working Group Earth Science Data Systems Reuse Working Group Case Study: SHAirED Services for.
ESIP Vision: “Achieve a sustainable world” by Serving as facilitator and advisor for the Earth science information community Promoting efficient flow of.
ESIP Air Quality Jan Air Quality Cluster Air Quality Cluster Technology Track Earth Science Information Partners Partners NASA NOAA EPA (?) USGS.
GEOSS Air Quality Community Infrastructure
4/5 May 2009 The Palazzo dei Congressi di Stresa Stresa, Italy
Presentation transcript:

Interoperability of Data and Processing Services: Experience Using the DataFed System Rudolf B. Husar, Stefan R. Falke and Kari Hoijarvi Washington University, CAPITA Client Server Std. Interface GetCapabilities GetData Capabilities, DataProfile Data Back End Front End Std. Interface Where? When? What? Which Format?

Outline DataFed in a Nutshell Information Engineering: Data Control and Data Flow Web Coverage Service (WCS) for Ubiquitous Data Access Summary

Federated Network for Air Quality Data and Processing Services Software Architecture (00-05) : R. Husar Software Implementation (01-04): K. Höijärvi Data and Applications (03-05): S. Falke, R. Husar

The challenge is to design a general supportive infrastructure Simply connecting the relevant provides and users for each info product is messy Integrated Data System for Air Quality-IDAQ ESIP AQ Cluster Draft The info system infrastructure needs to facilitate the creation of info products AQ Compliance Nowcast/Forecast Status & Trends Find Data Gaps ID New Problems ……… Info Needs Reports Providers supply the ‘raw material’ (data and models) for ‘refined’ info products Emission Surface Satellite Model Single Datasets Providers Wrappers Where? What? When? Federate Data Structuring Structuring the heterogeneous data into where-when-what ‘cubes’ simplifies the mess Slice & Dice Explore Data Viewers The ‘cubed’ data can be accessed and explored by slicing-dicing tools Programs Integrate Understand More elaborate data integration and fusion can be done by web service chaining This infrastructure support for IDAQ can be provided by the ESIP Federation Non-intrusive Linking & MediationData UsersData Providers

Some of the Tools Used in FASTNET –Data Catalog –Data Browser –PlumeSim, Animator –Combined Aerosol Trajectory Tool (CATT) Consoles: Data from diverse sources are displayed to create a rich context for exploration and analysis CATT: Combined Aerosol Trajectory Tool for the browsing backtrajectories for specified chemical conditions Viewer: General purpose spatio-temporal data browser and view editor applicable for all DataFed datasets

A Sample of Datasets Accessible through ESIP Mediation Near Real Time (~ day) It has been demonstrated (project FASTNET) that these and other datasets can be accessed, repackaged and delivered by AIRNow through ‘Consoles’ MODIS Reflectance MODIS AOT TOMS Index GOES AOT GOES 1km Reflec NEXTRAD Radar MODIS Fire Pix NRL MODEL NWS Surf Wind, Bext

WS Benefits: Composition of Data Views from Distributed Data and Web Services NASA SeaWiFS Satellite NOAA ATAD Trajectory OGC Map Boundary RPO VIEWS Chemistry Data Access Data Processing Layer Overlay LAYERS Web Service Composition

Web Services: Building Blocks of DataFed Programming Access, Process, Render Data by Service Chaining [Station] [Monitor] Clickable monitor locations layer Map boundary layer

Service Flow Program for a VIEW Layer View

DataFed in a Nutshell DataFed Goals Facilitate access and flow of AQ data from provider to users Support user-driven data processing value chains Participate in specific application projects Approach: Mediation Between Users and Data Providers DataFed assumes spontaneous, autonomous data providers Non-intrusively wraps datasets for access by web services Mediates, homogenizes data views. e.g. geo-spatial, time... Applications Browsers and analysis tools for distributed monitoring data Serve as data gateway for user programs; GIS, science tools DataFed is focused on the mediation of air quality data

GEOSS Eco-informatic Test beds Accountability/ indicators SIPs, nat. rules designations PHASE PM research Risk/exposure assessments AQ forecasting Programs Actions NAAQS setting EPA NOAA NASA NPS USDA DOE Private Sector States/Tribes/RPO’s Interstate orgs. Academia NARSTO NAS, CAAAC CASAC, OMB Enviros Organizations Stakeholders People CDC Supersites IMPROVE, NCore PM monit, PAMS CASTNET Lidar systems NADPSatellite data Intensive studies PM centers Other networks: SEARCH, IADN.. Data sources CMAQ GEOS-CHEM Emissions Meteorology Health/mort. records Coordination Cluster Mess The Dependency Puzzle? Forces for Information Flow? Info System of Systems?

Information Flow Control Organizations Have $$, Manage Programs Provide Info to Orgs Goal, $$ Info, Guidance Mandate, $$ Decision, Action Public, Spec. Groups Data Systems Provide Data to Progs Data need $$ Raw Data Judge, Decide, Act Analyze, Report Measure, Organize Societal Benefits Actionable Knowledge Data as Information Public & Spec. Groups Organizations Programs Data Sources Set up Create Comis sion Control Information

Data Value – Sum of Values to Programs Prog Public Dat Org Dat Prog Dat Prog Dat Org Dat Prog Dat Data Value i =  Prog j x Value j ) Information is non-depletive – can not be ‘used up’ Information re-use leads to the ‘network effect’ Total Info value is increased with use – NOT a zero sum game

Air Quality Cluster Air Quality Cluster TechTrack Earth Science Information Partners Partners NASA NOAA EPA (?) USGS DOE NSF Industry… Air Quality Information System Architecture Flow of Data Flow of Control Air Quality Data Meteorology Data Emissions Data Informing Public AQ Compliance Status and Trends Network Assess. Tracking Progress Data to Knowledge Transformation (intended as background for AQ Cluster discussions) Data ProductsKnowledge ProductsMediators

Interoperability Wrappers and Adapters

Abstract Data Model 4D Data Model Physical Data Model

Generic Data Interoperability QueryGetData Standards Where?BBOXOGC, ISO When?TimeOGC, ISO What?TemperatureCF FormatnetCDF, HDF, GMLCF, EOS, OGC T2T1 Client-Server Communication Protocol Client Server Std. Interface GetCapabilities GetData Capabilities, DataProfile Data Back End Front End Std. Interface Where? When? What? Which Format? Data Coding Geo-reference Time-code Std names Model, format

Generic Data Interoperability QueryGetData Where?BBOX When?Time What?Temperature FormatnetCDF, HDF.. T2T1 Client-Server Communication Protocol Client Server Std. Interface GetCapabilities GetData Capabilities, DataProfile Data Back End Front End Std. Interface Where? When? What? Which Format? Data Coding Geo-reference Time-code Std names Model, format

Data Interoperability Groups, Activities Interoperability Experiments and Demos: GEOSS Services Network (GSN Network) Geo-interface for Air Land Earth Ocean NetCDF (GALEON) Hemispheric Transport of Air Pollution (HTAP) Others Protocol Stack Composition: Data Access: Metadata: Data Formats: Server BPEL, AJAX,.Net SOAP, OGC-W*S CF, EOS, OGC-W*S netCDF, HDF, geoTIFF Client JScript,.Net, AJAX OGC-WCS CF netCDF, GRIB Interoperability Stack and Connectivity Example

OGC Services Services are self-describing through “Capabilities” and “Description” docs Filter parameters allow selection of subsets of source data Output formats advertised by each service instance HTTP GET/POST based interfaces WMS Maps Format: Image WFS Feature – road Format: GML WCS coverage data Format:Grid, Table M11 <gml:LineString srsName="EPSG:4326"> 0, , ,60.9 EOS HDF, netCDF, RichTable Web Map ServiceWeb Feature Serv.Web Coverage Serv.

OGC Web Coverage Service (WCS) Specification HTTP GET/POST based interfaces Services have XML service descriptions (“Capabilities”, “Description”) Filter parameters allow selection of subsets of source data Output formats advertised by each service instance OGC WCS getCoverage Schema Suitable for wrapping with SOAP envelope, WSDL access, loose coupling WCS is for "coverages" – information representing space-time-varying phenomena WCS describes, requests and delivers coverages in spatio-temporal domain WCS version 1.1 is limited to grids/"simple” coverages with homogeneous range sets

DataFed WCS Server for Point Observation Data The current version of WCS (1.0) only supports grid and image data types. Coverages can include other representations of space-time varying phenomena, such as observations from an air quality monitoring network. An example WCS query for a point dataset in DataFed is: ?SERVICE=wcs &REQUEST=GetCoverage &VERSION=1.0.0 &CRS=EPSG:4326 &COVERAGE=AIRNOW.pmfine &FORMAT=CSV &BBOX=-125,22,-61,51,0,0 &TIME= T15:00:00Z &WIDTH=999 &HEIGHT=999 &DEPTH=999 &BBOX=-100,22,-100,22,0,0 &TIME= T00:00:00Z/ T23:00:00Z

Benefits of WMS Compatibility: Geospatial One-Stop Provides Access to DataFed spatial layers through WMS Problem: No Time Dim! DataFed/VIEWS Asian Dust Concentration (circles) S. Falke

Relational Data Model Star Schema WMS WCSSOS SensorML WFS Observations Station Info. Param/Sensor/Method Data View Services WMS StationsPar-Meth Observations SOS

WCS Query for Map, Time and Elevation Views Lat/Lon Box Elev Range Time Range Map: BBOX=-180,-90,180,90, 1350,1350& TIME= / /PT3H Time: BBOX=-34,49.05,-34,49.05, 1350,1350& TIME= / /PT3H Elev: BBOX=-34,49.05,-34,49.05, 0,18000 & TIME= / /PT3H The form of the WCS query is the same for all slices through the data cube (views) The only difference in the views is the thickness of the slices in each dimension

WCS Query for Map Views from Various Sites Coverage=THEEDDS.T& BBOX=-126,24,-65,52,0,0 &TIME= / &FORMAT=NetCDF Coverage=SURF.Bext& BBOX=-126,24,-65,52,0,0 &TIME= / &FORMAT=NetCDF-table Coverage=SEAW.Refl& BBOX=-126,24,-65,52,0,0 &TIME= / &FORMAT=GeoTIFF COVERAGE=sst& BBOX=-126,24,-65,52,0,0 &TIME= , &FORMAT=NetCDF UNIDATA – THREDDS/GALEON WCS DataFed GALEON WCS U Florence, It GALEON WCS DataFed GALEON WCS

Wind Data for GEOSS Services Network (GSN) Beijing Demo ( see More ) G. Rutledge (NOAA-NSDC), B. Domenico (Unidata), R. Husar (Wash U.)More NCDC WCS Server Unidata WCS Server DataFed WMS Server Other WMS, WCS Forecast Meteorology Archived Meteorology Other Data Station, Image Internet GSN Beijing WMS Client Data Access Services Portrayal & Chaining Services Appliation

Trajectory Service Chain Demo Trajectory Service Chain Demo B. Domenico (Unidata), G. Rutledge (NOAA-NCDC), R. Husar (Wash U.) WCS Data Access Service Single Traj. Service 4D Wind FieldSingle Trajectory Traj. Aggregator Multi-Trajectory Backward Traj Forward Traj Receptor Model Wind Servers Portray Overlay Render GMU? Plume Wind Field Subset Access to Forecast & Archived Winds Unidata & NOAA Trajectory Algorithms: NOAA HYSPLIT CAPITA Monte Carlo Trajectory Filtering & Aggregation Based on Chemical Conditions

Summary OGC WCS protocol is suitable for delivering most air quality data Need WCS Interface to Satellite Data Suitable for most data view queries BBOX, TIME Station-Point SQL Server, Files… Sequence Image, file nDim Grid OpenDAP NetCDF, … Applicable to most data types Station, Image, Grid WCS Issues WCS: Add Point coverage to WCS (in addition to grid, polygon) WCS: Strongly typed (XML Schema-verifiable) return data types Testing: Much fixing and testing for server-client compatibility