Dvoy Related Ideas. Data Acquisition and Usage Value Chain.

Presentation transcript:

Dvoy Related Ideas

Data Acquisition and Usage Value Chain

Data Processing Value Chain
Diagram labels: Monitor, Store, Data 1, Data 2, ..., Data n, Data m; IntData1, IntData2, ..., IntDatan; Virtual Int. Data

Information Processing Value Chain (Taylor, 1975)
Data → Information → Informing Knowledge → Productive Knowledge → Action
–Organizing: Grouping, Classifying, Formatting, Displaying
–Analyzing: Separating, Evaluating, Interpreting, Synthesizing
–Judging: Options, Quality, Advantages, Disadvantages
–Deciding: Matching goals, Compromising, Bargaining, Deciding
Forces to Move Data from one-shot to reusable form
–External force: contracts
–Internal: humanitarian, benefits
Resistances to Move Data
–Mechanical
–Personal
–Institutional

DVOY
(A Federated System for Finding, Exploring and Analyzing Environmental Data)
(Unified Access to 4-Dimensional Geo-Environmental Data through Web Services)
Outline prepared by the Special Interest Group on Environmental Data Integration, March 2002
Coordinated by CAPITA
Supported by NSF, EPA and NOAA

The Researcher’s Challenge
“The researcher cannot get access to the data; if he can, he cannot read them; if he can read them, he does not know how good they are; and if he finds them good he cannot merge them with other data.”
Information Technology and the Conduct of Research: The User’s View, National Academy Press, 1989
These resistances can be overcome through:
–A catalog of distributed data resources for easy data ‘discovery’
–Uniform data coding and formatting for easy access, transfer and merging
–A rich and flexible metadata structure to encode the knowledge about data
–Powerful shared tools to access, merge and analyze the data

Data Catalog
–All the data in the system are to be distributed on the Web and maintained by their custodians
–The purpose of the catalog is to help in finding and accessing the data
–The catalog would be limited to data that can be accessed/merged in DVOY
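The catalog idea can be illustrated with a minimal sketch. It assumes a catalog entry is only a pointer to data that remain on the custodian's server; the class names, fields, dataset names and example.org addresses are all hypothetical and are not part of DVOY.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class CatalogEntry:
    """Pointer to a dataset that stays on its custodian's server."""
    name: str
    custodian: str
    url: str  # hypothetical address; the data themselves are not copied
    keywords: frozenset = field(default_factory=frozenset)


class DataCatalog:
    """In-memory catalog: custodians publish entries, users find them."""

    def __init__(self):
        self._entries = {}

    def publish(self, entry):
        # Re-publishing under the same name replaces the older entry.
        self._entries[entry.name] = entry

    def find(self, keyword):
        # Keywords are assumed to be stored in lower case.
        key = keyword.lower()
        matches = (e for e in self._entries.values() if key in e.keywords)
        return sorted(matches, key=lambda e: e.name)


catalog = DataCatalog()
catalog.publish(CatalogEntry("pm25_hourly", "Agency A",
                             "http://example.org/pm25",
                             frozenset({"pm2.5", "aerosol"})))
catalog.publish(CatalogEntry("fire_pixels", "Agency B",
                             "http://example.org/fire",
                             frozenset({"fire", "smoke"})))

for entry in catalog.find("fire"):
    print(entry.name, entry.url)
```

Because the catalog holds only descriptions and addresses, each custodian keeps control of the data, which matches the first point on the slide.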

Uniform Coding and Formatting of Distributed Data
–Data are now easily accessible through standard Internet protocols, but the coding and formatting of the data are very heterogeneous
–On the other hand, data sharing is most effective if the codes/formats/protocols are uniform (e.g. the Web formats and protocols)
–Re-coding and reformatting all the heterogeneous data into a universal form on their respective servers is unrealistic
–An alternative is to enrich the heterogeneous data with uniform coding along the way from the provider to the user
A third-party ‘proxy’ server can perform the necessary homogenization with the following benefits:
–The data user interfaces with a simple universal data query and delivery system (interface, formats..)
–The data provider does not need to change the system and gets additional security protection, since the data are accessed by the proxy
–Reduced data flow resistance results in increased overall data flow and data usage
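The proxy idea can be sketched as a set of per-provider adapters that translate each native format into one uniform record. This is only an illustration under stated assumptions: the provider names, column names and the Observation record are hypothetical, and a real proxy would fetch the payloads over the network instead of taking them as strings.

```python
import csv
import io
import json
from dataclasses import dataclass


@dataclass(frozen=True)
class Observation:
    """Uniform record that the data user sees, whatever the source format."""
    site: str
    time: str        # ISO 8601 string
    parameter: str
    value: float


def from_csv(text):
    """Adapter for a hypothetical provider serving CSV with its own column names."""
    rows = csv.DictReader(io.StringIO(text))
    return [Observation(r["station"], r["date"], r["pollutant"], float(r["conc"]))
            for r in rows]


def from_json(text):
    """Adapter for a hypothetical provider serving JSON records."""
    return [Observation(r["siteId"], r["timestamp"], r["param"], float(r["val"]))
            for r in json.loads(text)]


# The proxy keeps one adapter per provider; providers change nothing on their side.
ADAPTERS = {"provider_a": from_csv, "provider_b": from_json}


def proxy_query(provider, raw_payload):
    """Homogenize a provider's native payload into uniform observations."""
    return ADAPTERS[provider](raw_payload)


csv_payload = "station,date,pollutant,conc\nSTL01,2002-03-01T00:00,PM2.5,12.5\n"
json_payload = ('[{"siteId": "CHI07", "timestamp": "2002-03-01T00:00", '
                '"param": "PM2.5", "val": 9}]')

# Once homogenized, records from different providers can be merged directly.
merged = proxy_query("provider_a", csv_payload) + proxy_query("provider_b", json_payload)
for obs in merged:
    print(obs)
```

Adding a new provider means writing one more adapter in the proxy, which is the sense in which the provider does not need to change its system.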

OGC Web Service Interoperability Program: Goals
–Promote interoperability. The interaction between services should be completely platform and language independent, based on XML
–Enable just-in-time integration. The discovery, access to and ad-hoc chaining of services should be possible dynamically at runtime.
–Reduce complexity. All components are services with published capabilities (incl. ConMan?); implementation is opaque.
–Support legacy systems. Enable interoperability by encapsulating existing components and exposing them as services.
(Same as DVoy, isn’t it? We could put it better ourselves! RBH)
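Three of these goals (published capabilities, encapsulation of legacy components, and chaining at runtime) can be shown in a small in-process sketch. It is not an OGC interface and uses no XML; the Service class, the type labels and the two example functions are hypothetical stand-ins.

```python
class Service:
    """Wraps any callable and publishes what it consumes and produces."""

    def __init__(self, name, consumes, produces, func):
        self.name = name
        self.consumes = consumes
        self.produces = produces
        self._func = func  # implementation stays opaque to callers

    def capabilities(self):
        return {"name": self.name, "consumes": self.consumes, "produces": self.produces}

    def invoke(self, payload):
        return self._func(payload)


def legacy_parse(line):
    """Stand-in for an existing component with its own text format."""
    site, value = line.split(";")
    return {"site": site, "value": float(value)}


def to_micrograms(record):
    """Convert a value from mg/m3 to ug/m3."""
    return {**record, "value": record["value"] * 1000.0}


def chain(services, payload):
    """Check published capabilities, then run the services in order."""
    for prev, nxt in zip(services, services[1:]):
        if prev.capabilities()["produces"] != nxt.capabilities()["consumes"]:
            raise ValueError(f"{prev.name} cannot feed {nxt.name}")
    for service in services:
        payload = service.invoke(payload)
    return payload


parse_svc = Service("parse", "text/line", "record/mg", legacy_parse)
convert_svc = Service("convert", "record/mg", "record/ug", to_micrograms)

result = chain([parse_svc, convert_svc], "STL01;0.25")
print(result)
```

The chain is assembled from capability descriptions alone, so a caller never needs to know how either component is implemented.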

Outline of an Open, Distributed Air Quality Data Integration and Analysis System Notes prepared for a discussion with EPA NERL and OAQPS December 1, 1998

The Problem: The researcher is not aware of the relevant data; if he is aware, he cannot access them; if he can access them, he cannot read them; if he can, he does not know how good they are; if they are good, he cannot merge them with other data; and by the time he merges them, the data are outdated.
Based on Information Technology and the Conduct of Research: The User’s View, National Academy Press, 1989

AQ Data and Analysis: Challenges and Opportunities
–Shift from primary to secondary pollutants. Ozone and PM2.5 travel miles across state or international boundaries and their sources are not well established
–New regulatory approach. Compliance evaluation based on ‘weight of evidence’ and tracking the effectiveness of controls
–Shift from command & control to participatory management. Inclusion of federal, state, local, industry, international stakeholders.
Challenges
–Broader user community. The information systems need to be extended to reach all the stakeholders (federal, state, local, industry, international)
–A richer set of data and analysis. Establishing causality, ‘weight of evidence’ and emissions tracking require more data and air quality analysis
Opportunities
–Rich AQ data availability. Abundant high-grade routine and research monitoring data from EPA, NASA, NOAA and other agencies are now available.
–New information technologies. DBMS, data exploration tools and web-based communication now allow cooperation (sharing) and coordination among diverse groups.

Challenges
–Broader user community. The information systems need to be extended to reach all the stakeholders (federal, state, local, industry, international)
–A richer set of data and analysis. Establishing causality, ‘weight of evidence’ and emissions tracking require the analysis of air quality, meteorology, emissions and effects data.
Opportunities
–Rich AQ data availability. Abundant high-grade routine and research monitoring data from EPA and other agencies are now available.
–New information technologies. DBMS, data exploration tools and web-based communication now allow cooperation (sharing) and coordination among diverse groups.

Recap: Harnessing the Winds
–Secondary pollutants, along with a more open environmental management style, are placing increasing demands on data analysis.
–Meanwhile, rich AQ data sets and the computer and communications technologies offer unique opportunities.
–It appears timely to consider the development of a web-based, open, distributed air quality data integration, analysis and dissemination system.
–The challenge is to learn how to harness the winds of change, as sailors have learned to use the winds for going from A to B

Standard Data Support System
–Data management systems, DBMS
–Data processing and exploration tools
–Presentation tools

Data Flow and Processing

Infrastructure support for a distributed system
–Data sharing standards. A set of open standards for the sharing of AQ data, tools and reports. Examples: TCP/IP, HTML, XML, FGDC
–Data catalog. A virtual centralized catalog with search and retrieval facilities. Examples: GCMD, web-indexes
–Web-based shared workspace. Place to share comments, feedback, plans,...

Benefits of a Distributed and Shared System
–Access to data. Users can get data, tools and reports out of the system for specific projects. It can be a forum for the exchange of ideas, peer feedback etc.
–Saving time and money. The data, tools and other resources in the system can leverage the dollars and time available for specific projects.
–Recycling data. Data are a costly resource. The system can help in managing, accessing and documenting one's own data, and in sharing them with others for re-use.

The Dvoy Project
–DVOY is a Federated Information System for heterogeneous, multidimensional datasets
–Voyager is a generic graphic browser for the federated DVOY data
–The initial Dvoy infrastructure is being developed at CAPITA, with NSF support
–Further services for data access, processing and viewing are expected from the community
–The project is to evolve by riding the ‘web services wave’ of the Internet
CAPITA Support:
–NSF ITR Workgroup Collaboration Tool: Aug – Aug 2004
–EPA Web-based Visibility: Aug – Apr 2003
–NOAA ASOS Visibility: Sep – Sep 2002
–MARAMA Chemical Trajectory Tool: Aug – July 2003
–EPA OAQPS Global Transport Analysis: Nov 2002 – Oct 2003
–NSF DigiGov Fire and Smoke Network: May 2003 – Apr 2006 (pending)
–NASA ESE Satellite Appl. to PM Management: June 2003 – May 2008 (pending)
In-kind support by organizations participating in DVOY-based federated data sharing
Collaborators: CIRA (Schichtel), NRL (Westphal), NASA (Goddard)… many data sources.

DATAFED Catalog Maintenance System: PUBLISH, FIND, VIEW