1 A New Enterprise Information Architecture and Data Management Strategy for the U.S. Government Part 10: Web 2.0 for Earth Science Collaboration for Information.

Slides:



Advertisements
Similar presentations
European Clearing-House Mechanism Portal Toolkit Expert Group Meeting
Advertisements

REASoN REASoN Project to link NASA's data, modeling and systems to users in research, education and applications Application of NASA ESE Data and Tools.
Data Science for Business: Semantic Verses Dr. Brand Niemann Director and Senior Data Scientist Semantic Community
e-Framework Components and Responsibilities.
NOAA Science Advisory Board The U.S. Climate Change Science Program Strategic Plan James R. Mahoney, Ph.D. Assistant Secretary of Commerce for Oceans and.
Connecting People With Information DoD Net-Centric Services Strategy Frank Petroski October 31, 2006.
1 Social Networking and the Medici Effect: The SICoP Experience Brand Niemann Senior Enterprise Architect, US EPA, and Federal SICoP Co-Chair Computer.
1 Improved Access to EPA and Interagency Information: Before and After with Web 2.0 – Part 2 Brand Niemann Senior Enterprise Architect, US EPA, and Co-chair,
DoDAF 3.0: A Web 2.0 and SOA Mashup!
1 Improved Access to EPA Information: Before and After with Web 2.0 Brand Niemann Senior Enterprise Architect, US EPA, and Co-chair, Federal SOA CoP and.
Third-generation information architecture November 4, 2008.
Build the Binary Group in the Cloud Brand Niemann Senior Enterprise Architect Binary Group August 5, Updated August 8,
Environmental Terminology System and Services (ETSS) June 2007.
ESIP Air Quality Workgroup and the GEO Air Quality Community of Practice collaboratively building an air quality community network for finding, accessing,
Semantic Interoperability Community of Practice (SICoP) Semantic Web Applications for National Security Conference Hyatt Regency Crystal City, Regency.
Domain Modelling the upper levels of the eframework Yvonne Howard Hilary Dexter David Millard Learning Societies LabDistributed Learning, University of.
Business Driven Technology Unit 4
Flood Risk Management Program Rolf Olsen Institute for Water Resources U.S. Army Corps of Engineers.
Linked Data Visualizations for Eurostat Linked Data Dr. Brand Niemann Director and Senior Data Scientist Semantic Community
1 Semantic Cloud Computing & Open Linked Data Pattern Brand Niemann Invited Expert to the NCIOC SCOPE and Services WGs September 22, 2009.
DoD Architecture Registry System DARS 16 September 2009 Walt Okon Senior Architect Engineer Senior Architect Engineer for Information Sharing Enterprise.
1 Briefing for EPA and OEI Communications Coordinators and Press Officers Brand Niemann US EPA Senior Enterprise Architect and Federal CoP Leader January.
1 Integration of Environmental Information and Indicators Brand Niemann, US EPA The Sustainable Water Resources Roundtable January 25-26, 2007 Updated.
1 A New Enterprise Information Architecture and Data Management Strategy for the U.S. Government Part 8: Emergency Management and Sensors Brand L. Niemann.
XBRL Seminar: The New Data Reference Model
1 Data Architecture, Modeling, and Networks Brand L. Niemann January 5, 2007.
CI Days: Planning Your Campus Cyberinfrastructure Strategy Russ Hobby, Internet2 Internet2 Member Meeting 9 October 2007.
State of the Federation Winter Meeting Washington, D.C. January 9, 2008.
1 Community Engagement Pilot Tutorial: One EPA EPA Web Work Group, EPA Wiki and Blog Work Group, and EPA Open Government Directive Work Group Brand Niemann.
1 Building DRM 3.0 and Web 3.0 for Managing Context Across Multiple Documents and Organizations Mills Davis and Brand Niemann, SICoP Co-Chairs, and Lucian.
SEARCH Membership Group Systems & Technology PAC Global Justice XML Data Model (GJXDM) Update January 29, 2005.
ESIP Federation: Connecting Communities for Advancing Data, Systems, Human & Organizational Interoperability November 22, 2013 Carol Meyer Executive Director.
What is Cyberinfrastructure? Russ Hobby, Internet2 Clemson University CI Days 20 May 2008.
The Agricultural Ontology Service (AOS) A Tool for Facilitating Access to Knowledge AGRIS/CARIS and Documentation Group Library and Documentation Systems.
1 "Wikis: The Good, The Bad, and the Ugly" Brand Niemann Senior Enterprise Architect, US EPA and COIC Semantic Interoperability CoP, Co-Chair Panel at.
1 A Target Data Architecture for the US EPA: Implementing DRM 3.0 and Data.gov Brand Niemann Senior Enterprise Architect, US EPA April 21, 2009 PARS 2009.
Revelytix SICoP Presentation DRM 3.0 with WordNet Senses in a Semantic Wiki Michael Lang February 6, 2007.
1. Housekeeping Items June 8 th and 9 th put on calendar for 2 nd round of Iowa Core ***Shenandoah participants*** Module 6 training on March 24 th will.
Puget Sound Information Challenge Experiences and Lessons Learned.
FEA DRM Management Strategy Presented by : Mary McCaffery, US EPA.
1 Data and Information Architecture: Not Just for Enterprise Architects! Gartner Enterprise Architecture Conference June 2007, Nashville, TN Gaylord.
EPA Geospatial Segment United States Environmental Protection Agency Office of Environmental Information Enterprise Architecture Program Segment Architecture.
National Center for Supercomputing Applications Barbara S. Minsker, Ph.D. Associate Professor National Center for Supercomputing Applications and Department.
Emergency Management Training and Education System Protection and National Preparedness National Preparedness Directorate National Training and Education.
1 Tutorial for the EAWG: Solution Architecture for 2010 Brand Niemann Senior Enterprise Architect U.S. EPA January 28, 2010.
Semantic Web: The Future Starts Today “Industrial Ontologies” Group InBCT Project, Agora Center, University of Jyväskylä, 29 April 2003.
Cyberinfrastructure What is it? Russ Hobby Internet2 Joint Techs, 18 July 2007.
® GEOSS AIP 5 Water SBA Update HDWG June 2012 Matt Austin NOAA Stefan Fuest KISTERS Jochen Schmidt NIWA.
1 DAS Annual Review June 2008 “Build to Share” Suzanne Acar, US DOIAdrian Gardner, US National Weather ServiceCo-Chair, Federal DAS
1 Improved Access to EPA and Interagency Information: Before and After with Web 2.0 – Part 4 Interagency and Non-government (in process) Brand Niemann.
Shaping a Health Statistics Vision for the 21 st Century 2002 NCHS Data Users Conference 16 July 2002 Daniel J. Friedman, PhD Massachusetts Department.
ESIP Vision: “Achieve a sustainable world” by Serving as facilitator and advisor for the Earth science information community Promoting efficient flow of.
Metadata and Meta tag. What is metadata? What does metadata do? Metadata schemes What is meta tag? Meta tag example Table of Content.
Information Architecture The Open Group UDEF Project
Cyberinfrastructure Overview Russ Hobby, Internet2 ECSU CI Days 4 January 2008.
Cyberinfrastructure: Many Things to Many People Russ Hobby Program Manager Internet2.
Fire Emissions Network Sept. 4, 2002 A white paper for the development of a NSF Digital Government Program proposal Stefan Falke Washington University.
Providing web services to mobile users: The architecture design of an m-service portal Minder Chen - Dongsong Zhang - Lina Zhou Presented by: Juan M. Cubillos.
The Federated Data System DataFed R. Husar, K. Hoijarvi, S. Falke, DaFed Community EPA Data Summit, Feb. 12, 2008, RTP Non-intrusive data integration infrastructure.
Semantic Interoperability for the Office of the National Coordinator for Health Information Technology Brand Niemann and the Health Information Technology.
1 EPA Data Architecture for DRM 2.0 Brand Niemann Enterprise Architecture Team, US EPA, and Co-Chair, SICoP August 7, 2006.
1 EPA Data Architecture for the Data Reference Model: Build to Exchange, Share, and Reuse Brand L. Niemann, Senior Enterprise Architect, US EPA, and Co-Chair,
1 Collaboration: Communities of Practice Using Wiki Technology OGETA Forum Meeting, Atlanta, Georgia Atlanta-Augusta Room, 3rd Floor, Sam Nunn Atlanta.
Sharing Data: Issues and Opportunities Transportation Research Board Annual Meeting January 22, 2006 Leni Oman Director of Transportation Research Washington.
A New Enterprise Data Management Strategy for the US EPA
Brand Niemann, US EPA & Co-Chair SICoP
Federal Communities of Practice: IBM Contributions
Brand Niemann, US EPA and
Harvard CRM Service Strategy
About The Federal Data Architecture Subcommittee (DAS) 2008
Presentation transcript:

1 A New Enterprise Information Architecture and Data Management Strategy for the U.S. Government Part 10: Web 2.0 for Earth Science Collaboration for Information Access Brand L. Niemann Senior Enterprise Architect, US EPA Enterprise Architecture Team, and Co-Chair, Federal Semantic Interoperability Community of Practice (SICoP) Presentation to Water Management Cluster Meeting at the Earth Science Information Partnership Conference January 9-10, 2008

2 Abstract The National Science Foundation’s Cyber- infrastructure Vision for 21st Century, National Science Foundation states that web services, service-oriented architectures, and ontologies are emerging as central facets of the cyber- infrastructure software ecosystem and that the creation of end-to-end cyber-infrastructure systems by groups of individuals with common interests is permitting the establishment of Virtual Organizations (VOs) that are revolutionizing the conduct of science and engineering research and education.

3 Abstract This paper describes the same kind of effort as the “Building Semantically Enabled Scientific Applications” for presentation at the Federation of Earth Science Information Partners Meeting on “The Changing Climate: Making Earth Science Relevant Again in Light of Global Warming”, based on multiple communities of practices and their collaborations. The New Enterprise Information Architecture and Data Management Strategy is expressed as Web 2.0: Locate, Collaborate, and Integrate.

4 Abstract The cyber-infrastructure described in this paper includes three parts: indicators, model-driven assessments, and validation with measurements and analyses. The model-driven assessments are captured in semantically interoperable knowledgebases that consist of semantic models (ontologies) and specific instances or examples. So the essence of this new approach is to apply the best ontology methodologies to the best content generated by the best subject matter experts to provide the best information access and decision support tools.

5 Abstract Some specific examples of water indicators and model-driven assessments are provided in the paper for the Water Management Cluster Meeting Questionnaire (Appendix I) of the Federation of Earth Science Information Partners Meeting. Recommendations are provided for other science topics to use Web 2.0 for Earth Science Collaborations to improve information access.

6 Contents 1. Introduction 2. Web 2.0: Locate, Collaborate, and Integrate 3. Water Indicators and Model-Driven Assessments 4. Recommendations 5. References Appendix I: Water Management Cluster Meeting, January 9, 2008

7 1. Introduction The National Science Foundation’s Cyber-infrastructure Vision for 21st Century, National Science Foundation (1) includes definitions of data, metadata, ontologies, and states that “web services and service-oriented architectures are emerging as a standard framework for interoperability among various software applications on different platforms … (because).. They provide important characteristics such as standardized and well-defined interfaces and protocols, ease of development, and reuse of services and components, making them potential central facets of the cyber-infrastructure software ecosystem.” This same report also states that “the creation of end-to-end cyber-infrastructure systems – comprehensive networked resources – by groups of individuals with common interests is permitting the establishment of Virtual Organizations (VOs) that are revolutionizing the conduct of science and engineering research and education.”

8 1. Introduction Interestingly, the highest rated presentation at the 2007 Semantic Technology Conference was entitled “Building Semantically Enabled Scientific Applications” in which the authors: –developed a technical infrastructure (known as cyber- infrastructure) across several science areas, –developed a semantic data framework, –provided examples of how semantic technologies are deployed in a scientific Virtual Observatory to provide online query, retrieval, and analysis services to a variety of heterogeneous scientific data resources, and –showed how semantics improves access and interoperability, thereby supporting new options for collaboration and access for scientific research. This paper describes the same kind of effort.

9 1. Introduction The New Enterprise Information Architecture and Data Management Strategy is expressed as Web 2.0: Locate, Collaborate, and Integrate and is describe in Section 2. Some specific examples of water indicators and model-driven assessments are provided in Section 3 and recommendations are provided for other science topics to use Web 2.0 for Earth Science Collaboration to improve information access in Section 4. Appendix I provides answers to questions for the Water Management Cluster Meeting, January 9, 2008.

10 2. Web 2.0: Locate, Collaborate, and Integrate Web 2.0 refers to a perceived second-generation of Web based communities and hosted services — such as social networking sites, wikis and folksonomies — that facilitate collaboration and sharing between users. The author has provided "Government by Wiki: A Guided Tour" for the 2007 Spring Government CIO Summit and "Web 2.0 and Government: Current Web 2.0 Initiatives within Government Agencies" for the recent The New New Internet Conference. The author has also suggested a new enterprise information architecture and data management strategy in support of Goal 2: Information securely, rapidly, and reliably delivered to our stakeholders in the Federal CIO Council’s Strategic Plan. The essentials of that are summarized in Table 1. Interestingly, the experiment at the NSF Nanoinformatics Workshop to have both the nanotechnology subject matter experts and the semantic information specialists organize the workshop content showed that the latter do a better job when empowered with Web 2.0 (Wiki) and Web 3.0 (Semantic Wiki) tools.

11 2. Web 2.0: Locate, Collaborate, and Integrate ToolProgramPurpose Web SearchFederal Sitemaps Google: Federal Sitemaps Locate Most searches start with Google, Yahoo, and MSN WikisCOLAB Google: COLAB Wiki Collaborate Need to Share (8) Semantic WikisKnowledgebases Google: DRM 3.0 and Web 3.0 Integrate Responsibility to Provide (8) Table 1. Characteristics of Web 2.0: Locate, Collaborate, and Integrate in Support of Goal 2(7) Note: The FEA DRM 2.0 was written in the COLAB and is being implemented in Semantic Wikis! See Slide 15 for Footnotes.

12 2. Web 2.0: Locate, Collaborate, and Integrate Some recent communities of practices and their collaborations that were documented with the new Web 2.0: Locate, Collaborate, and Integrate paradigm are summarized in Table 2 for the recent Collaborative Expedition Workshop #68 and the upcoming National Academy of Science’s Transportation Research Board Annual Meeting.

13 2. Web 2.0: Locate, Collaborate, and Integrate DateTitleGoogleReference June NSF Nanoinformatics Workshop (11) June 18-19, 2007W3C/WSRI Workshop at the National Academy of Sciences (12) September 12-13, 2007Growing Communities on Karst 2007 Conference (13) October 15-16, 2007Shenandoah Valley Science Symposium Shenandoah Valley Science (14) See Slides for Footnotes. Table 2. Examples of Web 2.0: Locate, Collaborate, and Integrate

14 2. Web 2.0: Locate, Collaborate, and Integrate DateTitleGoogleReference November 9, 2007Mid-Atlantic Regional Planning Roundtable (15) November 15-16, 2007Sustainable Water Resources Roundtable (16) November 14-16, 2007Puget Sound Mashup of the EPA National Environmental Information Symposium Puget Sound Mashup (17) See Slides for Footnotes. Table 2. Examples of Web 2.0: Locate, Collaborate, and Integrate

15 2. Web 2.0: Locate, Collaborate, and Integrate Footnotes: –(7) Federal CIO Council’s Strategic Plan (FY ) –(8) March 6, 2007, GovExec.com, Intelligence chief establishes information- sharing panel. Mike McConnell, Director of National Intelligence: Move the intelligence community beyond the "need to share" philosophy toward a "responsibility to provide" model. –(11) June 12-13, 2007, NSF Workshop on Nanoinformatics, Sponsored by the National Nanotechnology Initiative and National Nanomanufacturing Network Nanoinformatics: bin/wiki.pl?NanoinformaticsStrategiesWorkshop_2007_06_1213http://colab.cim3.net/cgi- bin/wiki.pl?NanoinformaticsStrategiesWorkshop_2007_06_1213 –(12) June 18-19, 2007, W3C/WSRI Workshop at the National Academy of Sciences –(13) September 12-13, 2007, Growing Communities on Karst 2007: Conference Proceedings –(14) October , Shenandoah Valley Science Collaborative.

16 2. Web 2.0: Locate, Collaborate, and Integrate Footnotes (Continued): –(15) November 9, th Mid-Atlantic Regional Planning Roundtable –(16) November 15-16, 2007, Sustainable Water Resources Roundtable, NOAA Science Center, Silver Spring, MD –(17) November 14-16, 2007, Puget Sound Mashup of the EPA National Environmental Information Symposium October 11, 2007, Mary McCaffery, It’s All About the Strategy: Mashing Up the Environmental LOB and GEOSS. – December 20, 2007, Mary McCaffery, Puget Sound Information Challenge: Experiences and Lessons Learned (Presentation at the AIC Meeting that outlines the Puget Sound Information Challenge, an exercise held at the 2007 Environmental Information Symposium, and discusses the results and successes of this activity, as well as the implications for future collaborative efforts. –

17 2. Web 2.0: Locate, Collaborate, and Integrate The “Building Semantically Enabled Scientific Applications” example and the presentations and discussions at the recent Sustainable Water Resources Roundtable suggest three parts: –indicators, –model-driven assessments, and –validation with measurements and analyses. The model-driven assessments are captured in semantically interoperable knowledgebases that consist of semantic models (ontologies) and specific instances or examples. Since building knowledgebases and ontologies is relatively new and challenging, SICoP has provided several tutorials and a Knowledgebase Pilot of “Ontology for the Twenty First Century: An Introduction with Recommendations”. So the essence of this new approach is to apply the best ontology methodologies to the best content generated by the best subject matter experts to provide the best information access and decision support tools.

18 3. Water Indicators and Model- Driven Assessments The presentations and discussions at the Shenandoah Valley Science Symposium and the Sustainable Water Resources Roundtable emphasized the need for three activities for water: –indicators, –model-driven assessments, –and validation with measurements and analyses. The model-driven assessments (e.g. the CEQ Environmental Indicators Model and Information System) are captured in various schematic diagrams that need to be made more semantically explicit so they can be used to improve information access and even be executable to provide better decision support. This is the “third order of order” explained and illustrate in the previous paper in this series.

19 3. Water Indicators and Model- Driven Assessments By equating inflow and outflow, with the assumption that annual change in storage is relatively small, the water budget can be expressed as: –Precipitation + Surface-Water Inflow = Evapo-transpiration + Surface-Water Outflow In this case the equation forms the basis for the semantic model (ontology) at the top of the knowledgebase based on a most trusted reference knowledge source for the instances and examples (see Figure 1). For environmental indicators (including those for water), Figure 2 shows an ontology based on concepts, semantic linking, and metadata database using the environmental indicators in a trusted reference knowledge source (EPA’s 2007 Report on the Environment) for the knowledgebase. Figure 3 shows a Taxonomy Based on Using the Sustainable Water Resources Indicators in a Trusted Reference Knowledge Source (TRKS) for the Knowledgebase and the same for a similar TRKS not shown.

20 3. Water Indicators and Model- Driven Assessments Figure 1. Ontology Based on Concept Extraction and Semantic Linking Using the Water Budget Equation in a Trusted Reference Knowledge Source for the Knowledgebase

21 3. Water Indicators and Model- Driven Assessments Figure 2. Ontology Based on Concepts, Semantic Linking, and Metadata Database Using the Environmental Indicators in a Trusted Reference Knowledge Source for the Knowledgebase

22 3. Water Indicators and Model- Driven Assessments Figure 3. Taxonomy Based on Using the Sustainable Water Resources Indicators in a Trusted Reference Knowledge Source for the Knowledgebase

23 3. Water Indicators and Model- Driven Assessments Finally, the Pilot Wiki Pages on Water Cycle Ontology and High-Resolution Drought Forecasting System for the Shenandoah Valley Science Plan (Figure 4) outline that the Water cycle ontology will be incorporated into the SWEET Ontology with the Noesis, a smart semantic search tool, using the CUAHSI Controlled Vocabulary and Water Ontology Work, and the Global Change Master Directory water controlled words, among other things as will be discussed in the Water Management Cluster Meeting, January 9, 2008 (see Appendix I).

24 3. Water Indicators and Model- Driven Assessments Figure 4. Pilot Wiki Pages on Water Cycle Ontology High-Resolution Drought Forecasting System and for the Shenandoah Valley Science Plan

25 4. Recommendations This New Enterprise Information Architecture and Data Management Strategy for the U.S. Government explains and illustrates the terms Reduce, Reuse, and Recycle and is based on: –The premise of reusing the data and information rather than changing the data systems themselves: –Putting the business and technical rules, logic, etc. into the data itself using markup languages. –The concepts and standards of the Semantic Web: Also called the Data Web or Web 3.0.

26 4. Recommendations The most important tenets of the reuse are: –Bring the data and the metadata back together. –Bring the structured and unstructured data and information back together. –Bring the data and information description and context back together. The new enterprise information architecture and data management strategy supports all three orders of orders and the four functionalities of the DRM 3.0 / Web 3.0.

27 4. Recommendations This knowledgebase pilot supports the four functionalities of the DRM 3.0 / Web 3.0, namely: –(a) Data and Metadata: Full text of documents, standards, meeting notes, etc.; –(b) Harmonization: Different ways in which the same words are used; –(c) Enhanced Search: Across all content and showing context (e.g. words around the term or concepts); and –(d) Mashups: A website or application that combines content from more than one source into an integrated experience (repurposing, reuse). This knowledgebase pilot also supports the three orders of order as follows: –(a) we organize the knowledgebases themselves; –(b) we add structured semantic metadata to each knowledgebase; and –(c) the knowledgebases have taxonomies/ontologies that support semantic linking and searching.

28 4. Recommendations The following are recommended: –(1) Water Indicators and Assessments should be included as a new Line of Business in the OMB Federal Enterprise Architecture. –(2) The National Academy of Public Administration White Paper on “Institutional Arrangements for a System of Indicators on the Nation’s Environment” should be implemented using a new enterprise information architecture and data management strategy and an intensive pilot to select indicators for a nationally important issue like water quantity should be based on the collaborative work reported here. –(3) Build a Puget Sound Knowledgebase using EPA’s 2007 Report on the Environment Knowledgebase framework to produce a regional state of the environment report for the Puget Sound. NOAA has a trusted reference knowledge source that would provide an excellent basis for that.

29 5. References January 9-10, 2008, Federation of Earth Science Information Partners Meeting: The Changing Climate: Making Earth Science Relevant Again in Light of Global Warming, Washington, DC. A New Enterprise Information Architecture and Data Management Strategy for the U.S. Government Part 10: Web 2.0 for Earth Science Collaboration for Information Access Paper and Slides. Also see Water Cycle Ontology for Water Cluster Breakout SessionFederation of Earth Science Information Partners MeetingPaper SlidesWater Cycle OntologyWater Cluster Breakout Session

30 Appendices Appendix I: Water Management Cluster Meeting, January 9, 2008: –1. Briefly describe your participation in water management activities: Shenandoah Valley Science Collaboration, Mid-Atlantic Regional Planning Roundtable, Sustainable Water Resources Roundtable, Earth Science Information Partners, etc. –2. What data sources and technologies do you use? Data Sources: The best content from subject matter experts that have been peer reviewed and vetted. Technologies: Web 2.0 (Wikis) and Web 3.0 (Semantic Wikis and Semantic Desktops) to create and maintain knowledgebases (= semantic models + instances and examples). The semantic models are taxonomies/ontologies that support four key functionalities: namely (1) Data and Metadata: Full text of documents, standards, meeting notes, etc.; (2) Harmonization: Different ways in which the same words are used; (3) Enhanced Search: Across all content and showing context (e.g. words around the term or concepts); and (4) Mashups: A website or application that combines content from more than one source into an integrated experience (repurposing, reuse).

31 Appendices Appendix I: Water Management Cluster Meeting, January 9, 2008 (continued): –3. What are the key products and services that you CURRENTLY use? See Answer 2 above. –4. What are products and services that you would like to see developed? Products and services that support A New Enterprise Information Architecture and Data Management Strategy for the U.S. Government - Part 9: Reduce, Reuse, Recycle dochttp://colab.cim3.net/file/work/SICoP/EPADRM3.0/BNiemann doc

32 Appendices Appendix I: Water Management Cluster Meeting, January 9, 2008 (continued): –5. Please provide contact information and web site: Google: Brand Niemann – Google: Water Cycle Ontology and High-Resolution Drought Forecasting System Collaborations with ESIP - Will Pozzi, et. al. – – bin/wiki.pl?HighResolutionDroughForecastingSystemhttp://colab.cim3.net/cgi- bin/wiki.pl?HighResolutionDroughForecastingSystem Google: Shenandoah Valley Science –