Brand Niemann, US EPA & Co-Chair SICoP

Presentation transcript:

A Needle in a Haystack: What Web Users Are Searching For: The Federal Sitemaps Initiative Brand Niemann, US EPA & Co-Chair SICoP Excellence in Government Conference Washington Convention Center Breakout Session II: April 4, 11:15 am to 12:15 pm Google: Federal Sitemaps Google: SICoP

Prospectus "A Needle in a Haystack: What Web Users Are Searching For." The federal government is both the world's largest information source and the inventor of the Internet. So why is it so hard for federal employees and citizens to find the information they need? Hear how Google and other leaders in Internet search technology make government more open, transparent and customer-focused. Gain new perspectives on how to embrace technologies to increase your agency's presence on the Web.

Agenda
Moderator: Jon Desenberg, PerformanceWeb.Org
Google: JL Needham. Sitemaps at FOSE 2007 and the need for agencies to balance their investment in web and site search (see next slide).
Science.Gov: Walt Warnick. OSTI's specific (and ongoing) experience with implementing sitemaps to make deep web information accessible to researchers using search engines.
State Department: Luigi Canali. Managing its web publishing centrally and, in particular, implementing sitemaps to ensure automatic communication of newly added content to Google.
Federal CIO Council’s Sitemaps Initiative: Brand Niemann. Broader policy context and the value of the Federal government as a whole embracing the Sitemap protocol and similar standards.

Web search vs. site search
Supporting the two levels of search
(Attribute: web search / site search)
Search scope: All of the open and accessible deep web / A segment of your public sites’ content
User: Citizens and professionals / Professionals and citizens
Freshness: Search engine crawling intervals / Customizable
Crawling: Limited by robots.txt, dynamic content / Limited by server capacity and cost
Reporting tools: High-level stats / More detailed, all facets
Cost: Free / Varies
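
The robots.txt limit on web search crawling noted above can be illustrated with a minimal sketch using Python's standard urllib.robotparser module. The robots.txt rules, the example.gov URLs, and the is_crawlable helper are hypothetical and are not taken from any agency site; the sketch only shows how a disallow rule hides a dynamic (database-driven) page from a compliant crawler.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content: blocks every crawler from /search/,
# which is where this imaginary site serves its database-driven pages.
ROBOTS_TXT = """\
User-agent: *
Disallow: /search/
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def is_crawlable(url, user_agent="*"):
    """Return True if the parsed robots.txt rules allow user_agent to fetch url."""
    return parser.can_fetch(user_agent, url)


# A static report page is allowed; a dynamic query page under /search/ is not.
static_ok = is_crawlable("https://www.example.gov/reports/plan.html")
dynamic_ok = is_crawlable("https://www.example.gov/search/results?id=42")
print(static_ok, dynamic_ok)
```

Pages excluded this way stay invisible to web search no matter how often the crawler returns, which is one reason the table treats crawling as a separate dimension from freshness.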

Federal Government Context Government information is estimated to be about 80% unstructured, and about 90% of the structured information is estimated to be invisible to search engine crawlers and users. In addition, (1) the UK government recently announced that hundreds of its websites are being consolidated or shut down to make access to information easier for people, and (2) the recent SICoP Special Conference on Building DRM 3.0 and Web 3.0 supported the Federal CIO Council Strategic Plan for FY 2007-2009 Goal 2 (Information securely, rapidly, and reliably delivered to our stakeholders) by providing implementation strategies, best practices, and success stories. It therefore seems appropriate to pilot a process that deals with all of these issues at the same time.

EPA Context Total: 27 Sample list of EPA sites with uncrawlable elements: http://spreadsheets.google.com/pub?key=pUb62ZKHnzgqEoGF4LFf3Gw

EPA Webmaster Experience “Sitemaps as a method for discovering database content is something that I heartily endorse. It makes sense, and it's good to have a data standard for doing it. Google, et. al. are to be commended for that. Too bad it's such a minimalist protocol! As we work to expose database contents to our internal search engine, we will keep in mind the need to express that content in a Sitemap protocol as well. EIMS is our first target database, hopefully tackling it this spring.” Source: John Shirey, Notes on Federal Sitemaps Discussion, January 10, 2007.
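
As a rough illustration of expressing database content in the Sitemap protocol, the sketch below turns a list of records into a Sitemap XML document using Python's standard library. The record URLs, dates, and the build_sitemap helper are hypothetical; only the urlset, url, loc, and lastmod elements and the namespace come from the published protocol. It is a minimal sketch, not a description of how EIMS or any EPA system was actually exposed.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the Sitemap protocol (sitemaps.org, version 0.9).
SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def build_sitemap(records):
    """Return Sitemap XML as a string for (url, last_modified) pairs.

    last_modified is a date string such as "2007-01-10", or None to omit it.
    """
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    for url, last_modified in records:
        entry = ET.SubElement(urlset, "url")
        ET.SubElement(entry, "loc").text = url  # required by the protocol
        if last_modified:
            ET.SubElement(entry, "lastmod").text = last_modified  # optional
    return ET.tostring(urlset, encoding="unicode")


# Hypothetical rows, standing in for the result of a database query.
records = [
    ("https://www.example.gov/records?id=1", "2007-01-10"),
    ("https://www.example.gov/records?id=2", None),
]
sitemap_xml = build_sitemap(records)
print(sitemap_xml)
```

Each database record becomes one url entry, so a crawler that reads the file can reach pages it could never discover by following links. The small element set is also what the webmaster's "minimalist protocol" remark refers to: beyond location, last-modified date, change frequency, and priority, there is no place for richer metadata.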

EPA Pilot
March 15th, EPA Web Workgroup Presentation
Objectives:
Structure unstructured EPA information.
Make EPA databases visible to search engine crawlers and users.
Consolidate EPA information to make it easier to use.
Provide semantic metadata and linking in support of DRM 3.0 and Web 3.0 applications.
Pilot Content: The new EPA Strategic Plan, Report on the Environment, Enterprise Architecture, and Performance Results were used to illustrate the “long tail” of search (being successful with obscure queries).
See http://colab.cim3.net/file/work/SICoP/2007-03-15/SICoPEPAWWG03152007.ppt

Policy Context
The CIO Council's XML Community of Practice (xml.gov) and the Semantic Interoperability Community of Practice (SICoP) encourage adoption and implementation of the Sitemap protocol by federal agencies because it:
Supports the E-Government Act of 2002 (Pub. L. No. 107-347).
Supports the Federal Enterprise Architecture's Data Reference Model 2.0.
Supports the SICoP DRM 2.0 Implementation - Knowledge Reference Model.
Supports the new CIOC Strategic Plan FY 2007-2009.

Policy Context and Policy Response
E-Government Act of 2002: Organize and categorize information intended for public access and ensure it is searchable across agencies.
Federal Enterprise Architecture's Data Reference Model 2.0: Identify how information and data are created, maintained, accessed, and used.
SICoP DRM 2.0 Implementation - Knowledge Reference Model: Use of increasing metadata to provide increasingly powerful search results. See next slide.
CIOC Strategic Plan FY 2007-2009: Provide updates to the FEA Data Reference Model (DRM) and establish DRM implementation strategies, best practices, and success stories.

From Search to Knowing Source: Figure 10 in SICoP White Paper Series Module 2: Semantic Wave 2006 - Executive Guide to the Business Value of Semantic Technologies, May 15, 2006, Principal Author Mills Davis, Project10X.

From Search to Knowing From bottom to top, the amount, kinds, and complexity of metadata, modeling, context, and knowledge representation increase. From left to right, reasoning capabilities advance from (a) information recovery based on linguistic and statistical methods, to (b) discovery of unexpected relevant information and associations through mining, to (c) intelligence based on correlation of data sources, connecting the dots, and putting information into context, to (d) question answering ranging from simple factoids to complex decision support, and (e) smart behaviors including robust adaptive and autonomous action.

From Search to Knowing Moving from lower left to upper right, the diagram depicts a spectrum of progressively more capable categories of knowledge representation together with standards and formalisms used to express metadata, associations, models, contexts, and modes of reasoning. As the amount and expressive power of the semantics and knowledge increase, so does the value of the reasoning capacity they enable.

Upcoming Events
April 25, 2007: SICoP Special Conference 2: Building Knowledgebases for Cross-Domain Semantic Interoperability. Google: DRM 3.0 and Web 3.0
May 6-8, 2007: The 22nd Semi-Annual Spring Government CIO Summit. Government by Wiki: New Tools for Collaboration, Information-Sharing, and Decision-Making. Web 2.0 Essentials for Government: Tying It All Together in a Service System.