SICoP 2011: Transforming Government through Innovation with Semantic Technologies
Semantic Tech and Business Conference, November 29 – December 1, 2011
Brand Niemann, Director and Senior Data Scientist, Semantic Community
Mills Davis, Managing Director, Project 10X
Chuck Rehberg, CTO, Semantic Insights 1
Some Examples of Build Systems of Systems in the Cloud
Semantic Community – US EPA and Federal CIO Council
1105 Government Information Group – Wyatt Kash
SEMIC.EU – Francisco Garcia Moran
DoD CIO – Teri Takai
Army Systems of Systems Engineering – Terry Edwards
NIEM PMO – Donna Roy
FAA – NextGen – Steve Cooper
Intelligence Community – Gus Hunt and Todd Myers
METI – Hiroyuki Hotta
DoD DCMO CTO – Dennis Wisnosky 2
DoD Deputy Chief Management Officer (DCMO) Request for Information (RFI) 3
Build DoD in the Cloud 4
System of Systems Architecture 6
– Semantic Index of Linked Data (e.g. Excel)
– Dynamic Case Management (e.g. Be Informed)
– Data Science Library (e.g. Spotfire)
– Data Science Products (e.g. Spotfire)
DoD CIO: IT Enterprise Strategic Management Tutorial Brand Niemann Director and Senior Data Scientist Semantic Community October 28,
My Best Strategic Advice to You Is:
Service Steps | Tool | Process
1. Provision Instead of Procure | Amazon Federal (secure) Web Services | AWS’s Migrating Applications to the Cloud
2. Linked Data on Mobile Devices for the Warfighter | MindTouch, Excel, SIRA, & Spotfire | Build DoD in the Cloud
3. Dynamic Case Management for Business Agility | Be Informed | Be Structured
Demos | Title | URL
AWS Gov Cloud Summit II | AWS Provision and Migrate | Web
Army Weapons 2011 | Knowledge and Capability Management Services for ASA(ALT) SOSE | Web
SemTech 2011 Washington DC | So Our Veterans Can Be Informed | Web 8
Applying Data Science to Build a Knowledgebase Knowledgebase = Model + Instances Model = Vocabulary, Taxonomy, and Ontology/Rules Instances = Linked Data Semantically Linked to the Model 9
Applying Data Science to Build a Knowledgebase
Model =
– Vocabulary – Glossary in MindTouch
– Taxonomy – Contents and Resources in MindTouch
– Ontology/Rules in Be Informed
Instances =
– Linked Data Semantically Linked to the Model
– MindTouch, Excel and Spotfire 10
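The definition "Knowledgebase = Model + Instances" can be pictured as a small data structure. This is only an illustrative sketch and not how MindTouch, Be Informed, or Spotfire store anything; the class names, the example term, and the example.org URL are all assumptions made up for the illustration.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Model:
    # Vocabulary: term -> definition (a glossary)
    vocabulary: Dict[str, str] = field(default_factory=dict)
    # Taxonomy: parent term -> child terms
    taxonomy: Dict[str, List[str]] = field(default_factory=dict)
    # Ontology/rules, kept as plain-text statements in this sketch
    rules: List[str] = field(default_factory=list)


@dataclass
class Instance:
    url: str   # linked-data URL of a document subpart
    term: str  # vocabulary term the URL is semantically linked to


@dataclass
class Knowledgebase:
    model: Model
    instances: List[Instance] = field(default_factory=list)

    def link(self, url: str, term: str) -> None:
        """Link a URL to a term; only terms defined in the model are allowed."""
        if term not in self.model.vocabulary:
            raise KeyError(f"term not in vocabulary: {term}")
        self.instances.append(Instance(url, term))

    def urls_for(self, term: str) -> List[str]:
        """Return every linked-data URL attached to a term."""
        return [i.url for i in self.instances if i.term == term]


# Hypothetical term and URL, for illustration only
kb = Knowledgebase(Model(vocabulary={"Warfighter": "placeholder definition"}))
kb.link("https://example.org/kb#section-1", "Warfighter")
```

Rejecting links to unknown terms is one way to keep every instance "semantically linked to the model" rather than floating free of it.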
Business Process
Best Content to a Knowledgebase (MindTouch)
– DoD IT Enterprise Strategy and Roadmap, etc.
Knowledgebase to Spreadsheet (Excel)
– Linked Data to Subparts of the Knowledgebase
Excel to Dashboard (Spotfire)
– Data integration and interoperability interface
Spotfire Dashboard to Dynamic Case Management (Be Informed)
– Structured process for updating data in the dashboard 11
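The "Knowledgebase to Spreadsheet" step amounts to a table of links to subparts of the knowledgebase. As a rough sketch, assuming each subpart is addressable as a base URL plus an anchor (the function name, base URL, and anchors below are hypothetical), such a table could be written as CSV that Excel can open:

```python
import csv
import io


def sections_to_csv(base_url, sections):
    """Return CSV text with one (Section, URL) row per knowledgebase subpart.

    sections is a list of (title, anchor) pairs; the URL scheme
    base_url#anchor is an assumption made for this sketch.
    """
    buf = io.StringIO()
    writer = csv.writer(buf, lineterminator="\n")
    writer.writerow(["Section", "URL"])
    for title, anchor in sections:
        writer.writerow([title, f"{base_url}#{anchor}"])
    return buf.getvalue()


csv_text = sections_to_csv(
    "https://example.org/kb/strategy",
    [("Goals", "goals"), ("Roadmap", "roadmap")],
)
```

The resulting file is the kind of two-column input a dashboard tool can then load as a data table.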
Best Content to a Knowledgebase (MindTouch) 12 Well-defined URLs to subparts of the document and its figures and tables.
Best Content to a Knowledgebase (MindTouch) 13 Desktop publishing-like control of content.
Knowledgebase to Spreadsheet (Excel) 14
Knowledgebase to Spreadsheet (Excel) 15
Excel to Dashboard (Spotfire) 16 PC Desktop
Excel to Dashboard (Spotfire) 17 Web Player See the Data Sort/Facet Search the Data Download the Data Share the Data (iPad)
Spotfire to Be Informed
Be Structured Questions:
– Greenfield: Are you starting the report from scratch?
– Legacy: Are you updating the report?
– Dynamic Case Management: Does Spotfire need to be updated frequently?
So the Business Process Pattern Includes:
– Greenfield, Legacy, and Dynamic Case Management
Definitions (and Rules!):
– DoD Dictionary and JCIDS Manual
Executable Model:
– Semantically Linked Taxonomy, Ontology, and Rules (started in previous slides) 18
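The three Be Structured questions can be read as a simple mapping from yes/no answers to business process patterns. The sketch below is an assumption about how that mapping might look as a rule; it is not Be Informed's executable model, and the function and parameter names are invented for the illustration.

```python
def applicable_patterns(from_scratch, updating_existing, frequent_updates):
    """Map the three yes/no answers to the patterns they select."""
    answers = [
        ("Greenfield", from_scratch),                   # starting the report from scratch?
        ("Legacy", updating_existing),                  # updating an existing report?
        ("Dynamic Case Management", frequent_updates),  # dashboard updated frequently?
    ]
    return [name for name, yes in answers if yes]


patterns = applicable_patterns(
    from_scratch=False, updating_existing=True, frequent_updates=True
)
```

Returning a list rather than a single label avoids assuming any precedence among the patterns, which the slide does not state.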
Spotfire to Be Informed 19 Semantic Insights Research Assistant (SIRA). See next slide.
SIRA: Builds a simple report and an ontology of the content 20
Spotfire to Be Informed 21
Web Player
– DoD Dictionary Terms (5093) and Acronyms (6599)
– DoD Campaign Plan (100 Linked data URLs)
– GAO DoD Enterprise Architecture (113 Linked data URLs)
– JCIDS Manual (405 Linked data URLs)
Note: There are additional Spotfire files with DoD data:
– Build DoD (Vocabulary) in the Cloud
– Build DoD Semantic Data Architecture in the Cloud
– Build the DoD IG Report to Congress in the Cloud
See: Binary Services for DoD CIO and DHS NIEM PMO, PowerPoint, October 14, 2011, 48 slides.
Work in Process 22
Federation of Data Sets Strategy
One Table:
– Two Columns: Example: Column 1: Section and Column 2: URL
  – Note: A Column 3: Description could be in the URL
  – Example: See Slide 14
– Three Columns: Example: Column 1: Subject, Column 2: Object, and Column 3: Predicate
  – Note: This is the Semantic Web’s Linked Open Data Cloud as Linked Open Data for Network Analytics!
  – Example: See Next Slide
– Four Columns: Examples: Column 1: Subject, Column 2: Attribute, Column 3: From, and Column 4: To, or Column 1: City, Column 2: Country, Column 3: Longitude, and Column 4: Latitude
  – Note: This is the format for Spotfire’s Network Analytics Module developed for the CIA
  – Example: See Slide 25 23
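To make the relationship between the two-column and three-column layouts concrete, here is a minimal sketch that turns (Section, URL) rows into rows in the slide's Subject, Object, Predicate column order. The predicate name `hasURL` and the example URLs are hypothetical; the slides do not name a predicate.

```python
PREDICATE = "hasURL"  # hypothetical predicate name, not taken from the slides


def two_to_three_columns(rows):
    """Convert (Section, URL) rows to (Subject, Object, Predicate) rows."""
    return [(section, url, PREDICATE) for section, url in rows]


two_column = [
    ("Goals", "https://example.org/kb#goals"),
    ("Roadmap", "https://example.org/kb#roadmap"),
]
three_column = two_to_three_columns(two_column)
```

Each section becomes a subject and each URL an object, so the same data can be fed to tools that expect statement-style rows.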
Linked Open Data Cloud as Linked Open Data for Network Analytics 24 Is it easy to add columns for who links to whom? Not in a single table. SPARQL can't do cross-tabulation (Richard Cyganiak).
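A cross-tabulation of who links to whom can be done outside SPARQL once the links are exported as rows. The sketch below assumes the links arrive as (source, target) pairs; the dataset names are placeholders and the function name is invented for the illustration.

```python
from collections import Counter


def crosstab(links):
    """Build a who-links-to-whom matrix from (source, target) pairs.

    Returns (labels, matrix), where matrix[i][j] is the number of links
    from labels[i] to labels[j].
    """
    counts = Counter(links)
    labels = sorted({name for pair in counts for name in pair})
    matrix = [[counts[(row, col)] for col in labels] for row in labels]
    return labels, matrix


# Placeholder dataset names, for illustration only
links = [("A", "B"), ("A", "C"), ("B", "C"), ("A", "B")]
labels, matrix = crosstab(links)
# labels → ['A', 'B', 'C']
# matrix → [[0, 2, 1], [0, 0, 1], [0, 0, 0]]
```

Each row of the matrix is a source and each column a target, which is the "extra columns" view that a single three-column table does not give directly.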
Spotfire Network Analytics 25
Federation of Data Sets Strategy 26 Multiple Data Tables
Spotfire Information Designer 27
Spotfire Information Designer 28
Federated SPARQL Query: CKAN 29 Federated SPARQL Query across 14 data sources for 40,000 results (Richard Cyganiak). See Slide 31.
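For readers unfamiliar with federated queries, SPARQL 1.1 provides a `SERVICE` clause that sends part of a query to a remote endpoint. The sketch below only builds the query text; it is not the query behind the result cited above, and the endpoint URLs, default pattern, and function name are assumptions.

```python
def federated_query(endpoints, pattern="?s ?p ?o", limit=100):
    """Build a SPARQL 1.1 query that unions one SERVICE block per endpoint."""
    if not endpoints:
        raise ValueError("at least one endpoint is required")
    blocks = [f"  {{ SERVICE <{url}> {{ {pattern} }} }}" for url in endpoints]
    body = "\n  UNION\n".join(blocks)
    return f"SELECT * WHERE {{\n{body}\n}}\nLIMIT {limit}"


# Hypothetical endpoint URLs
query = federated_query(
    ["https://example.org/sparql-a", "https://example.org/sparql-b"]
)
```

The generated text would then be submitted to a SPARQL processor that supports federation, which dispatches each `SERVICE` block to its endpoint and merges the results.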
Federated SPARQL Query: Virtuoso 30 Linked Data Virtualization meets Oracle via ODBC (Kingsley Idehen)
Spotfire: Semantic Web Data 31 W3C Linked Data Community Directory (David Wood, as of November 28, 2011) Web Player
DoD RFI Questions
– Can the interview be done at/after the November 30th Industry Discussion?
– Can one see the EIW work done so far in presentations and Web content?
– Can one use the EIW public content to pilot semantic technologies while all of this RFI process is going on?
– Is the overall goal to significantly reduce the number and cost of DoD Information Systems, as my first story found?
– Is the RFI goal to federate selected DoD content for delivery to the warfighter on mobile devices over secure networks? E.g. my TBI knowledgebase
– Is the RFI goal to deliver public DoD content so the public can better understand and use it? E.g. my Weapons Systems 2011 knowledgebase
– Is the RFI goal to use the most advanced semantic technology that I know of to do Dynamic Case Management (e.g. Be Informed)?
– Is the DoD considering forming a Semantic Interoperability Center of Excellence like the EU’s SEMIC.EU?
– Should industry form a non-profit consortium with multiple organizations to do this kind of work under, say, a CRADA, like the NCOIC is doing for the FAA NextGen and National Geospatial-Intelligence Agency?
– Will there be a follow-up event to discuss the results of the RFI with the participants/public?
– How many have responded to the RFI/are registered to attend the two Industry Events? 32