Copyright Antidot™ 1 Linked Enterprise Data LEVERAGING THE SEMANTIC WEB STACK IN A CORPORATE ENVIRONMENT ISWC 2012 – BOSTON FABRICE LACROIX –

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

© 2006 IBM Corporation Features of an Enterprise-ready Triple Store Ben Szekely June, 2006.
Classification & Your Intranet: From Chaos to Control Susan Stearns Inmagic, Inc. E-Libraries E204 May, 2003.
Schema Matching and Query Rewriting in Ontology-based Data Integration Zdeňka Linková ICS AS CR Advisor: Július Štuller.
Making Search Relevant SchemaLogic Gary Carlson Chief Taxonomist
Haystack: Per-User Information Environment 1999 Conference on Information and Knowledge Management Eytan Adar et al Presented by Xiao Hu CS491CXZ.
GridVine: Building Internet-Scale Semantic Overlay Networks By Lan Tian.
Stefania Bergamasco, Cecilia Colasanti An integrated approach to turn statistics into knowledge combining data warehouse, controlled vocabularies and advanced.
SAP BI ConnectorDuet Enterprise for Microsoft SharePoint and SAP SAP NetWeaver Gateway productivity accelerator for Microsoft Synch enterprise data.
Data Intensive Techniques to Boost the Real-time Performance of Global Agricultural Data Infrastructures SEMAGROW U SING A POWDER T RIPLE S TORE FOR BOOSTING.
From Relational to Semantics A Methodology Arka Mukherjee, Ph.D. Founder / CTO Global IDs David Schaengold Director,
Information and Business Work
1 Introduction to XML. XML eXtensible implies that users define tag content Markup implies it is a coded document Language implies it is a metalanguage.
© 2006 IBM Corporation IBM Software Group Relevance of Service Orientated Architecture to an Academic Infrastructure Gareth Greenwood, e-learning Evangelist,
1 Software architecture adjustments for a changing business.
Xyleme A Dynamic Warehouse for XML Data of the Web.
Semantic Search Jiawei Rong Authors Semantic Search, in Proc. Of WWW Author R. Guhua (IBM) Rob McCool (Stanford University) Eric Miller.
A web-based repository service for vocabularies and alignments in the Cultural Heritage domain Lourens van der Meij Antoine Isaac Claus Zinn.
Libraries and Institutional Content Management Systems
Implementing Metadata Marjorie M K Hlava, President Access Innovations, Inc. Albuquerque, NM
MUSCLE WP9 E-Team Integration of structural and semantic models for multimedia metadata management Aims: (Semi-)automatic MM metadata specification process.
1 Semantic Data Management Xavier Lopez, Ph.D., Director, Spatial & Semantic Technologies.
Redefining Perspectives A thought leadership forum for technologists interested in defining a new future June COPYRIGHT ©2015 SAPIENT CORPORATION.
Corporate Efficiency Meeting Improving Your Business Processes Using SharePoint and Beyond.
MDC Open Information Model West Virginia University CS486 Presentation Feb 18, 2000 Lijian Liu (OIM:
Managing Large RDF Graphs (Infinite Graph) Vaibhav Khadilkar Department of Computer Science, The University of Texas at Dallas FEARLESS engineering.
What Can Do for You! Fabian Christ
SecureAware Building an Information Security Management System.
Advances in Technology and CRIS Nikos Houssos National Documentation Centre / National Hellenic Research Foundation, Greece euroCRIS Task Group Leader.
Semantic Web outlook and trends May The Past 24 Odd Years 1984 Lenat’s Cyc vision 1989 TBL’s Web vision 1991 DARPA Knowledge Sharing Effort 1996.
Michalis Vafopoulos NTUA, GFOSS & The transformers GREEN CITY HACKATHON.
November 2003 Presented to “Commercializing RDF” Semantic Software Solutions for Enterprise Web Management International World Wide Web Conference 2004.
Using the Open Metadata Registry (openMDR) to create Data Sharing Interfaces October 14 th, 2010 David Ervin & Rakesh Dhaval, Center for IT Innovations.
Ontology-Driven Automatic Entity Disambiguation in Unstructured Text Jed Hassell.
© 2007 by Prentice Hall 1 Introduction to databases.
Business and IT Working Together to Streamline Corporate Reporting Stephen Hord, Director of Product Development – UBmatrix.
Linked-data and the Internet of Things Payam Barnaghi Centre for Communication Systems Research University of Surrey March 2012.
Department of computer science and engineering Two Layer Mapping from Database to RDF Martin Švihla Research Group Webing Department.
The Development of the Siemens Knowledge Community Support By: Matt Greaves.
Oracle Database 11g Semantics Overview Xavier Lopez, Ph.D., Dir. Of Product Mgt., Spatial & Semantic Technologies Souripriya Das, Ph.D., Consultant Member.
Delivering Fixed Content to Oracle Portal Doug Daniels & Ken Barrette Quest Software.
The Semantic Logger: Supporting Service Building from Personal Context Mischa M Tuffield et al. Intelligence, Agents, Multimedia Group University of Southampton.
Technical Update 2008 Sandy Payette, Executive Director Eddie Shin, Senior Developer April 3, 2008 Open Repositories 2008, Fedora User Group.
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
Using Open Data to Create Value for Citizens. Data.gov Provides instant access to ~400,000 datasets in easy to use formats Contributions from UN, World.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Integrated business-information system for sales process support Bitrix24 Marta Alić, prof. University of Applied Sciences, Zagreb.
All rights reserved © 2005 Eminent System. April 14, Oct 04, 2007 EMINENT SYSTEM.
© 2006 University of Kansas An LSID resolver for specimens and a digression into issues raised by the use of GUIDs Steve Perry
THE SEMANTIC WEB By Conrad Williams. Contents  What is the Semantic Web?  Technologies  XML  RDF  OWL  Implementations  Social Networking  Scholarly.
Knowledge Modeling and Discovery. About Thetus Thetus develops knowledge modeling and discovery infrastructure software for customers who: Have high-value.
1 Open Ontology Repository initiative - Planning Meeting - Thu Co-conveners: PeterYim, LeoObrst & MikeDean ref.:
Copyright All right reserved 1 i - LIKE Linked Data enrichment for an e-learning system Networked interactions to create, learn and share knowledge.
Improving User Access to Metadata for Public and Restricted Use US Federal Statistical Files William C. Block Jeremy Williams Lars Vilhuber Carl Lagoze.
1 DMS-DQS-SUPSC03-PRE-12-E © DEIMOS Space S.L., 2007 A Semantic Data Grid for Satellite Mission Quality Analysis Reuben Wright Deimos Space.
© 2009 OpenLink Software, All rights reserved. Mapping Relational Databases to RDF with OpenLink Virtuoso Orri Erling - Program Manager, Virtuoso.
On Demand RDF Databases in the Cloud Presentation for the Ontology Forum March 3, 2016.
26/02/ WSMO – UDDI Semantics Review Taxonomies and Value Sets Discussion Paper Max Voskob – February 2004 UDDI Spec TC V4 Requirements.
Cut down on the time it takes employees to process invoices using Square 9’s SmartSearch integration with Microsoft Dynamics GP. SmartSearch allows invoice.
Semantic Web Technologies Readings discussion Research presentations Projects & Papers discussions.
Giuseppina Inserra INFN Catania
WHAT DOES THE FUTURE HOLD? Ann Ellis Dec. 18, 2000
Lifting Data Portals to the Web of Data
Experience Management
Vonk FHIR Engine Christiaan Knaap 27 September 2018.
About Thetus Thetus develops knowledge discovery and modeling infrastructure software for customers who: Have high value data that does not neatly fit.
Project Advance Introduction.
Construction of Enterprise Knowledge Graphs
A framework for ontology Learning FROM Big Data
Presentation transcript:

Copyright Antidot™ 1 Linked Enterprise Data LEVERAGING THE SEMANTIC WEB STACK IN A CORPORATE ENVIRONMENT ISWC 2012 – BOSTON FABRICE LACROIX –

Copyright Antidot™ 2 Antidot – who we are French-based Software Vendor  Since 1999 | Paris, Lyon, Aix-en-Provence  Information access | Data management Mission: Provide our customers with innovative customizable solutions that help them create value with their data, and make their employees more aware and efficient.

Copyright Antidot™ 3 Clients Publishing Healthcare EnterprisesE-commerce

Copyright Antidot™ 4 Unstructured documents files, ECM, collaborative spaces intranet, extranet, Web sites s, instant messaging

Copyright Antidot™ 5 Structured data CRM, ERP, directory knowledge bases business applications (production, support)

Copyright Antidot™ 6 IS are bloated 1 practice => 1 need => 1 application => 1 silo Information system is driven by the process Data are numerous, various and scattered

Copyright Antidot™ 7 Solutions or workarounds? BIMDM SOASearch

Copyright Antidot™ 8 Solutions and workarounds Enterprise Search brings little value to users  Document oriented  Does not solve real business problems Google like Verity like

Copyright Antidot™ 9 What we want

Copyright Antidot™ 10 What we want LDAP CRM Production ERP ECM Files Support

Copyright Antidot™ 11 Changing the paradigm Switching from an application view to a data centric way of thinking.

Copyright Antidot™ 12 Bring out the implicit Build the Giant Enterprise Graph

Copyright Antidot™ 13 LED Linked Enterprise Data application of the Semantic Web technologies and Linked Data principles to the enterprise infrastructure

Copyright Antidot™ 14 What works for the Web… Federating silos on the Web

Copyright Antidot™ 15 …can’t always be used in corporate IS  Legacy apps can’t be "Sparql’ed"  80% un- or semi- structured data don’t fit in the model as such  Defining vocabularies/ontologies for silos is too complex and expensive  Don’t want RDF per se but valuable information  External data is available in XML/JSON through Web Services  Staff trained for RDB, XML, Web apps.  No Risk and stability strategy: SemWeb technology considered as new and immature

Copyright Antidot™ 16 The RDF/storage approach Setting up a global RDF repository does not work either  ITs are afraid by the "RDF everywhere" activists

Copyright Antidot™ 17 Semantic Web technology still is the right solution in corporate environment BUT it is not an aim JUST use it as a means

Copyright Antidot™ 18 Just do it Think of it as a stream paradigm  build new objects using existing data  without interfering with the existing infrastructure  with SemWeb somewhere under the hood

Copyright Antidot™ 19 Enterprise Graph HowTo Construct the graph  generate triples from data  create triples from documents Leverage the graph  enrich  infer Browse the graph  select resources  build objects Trash the graph

Copyright Antidot™ 20 How: extract & normalize Harvest and normalize  as in an ETL  fetch, clean, transform…  normalize records (names, IDs) to prepare the linking step For databases  db2triples : an RDB2RDF implementation by Antidot (open source, W3C validated)

Copyright Antidot™ 21 How: semantize Don’t transform everything in RDF  cherry-pick a subset of interesting fields for each object and create their RDF triples counterpart  interesting == needed for linking or inferring Semantize

Copyright Antidot™ 22 How: semantize Triples generation  Be smart: avoid upfront ontology design, use small vocabularies  Be pragmatic: transform XML tags and field names to predicates  Be agile: only insert what you need. And when you need more, add more. Semantic Web fuels the modeling, linking and information building process

Copyright Antidot™ 23 Enterprise Graph HowTo Construct the graph  generate triples from data  create triples from documents Leverage the graph  enrich  infer Browse the graph  select resources  build objects Trash the graph

Copyright Antidot™ 24 How: semantize Unstructured documents  Extract metadata and transform them as needed to RDF. ➡ Ex: author => dc:creator  Use of text-mining to extract named entities: people, organizations, products… ➡ generate those entities list using the data sources: directory for employees, CRM for companies and people, ERP for products ➡ create triples like doc_URI quotes entity_URI

Copyright Antidot™ 25 How: semantize Unstructured documents  Compare documents using various and dedicated algorithms ➡ is the same ➡ is included ➡ is similar ➡ is related  Generates new triples ➡ create triples like is_sub_version_of

Copyright Antidot™ 26 Enterprise Graph HowTo Construct the graph  generate triples from data  create triples from documents Leverage the graph  enrich  infer Browse the graph  select resources  build objects Trash the graph

Copyright Antidot™ 27 How: enrich Enrich the graph  run specific algorithms to generate more links and triples (classifiers, topic detection, …)  insert external data gathered from the LOD or other external datasets or APIs

Copyright Antidot™ 28 How: infer Create new knowledge  add rules according to your needs IF a coworker is quoted in documents THEN the business unit is bound to the documents AND this coworker belongs to a business unit

Copyright Antidot™ 29 Enterprise Graph HowTo Construct the graph  generate triples from data  create triples from documents Leverage the graph  enrich  infer Browse the graph  select resources  build objects Trash the graph

Copyright Antidot™ 30 How: build Build  select resources corresponding to objects seeds (using Sparql queries)  for each seed, follow links smartly in order to create basic objects Build

Copyright Antidot™ 31 How: build Finalize  decorate the new knowledge objects with data set apart (not loaded in the triplestore)  now we have rich user-actionable objects Build Finalize

Copyright Antidot™ 32 Enterprise Graph HowTo Construct the graph  generate triples from data  create triples from documents Leverage the graph  enrich  infer Browse the graph  select resources  build objects Trash the graph

Copyright Antidot™ 33 How: expose Make the new information available to users and to the entire IS Enrich Harvest Classify Semantize Normalize Annotate Indexation AFS search engine RDF Triplestore (Linked Data) Relational DB

Copyright Antidot™ 34 Conclusion It works!  The triples we create and the inference rules we add are dictated by the goal / application ➡ usage and value oriented  We benefit from the lazy-flexible-dynamic modeling of RDF-RDFS-OWL ➡ we are agile  What matters is the graph. But the graph is not the triplestore ➡ storage independent

Copyright Antidot™ 35 There’s an app for that Antidot Information Factory  a software solution designed specifically to leverage structured and unstructured data  enable large-scale processing of existing data  automate publishing of enriched or newly created information. Harvest Normalize Semantize Enrich Build Expose

Copyright Antidot™ 36 The Giant Enterprise Graph Now we have a path to let SemWeb enter the enterprise

Copyright Antidot™ 37 THANKS FOR YOUR ATTENTION QUESTIONS? Discuss Understand Learn Exchange