Developing Semantic Web Sites: Results and Lessons Learnt Enrico Motta, Yuangui Lei, Martin Dzbor, Vanessa Lopez, John Domingue, Jianhan Zhu, Liliana Cabral,

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

AeroDAML Applying Information Extraction to Generate DAML Annotations Dr. Paul Kogut Lockheed Martin Management & Data Systems.
1 OOA-HR Workshop, 11 October 2006 Semantic Metadata Extraction using GATE Diana Maynard Natural Language Processing Group University of Sheffield, UK.
Intelligent Technologies Module: Ontologies and their use in Information Systems Revision lecture Alex Poulovassilis November/December 2009.
Modelling Data-Intensive Web Sites with OntoWeaver Knowledge Media Institute The Open University Yuangui Lei, Enrico Motta, John Domingue {y.lei, e.motta,
WP8: User Centred Applications Enrico Motta, Marta Sabou, Vanessa Lopez, Laurian Gridinoc, Lucia Specia Knowledge Media Institute The Open University Milton.
16/11/ IRS-II: A Framework and Infrastructure for Semantic Web Services Motta, Domingue, Cabral, Gaspari Presenter: Emilia Cimpian.
Haystack: Per-User Information Environment 1999 Conference on Information and Knowledge Management Eytan Adar et al Presented by Xiao Hu CS491CXZ.
1 UIM with DAML-S Service Description Team Members: Jean-Yves Ouellet Kevin Lam Yun Xu.
Learning Semantic Information Extraction Rules from News The Dutch-Belgian Database Day 2013 (DBDBD 2013) Frederik Hogenboom Erasmus.
Page 1 Integrating Multiple Data Sources using a Standardized XML Dictionary Ramon Lawrence Integrating Multiple Data Sources using a Standardized XML.
Helping people find content … preparing content to be found Enabling the Semantic Web Joseph Busch.
Information and Business Work
Towards an NLP `module’ The role of an utterance-level interface.
Research topics Semantic Web - Spring 2007 Computer Engineering Department Sharif University of Technology.
Applications Chapter 9, Cimiano Ontology Learning Textbook Presented by Aaron Stewart.
Xyleme A Dynamic Warehouse for XML Data of the Web.
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
OWL-AA: Enriching OWL with Instance Recognition Semantics for Automated Semantic Annotation 2006 Spring Research Conference Yihong Ding.
Low-cost semantics-enhanced web browsing with Magpie Enrico Motta Knowledge Media Institute The Open University, UK.
Overall Information Extraction vs. Annotating the Data Conference proceedings by O. Etzioni, Washington U, Seattle; S. Handschuh, Uni Krlsruhe.
By : Vanessa López, Enrico Motta Knowledge Media Institute. Open University Ontology-driven question answering in: AQUALog 9 th International Conference.
Integrating data sources on the World-Wide Web Ramon Lawrence and Ken Barker U. of Manitoba, U. of Calgary
Integration of Information Extraction with an Ontology M. Vargas-Vera, J.Domingue, Y.Kalfoglou, E. Motta and S. Buckingham Sum.
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
Automatic Data Ramon Lawrence University of Manitoba
Hands On Session John Domingue & Liliana Cabral Knowledge Media Institute The Open University, UK.
Toward Semantic Web Information Extraction B. Popov, A. Kiryakov, D. Manov, A. Kirilov, D. Ognyanoff, M. Goranov Presenter: Yihong Ding.
Characterizing Semantic Web Applications Prof. Enrico Motta Director, Knowledge Media Institute The Open University Milton Keynes, UK.
Semantic Web for E-Science and Education Enrico Motta Knowledge Media Institute The Open University, UK.
Improving Data Discovery in Metadata Repositories through Semantic Search Chad Berkley 1, Shawn Bowers 2, Matt Jones 1, Mark Schildhauer 1, Josh Madin.
Semantic Interoperability Jérôme Euzenat INRIA & LIG France Natasha Noy Stanford University USA.
Erasmus University Rotterdam Introduction With the vast amount of information available on the Web, there is an increasing need to structure Web data in.
Knowledge Management in Geodise Geodise Knowledge Management Team Liming Chen, Barry Tao, Colin Puleston, Paul Smart University of Southampton University.
Some Thoughts on HPC in Natural Language Engineering Steven Bird University of Melbourne & University of Pennsylvania.
1 The BT Digital Library A case study in intelligent content management Paul Warren
There are 5 dimensions we need to consider to characterise the next version of ASPL –New Services E.g., impact analysis –Ontologies Domain, argumentation,
September 30, 2002EON 2002Slide 1 Integrating Ontology Storage and Ontology-based Applications A lesson for better evaluation methodology Peter Mika:
1 Technologies for (semi-) automatic metadata creation Diana Maynard.
Populating A Knowledge Base From Text Clay Fink, Tim Finin, Christine Piatko and Jim Mayfield.
Motivation “Businesses spend up to $100 billion each year to train workers. Yet estimates are that less than 10% of this training transfers to the job.
Supporting Civil-Military Information Integration in Military Operations Other than War Paul Smart, Alistair Russell and Nigel Shadbolt
Semantic Network as Continuous System Technical University of Košice doc. Ing. Kristína Machová, PhD. Ing. Stanislav Dvorščák WIKT 2010.
1 Open Ontology Repository: Architecture and Interfaces Ken Baclawski Northeastern University 1.
IST Programme - Key Action III Semantic Web Technologies in IST Key Action III (Multimedia Content and Tools) Hans-Georg Stork CEC DG INFSO/D5
Jan 9, 2004 Symposium on Best Practice LSA, Boston, MA 1 Comparability of language data and analysis Using an ontology for linguistics Scott Farrar, U.
Evaluating Semantic Metadata without the Presence of a Gold Standard Yuangui Lei, Andriy Nikolov, Victoria Uren, Enrico Motta Knowledge Media Institute,
Benchmarking ontology-based annotation tools for the Semantic Web Diana Maynard University of Sheffield, UK.
© Geodise Project, University of Southampton, Knowledge Management in Geodise Geodise Knowledge Management Team Barry Tao, Colin Puleston, Liming.
From Domain Ontologies to Modeling Ontologies to Executable Simulation Models Gregory A. Silver Osama M. Al-Haj Hassan John A. Miller University of Georgia.
Group A Next Generation Information Access Group.
©2003 Paula Matuszek CSC 9010: Text Mining Applications Dr. Paula Matuszek (610)
Component Generation Technology for Semantic Tool Integration 1 Gabor Karsai and Jeff Gray Institute for Software Integrated Systems Vanderbilt University.
CREAM: Semantic annotation system May 24, 2013 Hee-gook Jun.
PHS / Department of General Practice Royal College of Surgeons in Ireland Coláiste Ríoga na Máinleá in Éirinn Knowledge representation in TRANSFoRm AMIA.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
1 MedAT: Medical Resources Annotation Tool Monika Žáková *, Olga Štěpánková *, Taťána Maříková * Department of Cybernetics, CTU Prague Institute of Biology.
Semantic web Bootstrapping & Annotation Hassan Sayyadi Semantic web research laboratory Computer department Sharif university of.
1 Open Ontology Repository initiative - Planning Meeting - Thu Co-conveners: PeterYim, LeoObrst & MikeDean ref.:
MIT Artificial Intelligence Laboratory — Research Directions The START Information Access System Boris Katz
Co-funded by the European Union Semantic CMS Community Reference Architecture for Semantic CMS Copyright IKS Consortium 1 Lecturer Organization Date of.
The Semantic Web. What is the Semantic Web? The Semantic Web is an extension of the current Web in which information is given well-defined meaning, enabling.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
Characterizing Knowledge on the Semantic Web with Watson Mathieu d’Aquin, Claudio Baldassarre, Laurian Gridinoc, Sofia Angeletou, Marta Sabou, Enrico Motta.
Instance Discovery and Schema Matching With Applications to Biological Deep Web Data Integration Tantan Liu, Fan Wang, Gagan Agrawal {liut, wangfa,
ArrayExpress Ugis Sarkans EMBL - EBI
Encoding Extraction as Inferences
Lecture #11: Ontology Engineering Dr. Bhavani Thuraisingham
Knowledge Based Workflow Building Architecture
Probabilistic Databases
Presentation transcript:

Developing Semantic Web Sites: Results and Lessons Learnt Enrico Motta, Yuangui Lei, Martin Dzbor, Vanessa Lopez, John Domingue, Jianhan Zhu, Liliana Cabral, Alex Goncalves, Victoria Uren

Motivation for KMi Sem Web Key Objective To generate a live, declarative representation of what happens in KMi, which can support smart queries and the specification of intelligent services producing smart inferences on the basis of this data Initial version was ready in 1998 PlanetOnto System (95-98)

Relates-event PeopleProjectOrganizationTechnology Event Story

Story Database NewsBoy NewsHound Modelling Language (OCML) Planet KB KA Tool Query Interface Planet Ontology Web Browser WebOnto Architecture of Planet-Onto

Story Database NewsBoy NewsHound Modelling Language (OCML) Planet KB KA Tool Query Interface Planet Ontology Web Browser WebOnto Architecture of Planet-Onto

Key Criteria for Sem Web Site Emphasis on Automatic KA Fully automated generation of information No knowledge capture bottleneck Manual annotation is welcome but should not be a core part of the process Manual annotation should not require sophisticated KR skills Ideally manual annotation should take place through side effects generating from normal work activities Architecture Keep the semantic layer separated (and to some extent independent) from the actual web site Interoperability Semantic Web Site ought to be open Semantic representation publicly available to any reasoning engine who wants to use the information

DBs Mapping Specs KMi Semantic Web Site Docs XML mark-up Mapping Engine Domain ontology Raw KB Data Verification Engine KB Source DataIntegration LayerVerification LayerTarget Data Information Extraction Engine (Espotter)

Ontological Structure KMi Semantic Web KMi Ontology AKT Portal Ontology AKT Support Ontology AKT Reference Ontology Publications Projects Research Areas People Organizations Technologies News Key Categories

Data verification Finding and eliminating duplicate data Recognizing ambiguous data, e.g. finding correct person instances for names like John, Victoria Using a lexicon component to record the mappings between strings and instance names found in the previous processes Using contextual information to decide

NumberPeopleOrganizationsProjectsResearch Area Total Manual data ESpotter finds ESpotter Recall-rate Initial Evaluation PeopleOrganizatio ns ProjectsResearch AreaTotal Total (discovered) Wrong Precision rate Recall Precision

KMi Semantic Web

So What? At a basic level, the architecture works Automatic generation is key Services still limited Developing interesting services requires non trivial effort Brittleness is a problem You rapidly reach the boundaries of the knowledge held in KMi resources and performance decreases Badly needs integration with other similar resources No API. Data available only as sources

What should happen next Integration with other similar activities Hence this workshop…. Ability to bring in knowledge expressed in other ontologies Need for standardised APIs/knowledge servers Develop mechanisms for semantic annotation by side-effect Improve text mining technology to improve both the quantity and the quality of the knowledge Develop more value-adding services

Services defined for a particular Class in a particular Ontology are available to any system who asks for them Intg. with Sem Web Services