Towards a new generation of semantic web applications Prof. Enrico Motta, PhD Knowledge Media Institute The Open University Milton Keynes, UK.

Slides:



Advertisements
Similar presentations
…to Ontology Repositories Mathieu dAquin Knowledge Media Institute, The Open University From…
Advertisements

Opportunistic Reasoning for the Semantic Web: Adapting Reasoning to the Environment Carlos Pedrinaci Tim Smithers and Amaia Bernaras.
Lifecycle Support for Networked Ontologies And related research in KMi Mathieu dAquin and Marta Sabou And also Enrico Motta, Martin Dzbor, Lucia Sepia,
WP8: User Centred Applications Enrico Motta, Marta Sabou, Vanessa Lopez, Laurian Gridinoc, Lucia Specia Knowledge Media Institute The Open University Milton.
A Stepwise Modeling Approach for Individual Media Semantics Annett Mitschick, Klaus Meißner TU Dresden, Department of Computer Science, Multimedia Technology.
SIG2: Ontology Language Standards WebOnt Briefing Ian Horrocks University of Manchester, UK.
1 Publishing Linked Sensor Data Semantic Sensor Networks Workshop 2010 In conjunction with the 9th International Semantic Web Conference (ISWC 2010), 7-11.
Using the Semantic Web to Construct an Ontology- Based Repository for Software Patterns Scott Henninger Computer Science and Engineering University of.
Using Watson for Building Intelligent Applications in E-learning Mathieu d’Aquin The Knowledge Media Institute, The Open University
Using the Semantic Web Mathieu d’Aquin Knowledge Media Institute, the Open University
OntoBlog: Informal Knowledge Management by Semantic Blogging Aman Shakya 1, Vilas Wuwongse 2, Hideaki Takeda 1, Ikki Ohmukai 1 1 National Institute of.
Research topics Semantic Web - Spring 2007 Computer Engineering Department Sharif University of Technology.
Web3.0 and Language Resources Marta Sabou Knowledge Media Institute (KMi) The Open University Exploiting Semantic Web Ontologies: An Experimental Report.
Exploiting large scale web semantics to build end user applications Enrico Motta Professor of Knowledge Technologies Knowledge Media Institute The Open.
Exploiting the Semantic Web: Next Generation Semantic Web Applications in KMi Watson, PowerMagpie, PowerAqua, … Mathieu d’Aquin Laurian Gridinoc Vanessa.
Next Generation Semantic Web Applications Prof. Enrico Motta Director, Knowledge Media Institute The Open University Milton Keynes, UK.
Sharing Knowledge in Adaptive Learning Systems Miloš Kravčík Dragan Gašević Fraunhofer FIT, GermanySimon Fraser University, Canada
Developing Semantic Web Sites: Results and Lessons Learnt Enrico Motta, Yuangui Lei, Martin Dzbor, Vanessa Lopez, John Domingue, Jianhan Zhu, Liliana Cabral,
Ontology-Based Applications in the Age of the Semantic Web Prof Enrico Motta, PhD Knowledge Media Institute The Open University Milton Keynes, UK.
Watson Supporting Next Generation Semantic Web Applications Mathieu d’Aquin, Claudio Baldassarre, Laurian Gridinoc, Marta Sabou, Sofia Angeletou, Enrico.
IST NeOn-project.org The Semantic Web is growing… #SW Pages Lee, J., Goodwin, R. (2004) The Semantic.
Research Problems in Semantic Web Search Varish Mulwad ____________________________ 1.
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
Exploiting Large-Scale Semantics on the Web Prof. Enrico Motta Director, Knowledge Media Institute The Open University Milton Keynes, UK.
The Semantic Web Prof. Enrico Motta Knowledge Media Institute The Open University.
Characterizing Semantic Web Applications Prof. Enrico Motta Director, Knowledge Media Institute The Open University Milton Keynes, UK.
Alternatives to Metadata IMT 589 February 25, 2006.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
OMAP: An Implemented Framework for Automatically Aligning OWL Ontologies SWAP, December, 2005 Raphaël Troncy, Umberto Straccia ISTI-CNR
Semantic Web outlook and trends May The Past 24 Odd Years 1984 Lenat’s Cyc vision 1989 TBL’s Web vision 1991 DARPA Knowledge Sharing Effort 1996.
An Integrated Approach to Extracting Ontological Structures from Folksonomies Huairen Lin, Joseph Davis, Ying Zhou ESWC 2009 Hyewon Lim October 9 th, 2009.
Automatic Lexical Annotation Applied to the SCARLET Ontology Matcher Laura Po and Sonia Bergamaschi DII, University of Modena and Reggio Emilia, Italy.
Knowledge representation
Semantic Search: different meanings. Semantic search: different meanings Definition 1: Semantic search as the problem of searching documents beyond the.
A Simple Unsupervised Query Categorizer for Web Search Engines Prashant Ullegaddi and Vasudeva Varma Search and Information Extraction Lab Language Technologies.
Scalable Metadata Definition Frameworks Raymond Plante NCSA/NVO Toward an International Virtual Observatory How do we encourage a smooth evolution of metadata.
SemSearch: A Search Engine for the Semantic Web Yuangui Lei, Victoria Uren, Enrico Motta Knowledge Media Institute The Open University EKAW 2006 Presented.
Linked-data and the Internet of Things Payam Barnaghi Centre for Communication Systems Research University of Surrey March 2012.
© Paul Buitelaar – November 2007, Busan, South-Korea Evaluating Ontology Search Towards Benchmarking in Ontology Search Paul Buitelaar, Thomas.
Towards an ecosystem of data and ontologies Mathieu d’Aquin and Enrico Motta Knowledge Media Institute The Open University.
Using TREC for cross-comparison between classic IR and ontology-based search models at a Web scale Miriam Fernández 1, Vanessa López 2, Marta Sabou 2,
RCDL Conference, Petrozavodsk, Russia Context-Based Retrieval in Digital Libraries: Approach and Technological Framework Kurt Sandkuhl, Alexander Smirnov,
BAA - Big Mechanism using SIRA Technology Chuck Rehberg CTO at Trigent Software and Chief Scientist at Semantic Insights™
Knowledge Representation of Statistic Domain For CBR Application Supervisor : Dr. Aslina Saad Dr. Mashitoh Hashim PM Dr. Nor Hasbiah Ubaidullah.
Problems in Semantic Search Krishnamurthy Viswanathan and Varish Mulwad {krishna3, varish1} AT umbc DOT edu 1.
Evaluating Semantic Metadata without the Presence of a Gold Standard Yuangui Lei, Andriy Nikolov, Victoria Uren, Enrico Motta Knowledge Media Institute,
Towards Distributed Information Retrieval in the Semantic Web: Query Reformulation Using the Framework Wednesday 14 th of June, 2006.
Metadata Schema for CERIF Andrei Lopatenko Vienna University of Technology
Aligner automatiquement des ontologies avec Tuesday 23 rd of January, 2007 Rapha ë l Troncy.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Issues in Ontology-based Information integration By Zhan Cui, Dean Jones and Paul O’Brien.
DANIELA KOLAROVA INSTITUTE OF INFORMATION TECHNOLOGIES, BAS Multimedia Semantics and the Semantic Web.
Information Integration 15 th Meeting Course Name: Business Intelligence Year: 2009.
Achieving Semantic Interoperability at the World Bank Designing the Information Architecture and Programmatically Processing Information Denise Bedford.
And the Watson Plugin for the NeOn Toolkit. IST NeOn-project.org The Semantic Web is growing… #SW Pages.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
Artificial Intelligence: Research and Collaborative Possibilities a presentation by: Dr. Ernest L. McDuffie, Assistant Professor Department of Computer.
Characterizing Knowledge on the Semantic Web with Watson Mathieu d’Aquin, Claudio Baldassarre, Laurian Gridinoc, Sofia Angeletou, Marta Sabou, Enrico Motta.
NeOn Components for Ontology Sharing and Reuse Mathieu d’Aquin (and the NeOn Consortium) KMi, the Open Univeristy, UK
Semantic Data Extraction for B2B Integration Syntactic-to-Semantic Middleware Bruno Silva 1, Jorge Cardoso 2 1 2
Instance Discovery and Schema Matching With Applications to Biological Deep Web Data Integration Tantan Liu, Fan Wang, Gagan Agrawal {liut, wangfa,
Semantic search-based image annotation Petra Budíková, FI MU CEMI meeting, Plzeň,
GoRelations: an Intuitive Query System for DBPedia Lushan Han and Tim Finin 15 November 2011
The Semantic Web By: Maulik Parikh.
A Research Programme for the Semantic Web
Exploiting Large Scale Web Semantics
Ontology Evolution: A Methodological Overview
CSc4730/6730 Scientific Visualization
Language Technologies and the Semantic Web: An Essential Relationship.
Semantic Markup for Semantic Web Tools:
Presentation transcript:

Towards a new generation of semantic web applications Prof. Enrico Motta, PhD Knowledge Media Institute The Open University Milton Keynes, UK

The Semantic Web A large scale, heterogenous collection of formal, machine processable, web accessible, ontology-based statements (semantic metadata) about web resources and other entities in the world, expressed in a XML-based syntax

The Semantic Web (pragmatic def.) The collection of all statements expressed in one of the following formalisms: {OWL, RDF, DAML, DAML+OIL, RDF-A…}, which can be accessed on the web

Enrico Motta kmi director enrico motta Enrico Motta <akt:hasHomePage rdf:resource=" Person Organization String Organization-Unit partOf hasAffiliation worksInOrgUnit hasJobTitle Ontology Metadata

Ontology Metadata UoD

Proposition #1 The SW today has already reached a level of scale good enough to make it a very useful source of knowledge to support intelligent applications This is unprecedented in the history of AI

So, let's have a look at the semantic web as it is today….

Charting the web

Charting the web (2)

Proposition #2 The SW may well provide a solution to one of the classic AI challenges: how to construct and manage large volumes of knowledge to construct truly intelligent problem solvers and address the brittleness of traditional KBS

Knowledge Representation Hypothesis Any mechanically embodied intelligent process will be comprised of structural ingredients that we as external observers naturally take to represent a propositional account of the knowledge that the overall process exhibits, and independent of such external semantic attribution, play a formal but causal and essential role in engendering the behaviour that manifests that knowledge Brian Smith, 1982

Intelligence as a function of possessing domain knowledge Large Body of Knowledge Intelligent Behaviour KA Bottleneck

The Knowledge Acquisition Bottleneck Large Body of Knowledge Intelligent Behaviour KA Bottleneck Knowledge

SW as Enabler of Intelligent Behaviour Intelligent Behaviour

KBS vs SW Systems Classic KBSSW Systems Representation'Clean''Dirty' SizeSmall/MediumExtra Huge Repr. SchemaHomogeneousHeterogeneous QualityHighVery Variable Degree of trustHighVery Variable

Key Paradigm Shift Classic KBSSW Systems IntelligenceA function of sophisticated, task-centric problem solving A side-effect of size and heterogeneity (Collective Intelligence)

Overall Goal Our research programme is to contribute to the development of this large-scale web of data and develop a new generation of web applications able to exploit it to provide intelligent functionalities

So, how can we exploit this emerging, large scale semantic resource? Some examples….

–Label similarity methods e.g., Full_Professor = FullProfessor Ontology Matching 0.5 –Structure similarity methods Using taxonomic/property related information

New paradigm: use of background knowledge A B Background Knowledge (external source) A’ B’ R R

External Source = One Ontology Aleksovski et al. EKAW’06 Map (anchor) terms into concepts from a richly axiomatized domain ontology Derive a mapping based on the relation of the anchor terms Assumes that a suitable (rich, large) domain ontology (DO) is available.

External Source = Web van Hage et al. ISWC’05 rely on Google and an online dictionary in the food domain to extract semantic relations between candidate terms using IR techniques AB rel + OnlineDictionary IR Methods Precision increases significantly if domain specific sources are used: 50% - Web; 75% - domain texts. Does not rely on a rich Domain Ont,

Proposal: rely on online ontologies (Semantic Web) to derive mappings ontologies are dynamically discovered and combined AB rel Semantic Web Does not rely on any pre- selected knowledge sources. M. Sabou, M. d’Aquin, E. Motta, “Using the Semantic Web as Background Knowledge in Ontology Mapping", Ontology Mapping Workshop, ISWC’06. Best Paper Award External Source = SW

How to combine online ontologies to derive mappings?

Strategy 1 - Definition Find ontologies that contain equivalent classes for A and B and use their relationship in the ontologies to derive the mapping. AB rel Semantic Web A1’A1’ B1’B1’ A2’A2’ B2’B2’ An’An’ Bn’Bn’ O1O1 O2O2 OnOn For each ontology use these rules: … These rules can be extended to take into account indirect relations between A’ and B’, e.g., between parents of A’ and B’:

Strategy 1- Examples BeefFood Semantic Web Beef RedMeat Tap Food MeatOrPoultry SR-16FAO_Agrovoc ka2.rdf Researcher AcademicStaff Semantic Web Researcher AcademicStaff ISWC SWRC

Strategy 2 - Definition Principle: If no ontologies are found that contain the two terms then combine information from multiple ontologies to find a mapping. AB rel Semantic Web A’ BC C’ B’rel Details: (1) Select all ontologies containing A’ equiv. with A (2) For each ontology containing A’: (a) if find relation between C and B. (b) if find relation between C and B. Details: (1) Select all ontologies containing A’ equiv. with A (2) For each ontology containing A’: (a) if find relation between C and B. (b) if find relation between C and B.

Strategy 2 - Examples Vs. (midlevel-onto) (Tap) Ex1: Vs.Ex2: (r1) (pizza-to-go) (SUMO) (Same results for Duck, Goose, Turkey) (r1) Vs.Ex3: (pizza-to-go) (wine.owl) (r3)

Evaluation: 1600 mappings, two teams Average precision: 70% (comparable to best in class) (derived from 180 different ontologies) Matching AGROVOC (16k terms) and NALT(41k terms) Large Scale Evaluation M. Sabou, M. d’Aquin, W.R. van Hage, E. Motta, “Improving Ontology Matching by Dynamically Exploring Online Knowledge“.

Chart 2

Proposition #3 Using the SW to provide dynamically background knowledge to tackle the Agrovoc/NALT mapping problem provides the first ever test case in which the SW, viewed as a large scale heterogeneus resource, has been successfully used to address a real- world problem

Next Generation Semantic Web Applications NG SW Application Able to exploit the SW at large –Dynamically retrieving the relevant semantic resources –Combining several, heterogeneous Ontologies

Typically use a single ontology –Usually providing a homogeneous view over heterogeneous data sources. –Limited use of existing SW data Typically closed to semantic resources Contrast with 1st generation SW Applications 1st generation SW applications are far more similar to traditional KBS (closed semantic systems) than to 'real' SW applications (open semantic systems)

It is still early days..

Current Gateway to the Semantic Web

Limitations of Swoogle Very limited quality control mechanisms –Many ontologies are duplicated –No quality information provided Limited Query/Search mechanisms –Only keyword search; no distinction between types of elements –need for more powerful query methods (e.g., ability to pose formal queries; ability to distinguish between classes and instances, etc…) Limited range of ontology ranking mechanisms –Swoogle only uses a 'popularity-based' one No support for ontology modularization

A New Gateway to the Semantic Web

Ontology Structuring Relations extends inconsistent-with

Ontology Structuring Relations extends Inconsistent-with inconsistent-with

Formal Queries and relation discovery…

Current state of Watson Initial version implemented Demo version available online –See However still rather unstable….. –Stable version to be available within 4-6 weeks Initial crawl of the SW has already produced interesting results….

Some initial figures… Lots of ontologies are in OWL FULL (3x the number of OWL Lite) … but most of the ontologies use only a very restricted sub-part of the expressivity of OWL and DAML, e.g., –only 147 go beyond ALC –role transitivity is used in only 11 ontologies…….. Almost 20% of semantic resources appear to be duplicates

Next Generation Semantic Web Applications PowerMagpie PowerAqua

Folksonomies Tags are great to organize data!!! But they don’t help much when searching…

Finding tagged images Flower Rose Lilac Flower Tulip Flowers CutFlower Tulip

Flower Rose Lilac Flower Tulip Flowers CutFlower Tulip Finding tagged images – FLOWER

What if … Rose Tulip Flower Lilac …folksonomies were semantically richer

Flower Rose Lilac Flower Tulip Flowers CutFlower Tulip Finding tagged images – FLOWER (II) Rose Tulip Flower Lilac

Learning Relations Between Tags Tags {camera, digital, photograph} {damage, flooding, hurricane, katrina, Louisiana} Clusters digital cameraphotograph takenWith Ontology NLP/Clustering Find and combine Online ontologies L.Specia, E. Motta, "Integrating Folksonomies with the Semantic Web", ESWC 2007.

In More Detail…

Examples

Key Research Tasks Overall Infrastructure –crawling, storing, structuring, querying the SW Ontology Selection –In the context of dynamically identifying the sources of knowledge relevant to the needs of a system Ontology Mapping –When integrating information from different ontologies –When mapping query/specs to ontologies Ontology Modularization –Find the sub-modules relevant to a system's query. Semantic Markup Generation –From various types of sources

New task context Key point is that NG-SW applications require solutions in a new dynamic context (run-time rather than design-time) –Example: Ontology Mapping Much current work focuses on design-time mapping of complete ontologies –Example: Ontology Selection Current work focuses on user-mediated ontology selection –Example: Ontology Modularization Current work by and large assumes that the user is in the loop

References Ontology Selection –Sabou, M., Lopez, V., Motta, E. (2006). "Ontology Selection for the Real Semantic Web: How to Cover the Queen’s Birthday Dinner?". Proceedings of EKAW 2006 Ontology Modularization –D'Aquin, M., Sabou, M., Motta, E. (2006). "Modularization: A key for the dynamic selection of relevant knowledge components". ISWC 2006 Workshop on Ontology Modularization Watson –d’Aquin, M., Sabou, M., Dzbor, M., Baldassarre, C., Gridinoc, L., Angeletou, S. and Motta, E.: "WATSON: A Gateway for the Semantic Web". Poster Session at ESWC 2007

References (2) Ontology Mapping –Lopez, V., Sabou, M., Motta, E. (2006). "Mapping the real semantic web on the fly". ISWC 2006 –Sabou, M., D'Aquin, M., Motta, E. (2006). "Using the semantic web as background knowledge for ontology mapping". ISWC 2006 Workshop on Ontology Mapping. Intg. of folksonomies and SW –L.Specia, E. Motta, "Integrating Folksonomies with the Semantic Web", ESWC 2007.

'Vision' Papers Motta, E., Sabou, M. (2006). "Next Generation Semantic Web Applications". 1st Asian Semantic Web Conference, Beijing. Motta, E., Sabou, M. (2006). "Language Technologies and the Evolution of the Semantic Web". LREC 2006, Genoa, Italy. Motta, E. (2006). "Knowledge Publishing and Access on the Semantic Web: A Socio-Technological Analysis". IEEE Intelligent Systems, Vol.21, 3, (88-90).

Conclusions SW provides an unprecedented opportunity to build a new generation of intelligent systems, able to exploit large scale, heterogeneous KBs This new class of systems is fundamentally different in many respects both from traditional KBS and even from early SW applications The size of the SW is increasing steadily and the infrastructure is getting more and more robust. These developments should enable more and more new generation SW applications to emerge within 2-3 years

Current Gateway to the Semantic Web