SemSearch: A Search Engine for the Semantic Web Yuangui Lei, Victoria Uren, Enrico Motta Knowledge Media Institute The Open University EKAW 2006 Presented.

Slides:

Advertisements

Similar presentations

Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.

Advertisements

Multilinguality & Semantic Search Eelco Mossel (University of Hamburg) Review Meeting, January 2008, Zürich.

TU e technische universiteit eindhoven / department of mathematics and computer science Modeling User Input and Hypermedia Dynamics in Hera Databases and.

CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.

Processing XML Keyword Search by Constructing Effective Structured Queries Jianxin Li, Chengfei Liu, Rui Zhou and Bo Ning Swinburne University of Technology,

Crawling, Ranking and Indexing. Organizing the Web The Web is big. Really big. –Over 3 billion pages, just in the indexable Web The Web is dynamic Problems:

Key-word Driven Automation Framework Shiva Kumar Soumya Dalvi May 25, 2007.

OntoBlog: Informal Knowledge Management by Semantic Blogging Aman Shakya 1, Vilas Wuwongse 2, Hideaki Takeda 1, Ikki Ohmukai 1 1 National Institute of.

Information Retrieval in Practice

Search Engines and Information Retrieval

Basic IR: Queries Query is statement of user’s information need. Index is designed to map queries to likely to be relevant documents. Query type, content,

Chapter 2 Data Models Database Systems: Design, Implementation, and Management, Seventh Edition, Rob and Coronel.

Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.

Semantic Search Jiawei Rong Authors Semantic Search, in Proc. Of WWW Author R. Guhua (IBM) Rob McCool (Stanford University) Eric Miller.

Visual Web Information Extraction With Lixto Robert Baumgartner Sergio Flesca Georg Gottlob.

1 Draft of a Matchmaking Service Chuang liu. 2 Matchmaking Service Matchmaking Service is a service to help service providers to advertising their service.

Watson Supporting Next Generation Semantic Web Applications Mathieu d’Aquin, Claudio Baldassarre, Laurian Gridinoc, Marta Sabou, Sofia Angeletou, Enrico.

Information Retrieval in Practice

By : Vanessa López, Enrico Motta Knowledge Media Institute. Open University Ontology-driven question answering in: AQUALog 9 th International Conference.

Shared Ontology for Knowledge Management Atanas Kiryakov, Borislav Popov, Ilian Kitchukov, and Krasimir Angelov Meher Shaikh.

ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.

Information retrieval Finding relevant data using irrelevant keys Example: database of photographic images sorted by number, date. DBMS: Well structured.

Overview of Search Engines

Improving Data Discovery in Metadata Repositories through Semantic Search Chad Berkley 1, Shawn Bowers 2, Matt Jones 1, Mark Schildhauer 1, Josh Madin.

CS 586 – Distributed Multimedia Information Management Prof. Dennis McLeod.

Managing Large RDF Graphs (Infinite Graph) Vaibhav Khadilkar Department of Computer Science, The University of Texas at Dallas FEARLESS engineering.

Search Engines and Information Retrieval Chapter 1.

 Copyright 2005 Digital Enterprise Research Institute. All rights reserved. Towards Translating between XML and WSML based on mappings between.

Survey of Semantic Annotation Platforms

Provenance Metadata for Shared Product Model Databases Etiel Petrinja, Vlado Stankovski & Žiga Turk University of Ljubljana Faculty of Civil and Geodetic.

1 SAMT’08 Semantic-driven multimedia retrieval with the MPEG Query Format Ruben Tous and Jaime Delgado Distributed Multimedia Applications Group (DMAG)

Database Support for Semantic Web Masoud Taghinezhad Omran Sharif University of Technology Computer Engineering Department Fall.

PART IV: REPRESENTING, EXPLAINING, AND PROCESSING ALIGNMENTS & PART V: CONCLUSIONS Ontology Matching Jerome Euzenat and Pavel Shvaiko.

Ontology-Driven Automatic Entity Disambiguation in Unstructured Text Jed Hassell.

Querying Structured Text in an XML Database By Xuemei Luo.

©2003 Paula Matuszek CSC 9010: Text Mining Applications Document Summarization Dr. Paula Matuszek (610)

SPARQL Query Graph Model (How to improve query evaluation?) Ralf Heese and Olaf Hartig Humboldt-Universität zu Berlin.

Q2Semantic: A Lightweight Keyword Interface to Semantic Search Haofen Wang 1, Kang Zhang 1, Qiaoling Liu 1, Thanh Tran 2, and Yong Yu 1 1 Apex Lab, Shanghai.

GUIDED BY DR. A. J. AGRAWAL Search Engine By Chetan R. Rathod.

Evaluating Semantic Metadata without the Presence of a Gold Standard Yuangui Lei, Andriy Nikolov, Victoria Uren, Enrico Motta Knowledge Media Institute,

Oracle Database 11g Semantics Overview Xavier Lopez, Ph.D., Dir. Of Product Mgt., Spatial & Semantic Technologies Souripriya Das, Ph.D., Consultant Member.

Efficient RDF Storage and Retrieval in Jena2 Written by: Kevin Wilkinson, Craig Sayers, Harumi Kuno, Dave Reynolds Presented by: Umer Fareed 파리드.

Using Several Ontologies for Describing Audio-Visual Documents: A Case Study in the Medical Domain Sunday 29 th of May, 2005 Antoine Isaac 1 & Raphaël.

BioRAT: Extracting Biological Information from Full-length Papers David P.A. Corney, Bernard F. Buxton, William B. Langdon and David T. Jones Bioinformatics.

Semantic Technologies and Application to Climate Data M. Benno Blumenthal IRI/Columbia University CDW /04-01.

VLDB2005 CMS-ToPSS: Efficient Dissemination of RSS Documents Milenko Petrovic Haifeng Liu Hans-Arno Jacobsen University of Toronto.

ESIP Semantic Web Products and Services ‘triples’ “tutorial” aka sausage making ESIP SW Cluster, Jan ed.

1 Information Retrieval LECTURE 1 : Introduction.

Natural Language Interfaces to Ontologies Danica Damljanović

Triple Storage. Copyright  2006 by CEBT Triple(RDF) Storages  A triple store is designed to store and retrieve identities that are constructed from.

Co-funded by the European Union Semantic CMS Community Reference Architecture for Semantic CMS Copyright IKS Consortium 1 Lecturer Organization Date of.

A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.

Multilingual Information Retrieval using GHSOM Hsin-Chang Yang Associate Professor Department of Information Management National University of Kaohsiung.

The Development of a search engine & Comparison according to algorithms Sung-soo Kim The final report.

Sesame A generic architecture for storing and querying RDF and RDFs Written by Jeen Broekstra, Arjohn Kampman Summarized by Gihyun Gong.

Ontology Technology applied to Catalogues Paul Kopp.

GoRelations: an Intuitive Query System for DBPedia Lushan Han and Tim Finin 15 November 2011

PAIR project progress report Yi-Ting Chou Shui-Lung Chuang Xuanhui Wang.

A Visual Web Query System for NeuronBank Ontology Weiling Li, Rajshekhar Sunderraman, and Paul Katz Georgia State University, Atlanta, GA.

NEDA ALIPANAH, MARIA ADELA GRANDO DBMI 11/19/2012.

INHA UNIVERSITY, KOREA Rainer Simon Austrian Institute of Technology.

Information Retrieval in Practice

Web Service Modeling Ontology (WSMO)

Semantic Database Builder

Exploring Scholarly Data with Rexplore

Magnet & /facet Zheng Liang

Semantic Markup for Semantic Web Tools:

CS246: Information Retrieval

Information Retrieval and Web Design

Information Retrieval and Web Design

Presentation transcript:

SemSearch: A Search Engine for the Semantic Web Yuangui Lei, Victoria Uren, Enrico Motta Knowledge Media Institute The Open University EKAW 2006 Presented by Jungyeon, Yang

Copyright  2008 by CEBT Outline  Research background  SemSearch overview  Query interface  Search process  Implementation & examples  Conclusions

Copyright  2008 by CEBT Research background  Semantic search: extending traditional search with the semantic web technology Exploiting the explicit meaning of documents (i.e., ontology-based metadata)  Current semantic search tools Form-based, e.g., SHOE, Magnet QA-based, e.g., AquaLog, ORAKEL Keyword-based, e.g., TAP, Squiggle, DOSE

Copyright  2008 by CEBT Support for ordinary end users  Form-based tools Forms are intuitive Issues: knowledge overhead; scalability  QA-based tools Easy to use Issue: heavy NLP.  Keyword-based tools Easy to post queries; quick response Issue: typically one keyword only; general knowledge of the problem domain required

Copyright  2008 by CEBT The goal of our search engine  Hide the complexity of semantic search from end users: Low barrier to access: easy to post queries – Avoiding the form-based routine Dealing with relatively complex queries – Supporting multiple keywords Precise and self-explanatory results: – Results satisfy user queries – Results are easy to understand Quick response – Avoiding linguistic processing

Copyright  2008 by CEBT SemSearch Architecture Google-like User Interface Layer Semantic Query Layer Formal Query Language Layer (SPARQL, SERQL, etc.) Semantic Data Layer End users  Semantic entity indexing engine  Semantic entity search engine  Formal query construction engine  Query engine  Ranking engine  Google-like query interface Text Search Layer

Copyright  2008 by CEBT The Google-like query interface  Extending the traditional keyword search languages by allowing the specification of: The queried subject (the type of expected search results) The combination of keywords  Three operations are used: Operator “:” captures the query subject “and”/”or” specifies the combination of keywords  Query formats: One keyword: finding entities that have relations with the keyword match Multiple keywords: “subject:keyword1 and/or keyword2 and/or keyword3”, e.g., “ ”,  Advantages: More flexible than form-based query interface More powerful than state-of-art keyword-based semantic search interfaces

Copyright  2008 by CEBT The search process  Step1: making sense of the user queries  Step2: translating user queries into formal queries  Step3: Querying the back-end semantic data repository  Step4: Ranking the querying results

Copyright  2008 by CEBT Making sense of user queries  Finding out the semantic meaning of keywords Class, (e.g., the keyword “phd students”) Relation, (e.g., “author”) Instance, (e.g., “Enrico”, ”KMi director”)  Method: text search labels (rdfs:label) Short literals also used in the case of instances matching – When searching for “KMi director”, the instances can be picked up.  Two components in the search engine The semantic entity index engine The semantic entity search engine

Copyright  2008 by CEBT Translating user queries into formal queries  The search engine takes as input the semantic matches of user search terms  The search engine takes outputs an appropriate formal query according to the semantic meanings of keywords  One user query  Each keyword  multiple matches  SEARCH ENGINE  multiple formal queries.

Copyright  2008 by CEBT Simple user queries  There are only two keywords involved:  Fixed number of combination types Subject matchKeyword matchExample Class Property Instance InstanceProperty Instance PropertyInstance Property The SeRQL query templates are defined

Copyright  2008 by CEBT select {Is}, {R}, {Ik} from {Is} rdf:type {Cs}, {Ik} rdf:type {Ck}, {Is} R {Ik} union select {Is}, {R}, {Ik} from {Is} rdf:type {Cs}, {Ik} rdf:type {Ck}, {Ik} R {Is} A template example  Pattern: Subject -> Class Cs; Keyword -> Class Ck  Results: associated with exploratory links.  Example: news stories about phd students  A simplified template in Sesame SeRQL:

Copyright  2008 by CEBT Complex user queries   Instances of the subject which either have relations with all the keywords or have relations with some of the keywords.  Operational problem the number of combination gets big when there are many keywords involved and there are lots of matches for each keyword.  Rules for combination reduction: Only considering the subject keyword as class entities Choosing the closest matches to the keyword as possible Choosing the most specific class match among the class matches.

Copyright  2008 by CEBT Query construction  In SeRQL Three building blocks – Head block: what needs to be retrieved, i.e., – Body block: how to retrieve the triples – Condition block: conditions need to be satisfied Union block : in order to cover bidirectional relations SELECT DISTINCT label(ArtefactTitle), MuseumName FROM {Artefact} arts:created_by {} arts:first_name {"Rembrandt"}, {Artefact} arts:exhibited {} dc:title {MuseumName}, {Artefact} dc:title {ArtefactTitle} WHERE isLiteral(ArtefactTitle) AND lang(ArtefactTitle) = "en" AND label(ArtefactTitle) LIKE "*night*"

Copyright  2008 by CEBT Query construction algorithm No Adding query blocks for class-property relations retrieval Yes Adding query blocks for class-class relations retrieval Yes Adding blocks for class-instance relations retrieval Has keyword match? Yes Initializing the query blocks Composing queries using the blocks No Is class? Is property? Is instance? Yes No

Copyright  2008 by CEBT Simple query example

Copyright  2008 by CEBT Refinement support

Copyright  2008 by CEBT Complex query example

Copyright  2008 by CEBT Conclusions  A keyword-based semantic search engine has been developed Google-like query interface Supporting relatively complex queries Providing relatively quick response

Copyright  2008 by CEBT Opinions  Pros Google-like query interface (intuitive) Supporting relatively complex queries  Cons Limitation of the target data form. (RDF) Ranking Simple semantic matching  Issues Finding out the semantic meaning of keyword Storage modeling Strategy of the semantic match between keyword and semantic entity