Semantic Search: different meanings. Semantic search: different meanings Definition 1: Semantic search as the problem of searching documents beyond the.

Slides:



Advertisements
Similar presentations
Taxonomy & Ontology Impact on Search Infrastructure John R. McGrath Sr. Director, Fast Search & Transfer.
Advertisements

Retrieval of Information from Distributed Databases By Ananth Anandhakrishnan.
MICRODATA IN HTML 5.0 Technologies for Web Application Development Martin Nečaský Department of Software Engineering, Faculty of Mathematics and Physics,
Hermes: News Personalization Using Semantic Web Technologies
Making the Web searchable, or the Future of Web Search Peter Mika Yahoo! Research Barcelona.
Using Watson for Building Intelligent Applications in E-learning Mathieu d’Aquin The Knowledge Media Institute, The Open University
Using the Semantic Web Mathieu d’Aquin Knowledge Media Institute, the Open University
Research topics Semantic Web - Spring 2007 Computer Engineering Department Sharif University of Technology.
Human Language Technologies. Issue Corporate data stores contain mostly natural language materials. Knowledge Management systems utilize rich semantic.
 Copyright 2005 Digital Enterprise Research Institute. All rights reserved. 1 The Architecture of a Large-Scale Web Search and Query Engine.
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
Exploiting the Semantic Web: Next Generation Semantic Web Applications in KMi Watson, PowerMagpie, PowerAqua, … Mathieu d’Aquin Laurian Gridinoc Vanessa.
March 17, 2008SAC WT Hermes: a Semantic Web-Based News Decision Support System* Flavius Frasincar Erasmus University Rotterdam.
Watson Supporting Next Generation Semantic Web Applications Mathieu d’Aquin, Claudio Baldassarre, Laurian Gridinoc, Marta Sabou, Sofia Angeletou, Enrico.
Shared Ontology for Knowledge Management Atanas Kiryakov, Borislav Popov, Ilian Kitchukov, and Krasimir Angelov Meher Shaikh.
LINKED DATA COMS E6125 Prof. Gail Kaiser Presented By : Mandar Mohe ( msm2181 )
The Web of Linked Data Information Universe Seongmin Lim Dept. of Industrial Engineering Seoul National University.
IST NeOn-project.org The Semantic Web is growing… #SW Pages Lee, J., Goodwin, R. (2004) The Semantic.
Research Problems in Semantic Web Search Varish Mulwad ____________________________ 1.
Behshid Behkamal Ferdowsi University of Mashhad Web Technology Lab.
Samad Paydar Web Technology Laboratory Computer Engineering Department Ferdowsi University of Mashhad 1389/11/20 An Introduction to the Semantic Web.
Enhance legal retrieval applications with an automatically induced knowledge base Ka Kan Lo.
Cloud based linked data platform for Structural Engineering Experiment Xiaohui Zhang
CSE 428 Semantic Web Topics Introduction Jeff Heflin Lehigh University.
Result presentation. Search Interface Input and output functionality – helping the user to formulate complex queries – presenting the results in an intelligent.
Semantic Web outlook and trends May The Past 24 Odd Years 1984 Lenat’s Cyc vision 1989 TBL’s Web vision 1991 DARPA Knowledge Sharing Effort 1996.
Semantic Web Technologies ufiekg-20-2 | data, schemas & applications | lecture 21 original presentation by: Dr Rob Stephens
Shared innovation Linking Distributed Data across the Web Dr Tom Heath Researcher, Platform Division Talis Information Ltd t
A Simple Unsupervised Query Categorizer for Web Search Engines Prashant Ullegaddi and Vasudeva Varma Search and Information Extraction Lab Language Technologies.
PAUL ALEXANDRU CHIRITA STEFANIA COSTACHE SIEGFRIED HANDSCHUH WOLFGANG NEJDL 1* L3S RESEARCH CENTER 2* NATIONAL UNIVERSITY OF IRELAND PROCEEDINGS OF THE.
Thanks to Bill Arms, Marti Hearst Documents. Last time Size of information –Continues to grow IR an old field, goes back to the ‘40s IR iterative process.
Samad Paydar WTLab Research Group Ferdowsi University of Mashhad An Introduction to Linked Data, Its Applications and Challanges.
Linked-data and the Internet of Things Payam Barnaghi Centre for Communication Systems Research University of Surrey March 2012.
Introduction to the Semantic Web. Questions What is the Semantic Web? Why do we want it? How will we do it? Who will do it? When will it be done?
CSM06 Information Retrieval Lecture 1a – Introduction Dr Andrew Salway
Keyword Query Routing.
Problems in Semantic Search Krishnamurthy Viswanathan and Varish Mulwad {krishna3, varish1} AT umbc DOT edu 1.
1 Linked Open Innsbruck STI Innsbruck.
Microsearch and SearchMonkey Interfaces for Semantic Search Peter Mika Researcher, Data Architect Yahoo! Research.
You sexy beast. Ok, inappropriate. How about: Web of links to Web of Meaning Hello Semantic Web!
Semantic Enhancement: Key to Massive and Heterogeneous Data Pools Violeta Damjanovic, Thomas Kurz, Rupert Westenthaler, Wernher Behrendt, Andreas Gruber,
Majid Sazvar Knowledge Engineering Research Group Ferdowsi University of Mashhad Semantic Web Reasoning.
Trustworthy Semantic Webs Dr. Bhavani Thuraisingham The University of Texas at Dallas Lecture #4 Vision for Semantic Web.
Working with Ontologies Introduction to DOGMA and related research.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
 Structured Data An Introduction to Semantic Web “It is very hard for search engines to understand the structure and semantics of data embedded in an.
Scalable Hybrid Keyword Search on Distributed Database Jungkee Kim Florida State University Community Grids Laboratory, Indiana University Workshop on.
Conceptual structures in modern information retrieval Claudio Carpineto Fondazione Ugo Bordoni
KAnOE: Research Centre for Knowledge Analytics and Ontological Engineering Managing Semantic Data NACLIN-2014, 10 Dec 2014 Dr. Kavi Mahesh Dean of Research,
Discovering libraries’ gold through collection-level descriptions ELAG 2014, Bath Valentine Charles Data specialist.
Toward Semantic Search: RDFa based facet browser Jin Guang Zheng Tetherless World Constellation.
And the Watson Plugin for the NeOn Toolkit. IST NeOn-project.org The Semantic Web is growing… #SW Pages.
Introduction to the Semantic Web Jeff Heflin Lehigh University.
Yahoo! BOSS Open up Yahoo!’s Search data via web services Developer & Custom Tracks Big Goal – If you’re in a vertical and you perform a search, you should.
© Copyright 2015 STI INNSBRUCK PlanetData D2.7 Recommendations for contextual data publishing Ioan Toma.
Toward Entity Retrieval over Structured and Text Data Mayssam Sayyadian, Azadeh Shakery, AnHai Doan, ChengXiang Zhai Department of Computer Science University.
GoRelations: an Intuitive Query System for DBPedia Lushan Han and Tim Finin 15 November 2011
Shared innovation Linking Distributed Data across the Web Dr Tom Heath Researcher, Platform Division Talis Information Ltd t
RDFa How and Why Ralph R. Swick World Wide Web Consortium
Cloud based linked data platform for Structural Engineering Experiment
Lecture #11: Ontology Engineering Dr. Bhavani Thuraisingham
Federated & Meta Search
Web IR: Recent Trends; Future of Web Search
The State of the Semantic Web
DBpedia 2014 Liang Zheng 9.22.
LOD reference architecture
Introduction to Information Retrieval
Information Retrieval and Web Design
Information Retrieval and Web Design
Classifications and Linked Open Data Formalizing the structure and content of statistical classifications Item 9.1 Standards Working Group Luxembourg,
Presentation transcript:

Semantic Search: different meanings

Semantic search: different meanings Definition 1: Semantic search as the problem of searching documents beyond the syntactic level of matching keywords – Hakia, PowerSet, SearchMonkey Definition 2: Semantic search as the problem of searching large semantic web datasets – Watson, PowerAqua, Swoogle, Sindice, SWSE

Facing keyword-based search problems Relations between search terms: – “books about recommender systems” vs. “systems that recommend books” Polisemy – “mouth” as part of the body vs. “mouth” as part of a stream Synonymy – “movies” vs. “films” Documents about individuals where query keywords do not appear: – “English banks”, individual “Abbey”

Several attempts from the IR community Early 80s: elaboration of conceptual frameworks and their introduction in IR models – Taxonomies (categories + hierarchical relations), e.g., The ODP (Open Directory Project) – Thesaurus (categories + fixed hierarchical & associative relations), e.g., WordNet (used by linguistic approaches) – Algebraic methods such as LSA Limitations: The level of conceptualization is often shallow (specially at the level of relations)

The emergence of the SW Late 90s: introduction of ontologies as conceptual framework (classes + instances (KBs) + arbitrary semantic relations + rules) – Semantic search: Exploiting ontologies as a richer conceptualizations & formal languages to enhance traditional keyword-based document retrieval – Semantic search: Need to search this emergent and continuously growing structured information space (the Web of Data) DPLP, Geonames, DBPedia, BBC Music,... ( penData/DataSets)

The Web of Data  2007  2008  2009 Extracted from: Linked Data Tutorial (Florianópolis)

LOD cloud May 2007 Figure from [4] Facts: Focal points: DBPedia: RDFized vesion of Wikipiedia; many ingoing and outgoing links Music-related datasets Big datasets include FOAF, US Census data Size approx. 1 billion triples, 250k links Extracted from: Linked Data Tutorial (Florianópolis)

LOD cloud September 2008 Facts: More than 35 datasets interlinked Commercial players joined the cloud, e.g., BBC Companies began to publish and host dataset, e.g. OpenLink, Talis, or Garlik. Size approx. 2 billion triples, 3 million links Extracted from: Linked Data Tutorial (Florianópolis)

LOD cloud March 2009 Facts: Big part from Linking Open Drug cloud and the BIO2RDF project Notable new datasets: Freebase, OpenCalais, ACM/IEEE Size > 10 billion triples Extracted from: Linked Data Tutorial (Florianópolis)

The LOD clouds Extracted from: Linked Data Tutorial (Florianópolis)

Commercial interest by publishers

Commercial interest by search engines 2007 Yahoo! Presents Search Monkey

Commercial interest by search engines July-2008 Microsoft buys Powerset

Commercial interest by search engines April 2010 Facebook announced the use of the Open Graph protocol

Commercial interest by search engines May-2009 Google announces Rich Snippets and it’s official use of RDFa and Microformats

Commercial interest by search engines July-2010 Google buys Metaweb (the company behind FreeBase)

Commercial interest by search engines November-2010 Google announced the support of the GoodRelations vocabulary for Google Rich Snippets.

Challenges Exploiting this new information space for semantic search purposes opens new research challenges: – Scalability – Heterogeneity – Uncertainty

Scalability Effective exploitation of the linked data requires infrastructure that scales to a large and ever growing collection of interlinked data!

Heterogeneity Dbpedia:Rudi_Studer Dblp:Studer:Rudi.html SW:/en/rudi_studer Dblp:~ley/db/../author SW:Person Dbpedia:Professor SCHEMA-LEVEL DATA-LEVEL Align Reconcile, Combine Effective exploitation of the data web requires an effective mechanism for finding the relevant data sources integrating data sources combining elements from different data sources

Uncertainty Incomplete Representation of User’s Needs and content meanings –User cannot completely specify the need –The semantic information in the search space is incomplete Effective exploitation requires match user’s needs to data in an imprecise way rank the results be flexible enough to adjust to changes in constraints! “Find action films directed by some Hong Kong film director and starring Chinese martial actors”

The Search Space: different representations

The search space: different representations Unstructured search space – The Web of documents (textual and multimedia content) Structured search space – The Web of data (ontologies + Knowledge Bases) Hybrid search space – Unstructured content is enriched with metadata Embedded annotations Not embedded annotations

The unstructured search space The Web of human-understandable content. The Web of documents and links – CC License Documents Search space

Search engines

The structured search space The Web of machine understandable content. The Web of objects and relations – Creative Commons License objects Search space

Search engines

The hybrid search space Enriching documents with metadata Objects Documents How to interlink documents and data? Search space

Two ways of interlinking metadata and documents Information Extraction By relying on Web publishers – More on the section Data on the (Semantic) Web

Search engines