Research Problems in Semantic Web Search
Varish Mulwad

Presentation transcript:


Agenda
- Introduction
- Swoogle
- Swoogle’s competition
  - Sindice
  - Semantic Web Search Engine (SWSE)
  - Watson
  - Falcon
- Research problems and issues with Swoogle
- References

Introduction
[Diagram labels: Your Agent, Web, Dr. Finin’s FOAF Profile]
- Possible because data is in a machine-understandable form, such as RDF and OWL
- But how will the agent find all this data? Search engines?

Introduction
[Figure: Traditional Search Engine Results compared with Semantic Web Search Engine Results]

Swoogle
- Swoogle is a crawler-based indexing and retrieval system for the Semantic Web
- Swoogle crawls and discovers documents written in RDF and OWL
- Swoogle classifies a Semantic Web Document (SWD) as one of:
  - Semantic Web Ontology (SWO): defines new terms
  - Semantic Web Database (SWDB): makes assertions about individuals
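The SWO/SWDB split above can be illustrated with a small sketch. It assumes a document is already parsed into (subject, predicate, object) string triples and labels it by the share of typed resources that are class or property definitions. The `threshold` value, the prefixed names, and the function name `classify_swd` are assumptions for illustration, not Swoogle's actual rule.

```python
RDF_TYPE = "rdf:type"

# Types whose instances count as "new terms" (classes or properties).
DEFINING_TYPES = {
    "rdfs:Class", "owl:Class", "rdf:Property",
    "owl:ObjectProperty", "owl:DatatypeProperty",
}


def classify_swd(triples, threshold=0.5):
    """Label a document 'SWO' or 'SWDB' from its rdf:type triples.

    `threshold` is a hypothetical cut-off chosen for this sketch.
    """
    typed = [(s, o) for s, p, o in triples if p == RDF_TYPE]
    if not typed:
        # No typed resources at all: treat the document as plain data.
        return "SWDB"
    defined = sum(1 for _, o in typed if o in DEFINING_TYPES)
    return "SWO" if defined / len(typed) >= threshold else "SWDB"


# A tiny ontology: it only defines terms.
ontology = [
    ("ex:Person", RDF_TYPE, "owl:Class"),
    ("ex:knows", RDF_TYPE, "owl:ObjectProperty"),
]

# Instance data: it only makes assertions about individuals.
instance_data = [
    ("ex:alice", RDF_TYPE, "ex:Person"),
    ("ex:alice", "ex:knows", "ex:bob"),
    ("ex:bob", RDF_TYPE, "ex:Person"),
]
```

A ratio rather than a yes/no test is used here because real documents often mix a few definitions with many assertions.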

Swoogle
SWOOGLE DEMO

Swoogle Architecture

Swoogle Architecture
SWD Discovery Component
- Google crawler using the Google web service
  - Searches for file types with extensions “.rdf”, “.owl”, “.n3”
  - Google returns at most 1000 results per query
- A focused crawler
  - Crawls documents within a given website
  - Extension and focus constraints
- A Swoogle crawler
  - Jena-based crawler
  - Explores semantic links between SWDs
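The extension and focus constraints can be sketched as a URL filter. This is a minimal illustration, assuming candidate URLs are already available as strings. The function names and the `site_host` parameter are hypothetical and do not come from Swoogle.

```python
from urllib.parse import urlparse

# File extensions named on the slide as likely Semantic Web Documents.
SWD_EXTENSIONS = (".rdf", ".owl", ".n3")


def is_candidate(url, site_host=None):
    """Return True if `url` passes the extension and focus constraints.

    Extension constraint: the URL path ends in a known SWD extension.
    Focus constraint: if `site_host` is given, the URL must be on that host.
    """
    parsed = urlparse(url)
    if not parsed.path.lower().endswith(SWD_EXTENSIONS):
        return False
    return site_host is None or parsed.hostname == site_host


def select_candidates(urls, site_host=None):
    """Keep only the URLs worth fetching, preserving input order."""
    return [u for u in urls if is_candidate(u, site_host)]


urls = [
    "http://example.org/foaf.rdf",
    "http://example.org/index.html",
    "http://other.example/onto.OWL",
    "http://example.org/data.n3?x=1",
]
```

Checking the parsed path instead of the raw string keeps query strings such as `?x=1` from hiding the extension.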

Swoogle Architecture
Metadata Creation
- Basic metadata
  - Encoding: “RDF/XML”, “N-Triple”, “N3”
  - Language: RDF, RDFS, OWL, DAML+OIL
  - OWL species: OWL Lite, OWL DL, OWL Full
- Relations among SWDs
  - Reference relationships among SWDs
  - Inter-ontology relationships

Swoogle Architecture
Data analysis component
- Classification of SWD as SWO or SWDB
- Compute rank of SWD
Web-based interface
- Human user interface
- Web services using a REST interface (agent service)
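The slide does not say how the rank of an SWD is computed. As a stand-in, the sketch below runs a plain PageRank-style iteration over the reference links between documents. It should not be read as Swoogle's own ranking method. The `damping` and `iterations` values and the function name are assumptions.

```python
def rank_documents(links, damping=0.85, iterations=50):
    """PageRank-style scores for documents connected by reference links.

    `links` maps a document URL to the list of documents it refers to.
    This is a generic stand-in, not Swoogle's ranking algorithm.
    """
    docs = set(links) | {t for targets in links.values() for t in targets}
    n = len(docs)
    rank = {d: 1.0 / n for d in docs}
    for _ in range(iterations):
        # Every document gets a small base score each round.
        new_rank = {d: (1.0 - damping) / n for d in docs}
        for d in docs:
            targets = links.get(d, [])
            if targets:
                share = damping * rank[d] / len(targets)
                for t in targets:
                    new_rank[t] += share
            else:
                # Dangling document: spread its score over all documents.
                for t in docs:
                    new_rank[t] += damping * rank[d] / n
        rank = new_rank
    return rank


# Two data documents both refer to the same ontology.
links = {
    "a.rdf": ["foaf.owl"],
    "b.rdf": ["foaf.owl"],
    "foaf.owl": [],
}
ranks = rank_documents(links)
```

In this toy graph the ontology that both data documents refer to ends up with the highest score, which matches the intuition that widely used ontologies should rank first.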

Sindice
- Created at the Digital Enterprise Research Institute (DERI)
- Key features of Sindice include:
  - Sindice collects SWDs and indexes them on resource URIs, Inverse Functional Properties (IFPs) and keywords
  - Sindice uses the Hadoop parallel architecture

Sindice
- Inverse Functional Property (IFP): an OWL property characteristic that acts as a cardinality constraint, so a given value identifies at most one subject (for example, foaf:mbox)
- Sindice uses three indexes:
  - URI index
  - IFP index
  - Keyword index
- Benefit: faster retrieval of data
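The three indexes can be pictured as three inverted maps from a lookup key to the documents that mention it. The sketch below is a minimal in-memory illustration, assuming documents arrive as (subject, predicate, object) string triples. The class name, the `_is_uri` helper, and the sample mailbox are hypothetical and are not Sindice's implementation.

```python
from collections import defaultdict


def _is_uri(value):
    """Crude check used only in this sketch to tell URIs from literals."""
    return value.startswith(("http://", "https://", "mailto:"))


class TripleIndex:
    """Three inverted indexes: by URI, by (IFP, value) pair, by keyword."""

    def __init__(self, ifps):
        self.ifps = set(ifps)
        self.by_uri = defaultdict(set)
        self.by_ifp = defaultdict(set)
        self.by_keyword = defaultdict(set)

    def add(self, doc_url, triples):
        for s, p, o in triples:
            self.by_uri[s].add(doc_url)
            if p in self.ifps:
                # An IFP value identifies one resource across documents.
                self.by_ifp[(p, o)].add(doc_url)
            if _is_uri(o):
                self.by_uri[o].add(doc_url)
            else:
                for word in o.lower().split():
                    self.by_keyword[word].add(doc_url)


FOAF_MBOX = "http://xmlns.com/foaf/0.1/mbox"
FOAF_NAME = "http://xmlns.com/foaf/0.1/name"

index = TripleIndex(ifps=[FOAF_MBOX])
index.add("http://a.example/foaf.rdf", [
    ("http://a.example/#alice", FOAF_MBOX, "mailto:alice@example.org"),
    ("http://a.example/#alice", FOAF_NAME, "Alice Example"),
])
index.add("http://b.example/people.rdf", [
    ("http://b.example/#al", FOAF_MBOX, "mailto:alice@example.org"),
])
```

Looking up the (foaf:mbox, mailbox) pair returns both documents even though they use different URIs for the same person, which is the benefit an IFP index adds over a URI index alone.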

Sindice
Hadoop architecture is used in the following manner:
- Sindice employs Hadoop/Nutch to distribute the crawling job across multiple machines
- Collected data is stored in HBase, a distributed column-based store
- Large datasets are handled efficiently across the cluster using a MapReduce implementation
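The MapReduce idea can be shown in miniature without Hadoop. The sketch below runs the map, shuffle, and reduce steps in a single process to count crawled documents per host. It is not the Hadoop API, and the job and function names are assumptions chosen for illustration.

```python
from collections import defaultdict
from urllib.parse import urlparse


def map_phase(url):
    """Emit one (host, 1) pair per crawled document URL."""
    yield urlparse(url).hostname, 1


def reduce_phase(host, counts):
    """Combine all counts emitted for one host."""
    return host, sum(counts)


def run_job(urls):
    """Run map, shuffle (group by key), and reduce in a single process."""
    groups = defaultdict(list)
    for url in urls:
        for key, value in map_phase(url):
            groups[key].append(value)
    return dict(reduce_phase(k, v) for k, v in groups.items())


crawled = [
    "http://a.example/x.rdf",
    "http://a.example/y.rdf",
    "http://b.example/z.owl",
]
per_host = run_job(crawled)
```

On a cluster, the map calls and the per-key reduce calls would run on different machines. The grouping step in `run_job` is the part the framework performs between them.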

Sindice
SINDICE DEMO

SWSE
- Semantic Web Search Engine (SWSE) is another Semantic Web search engine created at the Digital Enterprise Research Institute (DERI)
- SWSE uses a “Multicrawler”, a pipelined architecture for crawling
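A pipelined crawler passes each document through a fixed sequence of stages. The sketch below chains Python generators to show the idea. The stage names (`fetch`, `detect`, `index`), the in-memory `web` dictionary standing in for the network, and the RDF detection rule are all assumptions for illustration, not SWSE's actual design.

```python
def fetch(urls, web):
    """Stage 1: yield (url, body) for every URL that can be retrieved."""
    for url in urls:
        if url in web:
            yield url, web[url]


def detect(pages):
    """Stage 2: pass on only pages that look like RDF/XML."""
    for url, body in pages:
        if "rdf:RDF" in body:
            yield url, body


def index(pages, store):
    """Stage 3: record each page in the store and yield its URL."""
    for url, body in pages:
        store[url] = len(body)
        yield url


def crawl(urls, web):
    """Wire the stages together; each page flows through one at a time."""
    store = {}
    indexed = list(index(detect(fetch(urls, web)), store))
    return indexed, store


# A stand-in for the network: URL -> document body.
web = {
    "http://a.example/foaf.rdf": "<rdf:RDF></rdf:RDF>",
    "http://a.example/index.html": "<html></html>",
}
indexed, store = crawl(
    ["http://a.example/foaf.rdf",
     "http://a.example/index.html",
     "http://a.example/missing.rdf"],
    web,
)
```

Because each stage is a generator, a page reaches the indexing stage as soon as it is fetched, without waiting for the whole URL list to finish downloading.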

Watson
- Created at the Knowledge Media Institute (KMi) at The Open University, UK
- Major design principles:
  - Considers explicit and implicit relations between ontologies
  - Ranks ontologies with a focus on quality over popularity

Watson
WATSON DEMO

Falcon
- Falcon is a Semantic Web search engine created at the Institute of Web Science in China
- Falcon allows keyword-based queries on:
  - Objects
  - Concepts
  - Documents
- Falcon performs class subsumption reasoning
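Class subsumption reasoning means that an instance of a subclass also counts as an instance of every superclass. The sketch below follows subclass links transitively to answer a class query. The class hierarchy, the individuals, and the function names are hypothetical, and this is a generic illustration rather than Falcon's implementation.

```python
def superclasses(cls, subclass_of):
    """Return every class reachable from `cls` via subclass links."""
    seen = set()
    stack = [cls]
    while stack:
        current = stack.pop()
        for parent in subclass_of.get(current, ()):
            if parent not in seen:
                seen.add(parent)
                stack.append(parent)
    return seen


def instances_of(target, types, subclass_of):
    """Individuals whose asserted class is `target` or a subclass of it."""
    return {
        ind for ind, cls in types.items()
        if cls == target or target in superclasses(cls, subclass_of)
    }


# Hypothetical hierarchy: Professor and Student are kinds of Person.
subclass_of = {
    "Professor": ["Person"],
    "Student": ["Person"],
    "Person": ["Agent"],
}
# Asserted type of each individual.
types = {"alice": "Professor", "bob": "Student", "acme": "Organization"}
```

A query for `Person` returns both `alice` and `bob` even though neither is typed as `Person` directly. The `seen` set also keeps the traversal from looping if the hierarchy contains a cycle.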

Falcon
FALCON DEMO

Summary
- Swoogle
  - Keyword-based search
  - Searches ontologies and instance data
- Others
  - Sindice
    - Indexes on URI, IFP, keywords
    - Use of Hadoop architecture
  - SWSE
    - Pipelined architecture for crawling
  - Watson
    - Implicit relations between SWDs
  - Falcon
    - Class subsumption reasoning

Issues
- Crawling
  - Swoogle’s crawler runs as a single thread on one machine
  - This limits the number of SWDs discovered and revisited
- Possible solutions
  - Use of Hadoop architecture
  - Use of Grub

Other Issues
- Crawling large structured datasets like DBpedia
- More reasoning
- More services

References
- Li Ding et al., “Swoogle: A Search and Metadata Engine for the Semantic Web”, Proceedings of the Thirteenth ACM Conference on Information and Knowledge Management, November
- P. Mika, G. Tummarello, “Web Semantics in the Clouds”, IEEE Intelligent Systems, Volume 23, Issue 5 (September 2008)
- E. Oren, R. Delbru, M. Catasta, R. Cyganiak, H. Stenzhorn, G. Tummarello, “Sindice.com: A document-oriented lookup index for open linked data”, International Journal of Metadata, Semantics and Ontologies, 3(1)
- Mathieu d’Aquin et al., “Watson: A Gateway for the Semantic Web”, Poster session of the European Semantic Web Conference, ESWC 2007
- Gong Cheng, Weiyi Ge, Honghan Wu, Yuzhong Qu, “Searching Semantic Web Objects Based on Class Hierarchies”, WWW 2008 Workshop on Linked Data on the Web, 2008

Questions?