Knowledge Enabled Information and Services Science SPARQL Query Re-writing for Spatial Datasets Using Partonomy Based Transformation Rules Prateek Jain,

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

The 20th International Conference on Software Engineering and Knowledge Engineering (SEKE2008) Department of Electrical and Computer Engineering
Natural Language Interfaces to Ontologies Danica Damljanović
A Stepwise Modeling Approach for Individual Media Semantics Annett Mitschick, Klaus Meißner TU Dresden, Department of Computer Science, Multimedia Technology.
So What Does it All Mean? Geospatial Semantics and Ontologies Dr Kristin Stock.
Automatically Annotating and Integrating Spatial Datasets Chieng-Chien Chen, Snehal Thakkar, Crail Knoblock, Cyrus Shahabi Department of Computer Science.
Dave Kolas, BBN Technologies Terra Cognita 08 Karlsruhe, Germany 10/26/08 1 Supporting Spatial Semantics with SPARQL.
Building and Analyzing Social Networks Web Data and Semantics in Social Network Applications Dr. Bhavani Thuraisingham February 15, 2013.
W3C Video on the Web Workshop December 2007, San Jose, California Video on the Semantic Sensor Web Amit Sheth Amit Sheth with Cory Henson, Prateek.
Linked Sensor Data Harshal Patni, Cory Henson, Amit P. Sheth Ohio Center of Excellence in Knowledge enabled Computing (Kno.e.sis) Wright State University,
Geographical Service: Gianluca Correndo, Manuel Salvadores, Yang Yang, Nicholas Gibbins, Nigel Shadbolt A compass for the Web of Data.
By : Vanessa López, Enrico Motta Knowledge Media Institute. Open University Ontology-driven question answering in: AQUALog 9 th International Conference.
Semantics For the Semantic Web: The Implicit, the Formal and The Powerful Amit Sheth, Cartic Ramakrishnan, Christopher Thomas CS751 Spring 2005 Presenter:
Citation Semantic Provenance: Trusted Biomedical Data Integration Spatial Semantics for Better Interoperability and Analysis: Challenges And Experiences.
Semantic Mediation & OWS 8 Glenn Guempel
1 Cui Tao PhD Dissertation Defense Ontology Generation, Information Harvesting and Semantic Annotation For Machine-Generated Web Pages.
Enhance legal retrieval applications with an automatically induced knowledge base Ka Kan Lo.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Predicting Missing Provenance Using Semantic Associations in Reservoir Engineering Jing Zhao University of Southern California Sep 19 th,
A Statistical and Schema Independent Approach to Identify Equivalent Properties on Linked Data † Kno.e.sis Center Wright State University Dayton OH, USA.
Knowledge Mediation in the WWW based on Labelled DAGs with Attached Constraints Jutta Eusterbrock WebTechnology GmbH.
Amarnath Gupta Univ. of California San Diego. An Abstract Question There is no concrete answer …but …
OMAP: An Implemented Framework for Automatically Aligning OWL Ontologies SWAP, December, 2005 Raphaël Troncy, Umberto Straccia ISTI-CNR
Copyright © 2010 Accenture All Rights Reserved. Accenture, its logo, and High Performance Delivered are trademarks of Accenture. Multiple Ontologies in.
Erasmus University Rotterdam Introduction With the vast amount of information available on the Web, there is an increasing need to structure Web data in.
Implemented Systems Presenter: Manos Karpathiotakis Extended Semantic Web Conference 2012.
1 Yolanda Gil Information Sciences InstituteJanuary 10, 2010 Requirements for caBIG Infrastructure to Support Semantic Workflows Yolanda.
An Integrated Approach to Extracting Ontological Structures from Folksonomies Huairen Lin, Joseph Davis, Ying Zhou ESWC 2009 Hyewon Lim October 9 th, 2009.
Krishnaprasad Thirunarayan, Pramod Anantharam, Cory A. Henson, and Amit P. Sheth Kno.e.sis Center, Ohio Center of Excellence on Knowledge-enabled Computing,
Knowledge Enabled Information and Services Science Extending SPARQL to Support Spatially and Temporally Related Information Prateek Jain, Amit Sheth Peter.
Ontology Alignment for Linked Open Data – ISWC2010 research track Prateek Jain Pascal Hitzler Amit Sheth Kno.e.sis Center Wright State University, Dayton,
Extracting metadata for spatially- aware information retrieval on the internet Pual Clough Presented by Ali Khodaei CS 572.
Linking Open Data with Location: Gazetteers and the Semantic Web Xavier Lopez, Director, Product Management.
School of Computing FACULTY OF ENGINEERING Developing a methodology for building small scale domain ontologies: HISO case study Ilaria Corda PhD student.
SWETO: Large-Scale Semantic Web Test-bed Ontology In Action Workshop (Banff Alberta, Canada June 21 st 2004) Boanerges Aleman-MezaBoanerges Aleman-Meza,
PERSONALIZED SEARCH Ram Nithin Baalay. Personalized Search? Search Engine: A Vital Need Next level of Intelligent Information Retrieval. Retrieval of.
UOS 1 Ontology Based Personalized Search Zhang Tao The University of Seoul.
Towards an ecosystem of data and ontologies Mathieu d’Aquin and Enrico Motta Knowledge Media Institute The Open University.
Boris Villazón-Terrazas, Ghislain Atemezing FI, UPM, EURECOM, Introduction to Linked Data.
Metadata. Generally speaking, metadata are data and information that describe and model data and information For example, a database schema is the metadata.
Dimitrios Skoutas Alkis Simitsis
Developing “Geo” Ontology Layers for Web Query Faculty of Design & Technology Conference David George, Department of Computing.
SPARQL Query Graph Model (How to improve query evaluation?) Ralf Heese and Olaf Hartig Humboldt-Universität zu Berlin.
Knowledge Representation of Statistic Domain For CBR Application Supervisor : Dr. Aslina Saad Dr. Mashitoh Hashim PM Dr. Nor Hasbiah Ubaidullah.
A Context Model based on Ontological Languages: a Proposal for Information Visualization School of Informatics Castilla-La Mancha University Ramón Hervás.
Using Several Ontologies for Describing Audio-Visual Documents: A Case Study in the Medical Domain Sunday 29 th of May, 2005 Antoine Isaac 1 & Raphaël.
U.S. Department of the Interior U.S. Geological Survey A Consideration of Geospatial Feature Formation in Linked Open Vocabularies Workshop on Linked Open.
A Systemic Approach for Effective Semantic Access to Cultural Content Ilianna Kollia, Vassilis Tzouvaras, Nasos Drosopoulos and George Stamou Presenter:
Ontology Mapping in Pervasive Computing Environment C.Y. Kong, C.L. Wang, F.C.M. Lau The University of Hong Kong.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Shridhar Bhalerao CMSC 601 Finding Implicit Relations in the Semantic Web.
Named Entity Disambiguation on an Ontology Enriched by Wikipedia Hien Thanh Nguyen 1, Tru Hoang Cao 2 1 Ton Duc Thang University, Vietnam 2 Ho Chi Minh.
Issues in Ontology-based Information integration By Zhan Cui, Dean Jones and Paul O’Brien.
Semantic web Bootstrapping & Annotation Hassan Sayyadi Semantic web research laboratory Computer department Sharif university of.
Conclusions Presenter: Manolis Koubarakis Extended Semantic Web Conference 2012.
Toward a framework for statistical data integration Ba-Lam Do, Peb Ruswono Aryan, Tuan-Dat Trinh, Peter Wetz, Elmar Kiesling, A Min Tjoa Linked Data Lab,
Semantic Web Portal: A Platform for Better Browsing and Visualizing Semantic Data Ying Ding et al. Jin Guang Zheng, Tetherless World Constellation.
Be.wi-ol.de User-friendly ontology design Nikolai Dahlem Universität Oldenburg.
Semantic Interoperability in GIS N. L. Sarda Suman Somavarapu.
WonderWeb. Ontology Infrastructure for the Semantic Web. IST WP4: Ontology Engineering Heiner Stuckenschmidt, Michel Klein Vrije Universiteit.
Ontology Technology applied to Catalogues Paul Kopp.
1 Intelligent Information System Lab., Department of Computer and Information Science, Korea University Semantic Social Network Analysis Kyunglag Kwon.
Of 24 lecture 11: ontology – mediation, merging & aligning.
Ontology Evaluation Outline Motivation Evaluation Criteria Evaluation Measures Evaluation Approaches.
Department of Geography Jeon-Young Kang · Yi Yang
Cross-Ontological Relationships
Query Rewriting Framework for Spatial Data
Associative Query Answering via Query Feature Similarity
Analyzing and Securing Social Networks
A Graph-Based Approach to Learn Semantic Descriptions of Data Sources
Presentation transcript:

Knowledge Enabled Information and Services Science SPARQL Query Re-writing for Spatial Datasets Using Partonomy Based Transformation Rules Prateek Jain, Cory Henson, Amit Sheth Peter Z. Yeh, Kunal Verma Kno.e.sis, Accenture Tech. Labs Wright State University, San Jose, CA Dayton, OH

Knowledge Enabled Information and Services Science Increased availability of spatial information

Knowledge Enabled Information and Services Science But accessing this information can be difficult

Knowledge Enabled Information and Services Science User expected to ask for this information in the “right” way

Knowledge Enabled Information and Services Science Proposed approach Automatically align conceptual mismatches between a user’s query and spatial information of interest through a set of semantic operators. Our approach will reduce the user’s burden of having to know how information of interest is structured, and hence improve accuracy and relevance of the results.

Knowledge Enabled Information and Services Science Outline Introduction Existing Mechanisms for querying RDF Data –Existing approaches –How well do they work? Proposed Approach Future Work

Knowledge Enabled Information and Services Science Why is it important? Spatial data becoming more significant day by day. Crucial for multitude of applications: – GPS – Military – Location Aware Services – weather data… Spatial Data availability on Web continuously increasing. –Sensor streams, satellite imagery –Naïve users contribute and correct spatial data too which can lead to discrepancies in data representation. E.g. Geonames, Wikimapia

Knowledge Enabled Information and Services Science What’s the problem Existing approaches only analyze spatial information and queries at the lexical and syntactic level. Mismatches are common between how a query is expressed and how information of interest is represented. –Question: “Find schools in NJ”. –Answer: Sorry, no answers found! –Reason: Only counties are in states. Natural language introduces much ambiguity for semantic relationships between entities in a query. –Find Schools in Greene County.

Knowledge Enabled Information and Services Science What needs to be done? We need to reduce users’ burden of having to know how information of interest is represented and structured in order to enable access to this information by a broad population. We need to resolve mismatches between a query and information of interest due to differences in granularity in order to improve recall of relevant information. We need to resolve ambiguous relationships between entities due to natural language in order to reduce the amount of wrong information retrieved.

Knowledge Enabled Information and Services Science Existing mechanism for querying RDF

Knowledge Enabled Information and Services Science Known approaches SPARQL Regular Expression Based Querying Approaches

Knowledge Enabled Information and Services Science Common query for testing all approaches! “Find schools located in the state of Ohio”

Knowledge Enabled Information and Services Science In a perfect scenario parent feature School Ohio

Knowledge Enabled Information and Services Science In a not so perfect scenario parent feature SchoolCounty parent feature Ohio

Knowledge Enabled Information and Services Science And finally.. parent feature County School parent feature Ohio Indiana parent Feature School Exchange students

Knowledge Enabled Information and Services Science Proposed Approach

Knowledge Enabled Information and Services Science Proposed Approach Define operators to ease writing of expressive queries by implicit usage of semantic relations between query terms and hence remove the burden of expressing named relations in a query. Define transformation rules for operators based on work by Winston’s taxonomy of part-whole relations. Rule based approach allows applicability in different domains with appropriate modifications. Partonomical Relationship Based Query Rewriting System (PARQ) implements this approach.

Knowledge Enabled Information and Services Science Architecture SELECT ?school WHERE { ?school geo:parentFeature Ohio. } SELECT ?school WHERE { ?state geo:name "Ohio“ ?state geo:parentFeatures ?county. ?county geo:parentFeature ?schools.} Transformation Rules Triple Constraints Query Variables Altered Triple Constraints Altered Query Variables User submits SPARQL Query Query Rewriting Engine Rewritten Query according to the data structure Mapping of ontology properties to Winston’s categories Meta rules for Winston’s Categories +=

Knowledge Enabled Information and Services Science Meta Rules for Winston’s Categories Transitivity –(a φ-part of b) (b φ-part of c) (a φ-part of c) –(Dayton place-part of Ohio) (Ohio place-part of US) (Dayton place-part of US) Overlap –(a place-part of b) (a place-part of b) (b overlaps c) –(Sri L. place-part of Indian Ocean) (Sri L. place-part of Bay of Bengal) (Indian Ocean overlaps with Bay of Bengal) Spatial Inclusion –(a instance of b) (c spatially included in a) (c spatially included in b) –(White House instance of Building) (Barack is in White House) (Barack is in building)

Knowledge Enabled Information and Services Science Perfect Scenario SELECT ?school WHERE { ?state geo:featureClass geo:A ?schools geo:featureClass geo:S. ?state geo:name "Ohio“ ?schools geo:parentFeature state. } SELECT ?school WHERE { ?state geo:featureClass geo:A ?schools geo:featureClass geo:S. ?state geo:name "Ohio“ ?state geo:parentFeature ?schools } Query Re-Writer

Knowledge Enabled Information and Services Science Slight and Severe Mismatch SELECT ?school WHERE { ?state geo:featureClass geo:A ?schools geo:featureClass geo:S. ?state geo:name "Ohio“. ?schools geo:parentFeature ?state. } SELECT ?school WHERE { ?state geo:featureClass geo:A ?schools geo:featureClass geo:S. ?state geo:name "Ohio“ ?school geo:parentFeature ?county. ?county geo:parentFeature ?state. } Query Re-Writer

Knowledge Enabled Information and Services Science So where do we stand with all these mechanisms.. Ease of writing ExpressivityWorks in all scenarios Schema agnostic SPARQLX√XX PSPARQL√√X√ Our Approach√√√√

Knowledge Enabled Information and Services Science Evaluation

Knowledge Enabled Information and Services Science Evaluation Evaluation performed on publicly available datasets such as Geonames and British Ordnance Survey Ontology. Utilized 120 questions from National Geographic Bee and 46 questions from trivia related to British Administrative Geography. Questions serialized into SPARQL Queries by 4 human respondents un- familiar with the ontology. Performance of PARQ compared with PSPARQL and SPARQL.

Knowledge Enabled Information and Services Science Sample Queries “In which English county, also known as "The Jurassic Coast" because of the many fossils to be found there, will you find the village of Beer Hackett?” “The Gobi Desert is the main physical feature in the southern half of a country also known as the homeland of Genghis Khan. Name this country.”

Knowledge Enabled Information and Services Science PARQ Vs SPARQL System# of queries answered PrecisionRecall Respondent1PARQ82100%68.3% SPARQL25100%20.83% Respondent2PARQ93100%77.5% SPARQL26100%21.6% Respondent3PARQ61100%50.83% SPARQL19100%15.83% Respondent4PARQ103100%85.83% SPARQL33100%27.5%

Knowledge Enabled Information and Services Science PARQ Vs PSPARQL SystemPrecisionRecallExecution time/query in seconds PARQ100%86.7% PSPARQL6.414%86.7%37.59 Comparison for National Geographic Bee over Geonames SystemPrecisionRecallExecution time/query in seconds PARQ100%89.13%0.099 PSPARQL65.079%89.13%2.79 Comparison for British Admin. Trivia over Ordnance Survey Dataset

Knowledge Enabled Information and Services Science Conclusion Query engines expect user to know the structure of ontology and pose well formed queries. Query engines ignore semantic relations between query terms. Need to exploit semantic relations between concepts for processing queries. Need to provide systems to perform behind the scene rewrite of queries to remove burden of knowing structure of data from the user.

Knowledge Enabled Information and Services Science Future Work Investigating support for more SPARQL constructs such as FILTER, OPTIONAL pattern Testing our approach for its applicability across domains. Systematic comparison between resolving mismatches using query re-writing method viz-a-viz a reasoner. Development of additional transformation operators which cannot be defined in terms of Winston’s categorization such as involving “containment”.

Knowledge Enabled Information and Services Science References SPARQL Kno.e.sis STT Project: Prateek Jain, Peter Z. Yeh, Kunal Verma, Cory Henson and Amit Sheth, SPARQL Query Re-writing for Spatial Datasets Using Partonomy Based Transformation Rules, Third International Conference on Geospatial Semantics (GeoS 2009), Mexico City, Mexico, December 3-4, Alkhateeb, F., Baget, J.-F., Euzenat, J.: Extending SPARQL with regular expression patterns (for querying RDF). Web Semantics 7, 57–73 (2009)

Knowledge Enabled Information and Services Science Thank You!

Knowledge Enabled Information and Services Science Geonames Dataset Description at (100 Million) RDF triples present in the dataset. Most interesting properties “parentFeature” (Administrave Region which contains the entity) and “nearbyFeature”(Entities close to this region).