Knowledge Enabled Information and Services Science Extending SPARQL to Support Spatially and Temporally Related Information Prateek Jain, Amit Sheth Peter.

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

Schema Matching and Query Rewriting in Ontology-based Data Integration Zdeňka Linková ICS AS CR Advisor: Július Štuller.
CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
Information Systems & Semantic Web University of Koblenz ▪ Landau, Germany Semantic Web - Query Languages – Steffen Staab
Knowledge Graph: Connecting Big Data Semantics
Dave Kolas, BBN Technologies Terra Cognita 08 Karlsruhe, Germany 10/26/08 1 Supporting Spatial Semantics with SPARQL.
W3C Video on the Web Workshop December 2007, San Jose, California Video on the Semantic Sensor Web Amit Sheth Amit Sheth with Cory Henson, Prateek.
Research topics Semantic Web - Spring 2007 Computer Engineering Department Sharif University of Technology.
Linked Sensor Data Harshal Patni, Cory Henson, Amit P. Sheth Ohio Center of Excellence in Knowledge enabled Computing (Kno.e.sis) Wright State University,
Annotating Documents for the Semantic Web Using Data-Extraction Ontologies Dissertation Proposal Yihong Ding.
1 Extracting RDF Data from Unstructured Sources Based on an RDF Target Schema Tim Chartrand Research Supported By NSF.
Semantic Representation of Temporal Metadata in a Virtual Observatory Han Wang 1 Eric Rozell 1
Presented by Zeehasham Rasheed
Citation Semantic Provenance: Trusted Biomedical Data Integration Spatial Semantics for Better Interoperability and Analysis: Challenges And Experiences.
XML –Query Languages, Extracting from Relational Databases ADVANCED DATABASES Khawaja Mohiuddin Assistant Professor Department of Computer Sciences Bahria.
Semantic Representation of Temporal Metadata in a Virtual Observatory Han Wang 1 Eric Rozell 1
Semantic Web Technology Evaluation Ontology (SWETO): A test bed for evaluating tools and benchmarking semantic applications WWW2004 (New York, May 22,
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Predicting Missing Provenance Using Semantic Associations in Reservoir Engineering Jing Zhao University of Southern California Sep 19 th,
A Statistical and Schema Independent Approach to Identify Equivalent Properties on Linked Data † Kno.e.sis Center Wright State University Dayton OH, USA.
Knowledge Mediation in the WWW based on Labelled DAGs with Attached Constraints Jutta Eusterbrock WebTechnology GmbH.
Copyright © 2010 Accenture All Rights Reserved. Accenture, its logo, and High Performance Delivered are trademarks of Accenture. Multiple Ontologies in.
Erasmus University Rotterdam Introduction With the vast amount of information available on the Web, there is an increasing need to structure Web data in.
Semantic modeling of System Requirements Lunch Lieke Verhelst MSc Student for GIMA Feb 27 th 2009.
The Data Attribution Abdul Saboor PhD Research Student Model Base Development and Software Quality Assurance Research Group Freie.
Querying Tree-Structured Data Using Dimension Graphs Dimitri Theodoratos (New Jersey Institute of Technology, USA) Theodore Dalamagas (National Techn.
Ontology Development Kenneth Baclawski Northeastern University Harvard Medical School.
An Integrated Approach to Extracting Ontological Structures from Folksonomies Huairen Lin, Joseph Davis, Ying Zhou ESWC 2009 Hyewon Lim October 9 th, 2009.
Krishnaprasad Thirunarayan, Pramod Anantharam, Cory A. Henson, and Amit P. Sheth Kno.e.sis Center, Ohio Center of Excellence on Knowledge-enabled Computing,
Grant Number: IIS Institution of PI: Arizona State University PIs: Zoé Lacroix Title: Collaborative Research: Semantic Map of Biological Data.
SPARQL Semantic Web - Spring 2008 Computer Engineering Department Sharif University of Technology.
1 SAMT’08 Semantic-driven multimedia retrieval with the MPEG Query Format Ruben Tous and Jaime Delgado Distributed Multimedia Applications Group (DMAG)
The Semantic Web Web Science Systems Development Spring 2015.
Knowledge Enabled Information and Services Science SPARQL Query Re-writing for Spatial Datasets Using Partonomy Based Transformation Rules Prateek Jain,
Ontology Alignment for Linked Open Data – ISWC2010 research track Prateek Jain Pascal Hitzler Amit Sheth Kno.e.sis Center Wright State University, Dayton,
SWETO: Large-Scale Semantic Web Test-bed Ontology In Action Workshop (Banff Alberta, Canada June 21 st 2004) Boanerges Aleman-MezaBoanerges Aleman-Meza,
UOS 1 Ontology Based Personalized Search Zhang Tao The University of Seoul.
Pavan Reddiavri (Ebiquity Labs) “R ♫ P” RDF Access control Policies.
Metadata. Generally speaking, metadata are data and information that describe and model data and information For example, a database schema is the metadata.
Dimitrios Skoutas Alkis Simitsis
Developing “Geo” Ontology Layers for Web Query Faculty of Design & Technology Conference David George, Department of Computing.
MyActivity: A Cloud-Hosted Ontology-Based Framework for Human Activity Querying Amin BakhshandehAbkear Supervisor:
A Context Model based on Ontological Languages: a Proposal for Information Visualization School of Informatics Castilla-La Mancha University Ramón Hervás.
Using Several Ontologies for Describing Audio-Visual Documents: A Case Study in the Medical Domain Sunday 29 th of May, 2005 Antoine Isaac 1 & Raphaël.
A Systemic Approach for Effective Semantic Access to Cultural Content Ilianna Kollia, Vassilis Tzouvaras, Nasos Drosopoulos and George Stamou Presenter:
Ontology-Based Computing Kenneth Baclawski Northeastern University and Jarg.
Graph Summaries for Subgraph Frequency Estimation 1 Angela Maduko, 2 Kemafor Anyanwu, 3 Amit Sheth, 4 Paul Schliekelman 1 LSDIS Lab, University of Georgia.
Ontology Mapping in Pervasive Computing Environment C.Y. Kong, C.L. Wang, F.C.M. Lau The University of Hong Kong.
Tool for Ontology Paraphrasing, Querying and Visualization on the Semantic Web Project By Senthil Kumar K III MCA (SS)‏
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Shridhar Bhalerao CMSC 601 Finding Implicit Relations in the Semantic Web.
Issues in Ontology-based Information integration By Zhan Cui, Dean Jones and Paul O’Brien.
Extending the MDR for Semantic Web November 20, 2008 SC32/WG32 Interim Meeting Vilamoura, Portugal - Procedure for the Specification of Web Ontology -
Ontology Quality by Detection of Conflicts in Metadata Budak I. Arpinar Karthikeyan Giriloganathan Boanerges Aleman-Meza LSDIS lab Computer Science University.
Knowledge Representation. Keywordsquick way for agents to locate potentially useful information Thesaurimore structured approach than keywords, arranging.
Toward a framework for statistical data integration Ba-Lam Do, Peb Ruswono Aryan, Tuan-Dat Trinh, Peter Wetz, Elmar Kiesling, A Min Tjoa Linked Data Lab,
Concepts and Realization of a Diagram Editor Generator Based on Hypergraph Transformation Author: Mark Minas Presenter: Song Gu.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
1 An infrastructure for context-awareness based on first order logic 송지수 ISI LAB.
Semantic Data Extraction for B2B Integration Syntactic-to-Semantic Middleware Bruno Silva 1, Jorge Cardoso 2 1 2
Ontology Technology applied to Catalogues Paul Kopp.
GoRelations: an Intuitive Query System for DBPedia Lushan Han and Tim Finin 15 November 2011
1 Intelligent Information System Lab., Department of Computer and Information Science, Korea University Semantic Social Network Analysis Kyunglag Kwon.
Of 24 lecture 11: ontology – mediation, merging & aligning.
Personalized Ontology for Web Search Personalization S. Sendhilkumar, T.V. Geetha Anna University, Chennai India 1st ACM Bangalore annual Compute conference,
OWL (Ontology Web Language and Applications) Maw-Sheng Horng Department of Mathematics and Information Education National Taipei University of Education.
Semantic Sensor Web Amit Sheth LexisNexis Ohio Eminent Scholar
Query Rewriting Framework for Spatial Data
Associative Query Answering via Query Feature Similarity
Introduction Dataset search
Presentation transcript:

Knowledge Enabled Information and Services Science Extending SPARQL to Support Spatially and Temporally Related Information Prateek Jain, Amit Sheth Peter Z. Yeh, Kunal Verma Kno.e.sis, Accenture Technology Labs Wright State University, San Jose, CA Dayton, OH

Knowledge Enabled Information and Services Science Increased availability of spatial information

Knowledge Enabled Information and Services Science But accessing this information can be difficult

Knowledge Enabled Information and Services Science User expected to ask for this information in the “right” way

Knowledge Enabled Information and Services Science Proposed approach Automatically align conceptual mismatches between a user’s query and spatial information of interest through a set of semantic operators. Our approach will reduce the user’s burden of having to know how information of interest is structured, and hence improve accuracy and relevance of the results.

Knowledge Enabled Information and Services Science Outline Introduction Existing Mechanisms for querying RDF Data –Existing approaches –How well do they work? Proposed Approach Future Work

Knowledge Enabled Information and Services Science Why is it important? Spatial data becoming more significant day by day. Crucial for multitude of applications: – GPS – Military – Location Aware Services – weather data… Spatial Data availability on Web continuously increasing. –Sensor streams, satellite imagery –Naïve users contribute and correct spatial data too which can lead to discrepancies in data representation. E.g. Geonames, Wikimapia

Knowledge Enabled Information and Services Science What’s the problem Existing approaches only analyze spatial information and queries at the lexical and syntactic level. Mismatches are common between how a query is expressed and how information of interest is represented. –Question: “Find schools in NJ”. –Answer: Sorry, no answers found! –Reason: Only counties are in states. Natural language introduces much ambiguity for semantic relationships between entities in a query. –Find Schools in Greene County.

Knowledge Enabled Information and Services Science What needs to be done? We need to reduce users’ burden of having to know how information of interest is represented and structured in order to enable access to this information by a broad population. We need to resolve mismatches between a query and information of interest due to differences in granularity in order to improve recall of relevant information. We need to resolve ambiguous relationships between entities due to natural language in order to reduce the amount of wrong information retrieved.

Knowledge Enabled Information and Services Science Existing mechanism for querying RDF

Knowledge Enabled Information and Services Science Known approaches SPARQL Path Expressions

Knowledge Enabled Information and Services Science Common query for testing all approaches! “Find schools located in the state of Ohio”

Knowledge Enabled Information and Services Science In a perfect scenario contains feature School Ohio

Knowledge Enabled Information and Services Science In a not so perfect scenario Contains feature SchoolCounty Contains feature Ohio

Knowledge Enabled Information and Services Science And finally.. Contains feature County School Contains feature Ohio Indiana Contains feature School Exchange students

Knowledge Enabled Information and Services Science Lets test the approaches!

Knowledge Enabled Information and Services Science SPARQL SPARQL Protocol and RDF Query Language. User to express queries over data stores where data stored as RDF or viewed as RDF. W3C Recommendation since Allows for query to have triple patterns, conjunctions, disjunctions and optional patterns.

Knowledge Enabled Information and Services Science SPARQL in perfect scenario PREFIX rdf: PREFIX geo: SELECT ?school WHERE { ?state geo:featureClass geo:A. ?schools geo:featureClass geo:S. ?state geo:name “Ohio”. ?state geo:childrenFeatures ?schools. }

Knowledge Enabled Information and Services Science Results Snapshot of retrieved results Expected ResultsActual Results Wilber High School Cherokee Elementary School Buckeye High School Middletown High School Fairborn Elementary School Since SPARQL works fine for perfect scenario, we do not need to evaluate other approaches for simple scenario.

Knowledge Enabled Information and Services Science SPARQL in not so perfect scenario PREFIX rdf: PREFIX geo: SELECT ?school WHERE { ?state geo:featureClass geo:A. ?schools geo:featureClass geo:S. ?state geo:name “Ohio”. ?state geo:childrenFeatures ?county. ?county geo:childrenFeatures ?schools. } Increase in one triple constraint every additional level.

Knowledge Enabled Information and Services Science Results Still works… Expected ResultsActual Results Wilber High School Cherokee Elementary School Buckeye High School Middletown High School Fairborn Elementary School

Knowledge Enabled Information and Services Science SPARQL in final scenario… PREFIX rdf: PREFIX geo: SELECT ?school WHERE { ?state geo:featureClass geo:A ?schools geo:featureClass geo:S. ?state geo:name "Ohio“ ?state geo:childrenFeatures ?county. ?county geo:parentFeature ?school. } User has to know the exact structure and the precise relationships.

Knowledge Enabled Information and Services Science Path Expressions Finds paths in an RDF Graph given a source and a destination. Possible to specify constraints on the intermediate nodes e.g. path length, intermediate node, pattern constraint,… Example: Find any feedback loops (i.e. non simple paths) that involve the compound Methionine. SELECT ??p WHERE { ?x ??p ?x. ?z compound:name “ Methionine”. PathFilter(containsAny(??p, ?z) ) }

Knowledge Enabled Information and Services Science Using path queries for slight and severe mismatch The semantics of the query changes to “Find schools related to Ohio”. SELECT ??school WHERE { ?state ??path ?school ?state geo:featureClass geo:A. ?state geo:name “Ohio”. ?school geo:featureClass geo:S. PathFilter( cost(??path) < 2 ) } User has to know the path length for retrieving correct results.

Knowledge Enabled Information and Services Science If available paths are Ohio Greene County Wilber High School Ohio Montgomery County Dayton School Ohio Adams County Buckeye High School Ohio Lake County Nashville High Ohio Greene County Seattle Aca. has_County has_school has_Countyhas_school has_County has_school has_Countyexchanges_student has_County exchanges_student

Knowledge Enabled Information and Services Science Results Snapshot of retrieved results Expected ResultsActual Results Wilber High School Dayton School Buckeye High School Nashville High School Seattle Academy

Knowledge Enabled Information and Services Science So where do these mechanism stand.. Ease of writing ExpressivityWorks in all scenarios Schema agnostic SPARQLX√XX Path Expression√√X√

Knowledge Enabled Information and Services Science Proposed Approach

Knowledge Enabled Information and Services Science Proposed Approach Define operators to ease writing of expressive queries by implicit usage of semantic relations between query terms and hence remove the burden of expressing named relations in a query. Define transformation rules for operators based on work by Winston’s taxonomy of part-whole relations. Rule based approach allows applicability in different domains with appropriate modifications.

Knowledge Enabled Information and Services Science Architecture SELECT ?school WHERE { ?school geo:childrenFeature Ohio. } SELECT ?school WHERE { ?state geo:name "Ohio“ ?state geo:childrenFeatures ?county. ?county geo:childrenFeatures ?schools.} Transformation Rules Triple Constraints Query Variables Altered Triple Constraints Altered Query Variables User submits SPARQL Query Query Rewriting Engine Rewritten Query according to the data structure Mapping of ontology properties to Winston’s categories Meta rules for Winston’s Categories +=

Knowledge Enabled Information and Services Science Example Rules Transitivity –(a φ-part of b) (b φ-part of c) (a φ-part of c) –(Dayton place-part of Ohio) (Ohio place-part of US) (Dayton place-part of US) Overlap –(a place-part of b) (a place-part of b) (b overlaps c) –(Sri L. place-part of Indian Ocean) (Sri L. place-part of Bay of Bengal) (Indian Ocean overlaps with Bay of Bengal) Spatial Inclusion –(a instance of b) (c spatially included in a) (c spatially included in b) –(White House instance of Building) (Barack is in White House) (Barack is in building)

Knowledge Enabled Information and Services Science Perfect Scenario SELECT ?school WHERE { ?state geo:featureClass geo:A ?schools geo:featureClass geo:S. ?state geo:name "Ohio“ ?schools in ?state. } SELECT ?school WHERE { ?state geo:featureClass geo:A ?schools geo:featureClass geo:S. ?state geo:name "Ohio“ ?state geo:childrenFeatures ?schools } Query Re-Writer

Knowledge Enabled Information and Services Science Slight and Severe Mismatch SELECT ?school WHERE { ?state geo:featureClass geo:A ?schools geo:featureClass geo:S. ?state geo:name "Ohio“ ?schools in ?state. } SELECT ?school WHERE { ?state geo:featureClass geo:A ?schools geo:featureClass geo:S. ?state geo:name "Ohio“ ?state geo:childrenFeatures ?county. ?county geo:childrenFeatures ?schools. } Query Re-Writer

Knowledge Enabled Information and Services Science So where do we stand with all these mechanisms.. Ease of writing ExpressivityWorks in all scenarios Schema agnostic SPARQLX√XX Rho-Operator√√X√ Our Approach√√√√

Knowledge Enabled Information and Services Science Future Work

Knowledge Enabled Information and Services Science Evaluation Evaluate architecture on publicly available datasets such as Geonames, Sensor Ontology. Provide framework to execute schema agnostic complex queries such as –Find sensor systems to track blizzards in Ohio. –Find sensor systems to track blizzards in Ohio between Dec 25 th -27 th –…..

Knowledge Enabled Information and Services Science Conclusion Query engines expect user to know the structure of ontology and pose well formed queries. Query engines ignore semantic relations between query terms. Need to exploit semantic relations between concepts for processing queries. Need to provide systems to perform behind the scene rewrite of queries to remove burden of knowing structure of data from the user.

Knowledge Enabled Information and Services Science References SPARQL Matthew Perry, Amit Sheth, Farshad Hakimpour, Prateek Jain “Supporting Complex Thematic, Spatial and Temporal Queries over Semantic Web Data”, Second International Conference on Geospatial Semantics (GEOS '07) Kemafor Anyanwu, Angela Maduko, and Amit P. Sheth, “SPARQ2L: Towards Support For Subgraph Extraction Queries in RDF Databases”, 16th international conference on World Wide Web (WWW’ 07)

Knowledge Enabled Information and Services Science Thank You!

Knowledge Enabled Information and Services Science Backup

Knowledge Enabled Information and Services Science Sensor Ontology Consists of roughly 90 classes and 33 object properties Data instantiated by querying Mesowest service. Exhibits meronymy between various concepts –Places,System,Compound Observation,…. –Existing queries on dataset use Meronymy implicitly.

Knowledge Enabled Information and Services Science Existing Queries with Sensor Ontology Existing Queries for Sensor Ontology Query a specific sensor for specific property Example: “Find temperature recorded by System1” Query a specific sensor for specific property with certain belief value. (Belief value assigned randomly as of now) Example: “Find temperature recorded by System1 with belief value 0.73” Query from a specific sensor system for a specific feature with a belief value Example: “Find blizzards recorded by System1” Query from a specific sensor system for a specific feature within a time interval Example: “Find blizzards recorded by System1 during 1 st Feb rd Feb 1984”

Knowledge Enabled Information and Services Science Geonames Dataset Description at (100 Million) RDF triples present in the dataset. Most interesting properties “parentFeature” (Administrave Region which contains the entity) and “nearbyFeature”(Entities close to this region).