What to Send First? A Study of Utility in the Semantic Web Mike Dean 1, Prithwish Basu 1, Ben Carterette 2, Craig Partridge 1, and James Hendler 3 1 Raytheon.

Slides:



Advertisements
Similar presentations
Copyright, UCL LEADERS: Linking EAD to Electronically Retrievable Sources Developing a Generic Toolkit: Architecture and technology issues ALLC/ACH Conference.
Advertisements

Semantic Web in Group Formation Asma Ounnas Learning Societies Lab School of Electronics and Computer Science The University of Southampton, UK
CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
CS570 Artificial Intelligence Semantic Web & Ontology 2
Experiential Learning Cycle
Building and Analyzing Social Networks Web Data and Semantics in Social Network Applications Dr. Bhavani Thuraisingham February 15, 2013.
 Copyright 2008 Digital Enterprise Research Institute. All rights reserved. Digital Enterprise Research Institute Context Dependent Reasoning.
Incremental Materialization of RDF Graph Closures for Stream Reasoning Alexandre Mello Ferreira (PhD student) 22/11/2010.
Relational Data Mining in Finance Haonan Zhang CFWin /04/2003.
The Semantic Web – WEEK 5: RDF Schema + Ontologies The “Layer Cake” Model – [From Rector & Horrocks Semantic Web cuurse]
COMP 6703 eScience Project Semantic Web for Museums Student : Lei Junran Client/Technical Supervisor : Tom Worthington Academic Supervisor : Peter Strazdins.
Chapter 7: Resource Description Framework (RDF) Service-Oriented Computing: Semantics, Processes, Agents – Munindar P. Singh and Michael N. Huhns, Wiley,
RDF: Building Block for the Semantic Web Jim Ellenberger UCCS CS5260 Spring 2011.
Machine Reasoning about Anomalous Sensor Data Matt Calder, Francesco Peri, Bob Morris Center for Coastal Environmental Sensoring Networks CESN University.
Department of Computer Science, University of Maryland, College Park 1 Sharath Srinivas - CMSC 818Z, Spring 2007 Semantic Web and Knowledge Representation.
Information Fusion: Moving from domain independent to domain literate approaches Professor Deborah L. McGuinness Tetherless World Constellation, Rensselaer.
1 User Interface Design CIS 375 Bruce R. Maxim UM-Dearborn.
Semantic Web Bootcamp Dominic DiFranzo PhD Student/Research Assistant Rensselaer Polytechnic Institute Tetherless World Constellation.
Knowledge Mediation in the WWW based on Labelled DAGs with Attached Constraints Jutta Eusterbrock WebTechnology GmbH.
Ontologies: Making Computers Smarter to Deal with Data Kei Cheung, PhD Yale Center for Medical Informatics CBB752, February 9, 2015, Yale University.
© 2006 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice Publishing data on the Web (with.
RDF: Concepts and Abstract Syntax W3C Recommendation 10 February Michael Felderer Digital Enterprise.
Managing Large RDF Graphs (Infinite Graph) Vaibhav Khadilkar Department of Computer Science, The University of Texas at Dallas FEARLESS engineering.
Semantic Web outlook and trends May The Past 24 Odd Years 1984 Lenat’s Cyc vision 1989 TBL’s Web vision 1991 DARPA Knowledge Sharing Effort 1996.
1cs The Need “Most of the Web's content today is designed for humans to read, not for computer programs to manipulate meaningfully.” Berners-Lee,
1 CENTRIA, Dept. Informática da Faculdade de Ciências e Tecnologia, Universidade Nova de Lisboa, Caparica, Portugal. 2 Institute of Computer Science,
Okech Odhiambo Faculty of Information Technology Strathmore University
Knowledge representation
The Semantic Web Web Science Systems Development Spring 2015.
Database Support for Semantic Web Masoud Taghinezhad Omran Sharif University of Technology Computer Engineering Department Fall.
September 30, 2002EON 2002Slide 1 Integrating Ontology Storage and Ontology-based Applications A lesson for better evaluation methodology Peter Mika:
Query Routing in Peer-to-Peer Web Search Engine Speaker: Pavel Serdyukov Supervisors: Gerhard Weikum Christian Zimmer Matthias Bender International Max.
Master Informatique 1 Semantic Technologies Part 11Direct Mapping Werner Nutt.
Laboratory for InterNet Computing CSCE 561 Social Media Projects Ryan Benton October 8, 2012.
RDF and triplestores CMSC 461 Michael Wilson. Reasoning  Relational databases allow us to reason about data that is organized in a specific way  Data.
Workshop on Software Product Archiving and Retrieving System Takeo KASUBUCHI Hiroshi IGAKI Hajimu IIDA Ken’ichi MATUMOTO Nara Institute of Science and.
LOD for the Rest of Us Tim Finin, Anupam Joshi, Varish Mulwad and Lushan Han University of Maryland, Baltimore County 15 March 2012
©Ferenc Vajda 1 Semantic Grid Ferenc Vajda Computer and Automation Research Institute Hungarian Academy of Sciences.
Efficient RDF Storage and Retrieval in Jena2 Written by: Kevin Wilkinson, Craig Sayers, Harumi Kuno, Dave Reynolds Presented by: Umer Fareed 파리드.
Chapter 7: Resource Description Framework (RDF) Service-Oriented Computing: Semantics, Processes, Agents – Munindar P. Singh and Michael N. Huhns, Wiley,
EEL 5937 Ontologies EEL 5937 Multi Agent Systems Lecture 5, Jan 23 th, 2003 Lotzi Bölöni.
OntoQA: Metric-Based Ontology Quality Analysis Samir Tartir, I. Budak Arpinar, Michael Moore, Amit P. Sheth, Boanerges Aleman-Meza IEEE Workshop on Knowledge.
Semi-Automatic Quality Assessment of Linked Data without Requiring Ontology Saemi Jang, Megawati, Jiyeon Choi, and Mun Yong Yi KIRD, KAIST NLP&DBPEDIA.
The future of the Web: Semantic Web 9/30/2004 Xiangming Mu.
Metadata Common Vocabulary a journey from a glossary to an ontology of statistical metadata, and back Sérgio Bacelar
Chapter 7: Resource Description Framework (RDF) Service-Oriented Computing: Semantics, Processes, Agents – Munindar P. Singh and Michael N. Huhns, Wiley,
User Profiling using Semantic Web Group members: Ashwin Somaiah Asha Stephen Charlie Sudharshan Reddy.
Strategies for subject navigation of linked Web sites using RDF topic maps Carol Jean Godby Devon Smith OCLC Online Computer Library Center Knowledge Technologies.
Extending the MDR for Semantic Web November 20, 2008 SC32/WG32 Interim Meeting Vilamoura, Portugal - Procedure for the Specification of Web Ontology -
ISO/IEC JTC 1/SC 32 Plenary and WGs Meetings Jeju, Korea, June 25, 2009 Jeong-Dong Kim, Doo-Kwon Baik, Dongwon Jeong {kjd4u,
A Semantic Web Approach for the Third Provenance Challenge Tetherless World Rensselaer Polytechnic Institute James Michaelis, Li Ding,
1cs The Need “Most of the Web's content today is designed for humans to read, not for computer programs to manipulate meaningfully.” Berners-Lee,
Of 38 lecture 6: rdf – axiomatic semantics and query.
THE SEMANTIC WEB By Conrad Williams. Contents  What is the Semantic Web?  Technologies  XML  RDF  OWL  Implementations  Social Networking  Scholarly.
Information Architecture The Open Group UDEF Project
Commonsense Reasoning in and over Natural Language Hugo Liu, Push Singh Media Laboratory of MIT The 8 th International Conference on Knowledge- Based Intelligent.
An Ontology-based Approach to Context Modeling and Reasoning in Pervasive Computing Dejene Ejigu, Marian Scuturici, Lionel Brunie Laboratoire INSA de Lyon,
1 A Medical Information Management System Using the Semantic Web Technology Networked Computing and Advanced INFORMATION MANAGEMENT, NCM '08. Fourth.
1 The Semantic Web Jonathan Jackson GCUU Master’s Seminar Spring 2005.
Towards the Adaptive Semantic Web Peter Dolog Nicola Henze Wolfgang Nejdl Michael Sintek.
Ideologies, Politics and Synthesis.  Ideal- “framework or vision”  -logy “speaking or reasoning about”  Ideologies = shared beliefs, motivations,
Semantic Interoperability in GIS N. L. Sarda Suman Somavarapu.
Semantic Web for the Working Ontologist Dean Allemang Jim Hendler SNU IDB laboratory.
September 2003, 7 th EDG Conference, Heidelberg – Roberta Faggian, CERN/IT CERN – European Organization for Nuclear Research The GRACE Project GRid enabled.
Service-Oriented Computing: Semantics, Processes, Agents
Service-Oriented Computing: Semantics, Processes, Agents
Service-Oriented Computing: Semantics, Processes, Agents
Analyzing and Securing Social Networks
RDF Presentation and Correct Content Conveyance for Legacy
RDF Presentation and Correct Content Conveyance for Legacy
Presentation transcript:

What to Send First? A Study of Utility in the Semantic Web Mike Dean 1, Prithwish Basu 1, Ben Carterette 2, Craig Partridge 1, and James Hendler 3 1 Raytheon BBN Technologies 2 University of Delaware 3 Rensselaer Polytechnic Institute Joint Large and Heterogeneous Data and Quantified Formalization Workshop (LHD+SemQuant 2012) Joint Large and Heterogeneous Data and Quantified Formalization Workshop (LHD+SemQuant 2012) Boston, Massachusetts 12 November Copyright 2012 Raytheon BBN Technologies

Outline Problem Our Solution Future Work 2 Copyright 2012 Raytheon BBN Technologies

Problem Transfer a knowledge base in a constrained or intermittent communication environment –Tactical military –Large football game or conference Send the most important information first –Prioritize statements based on their utility Account for inference –No need to transfer inferred statements 3 KB Copyright 2012 Raytheon BBN Technologies

Utility The utility of a statement can be calculated by a preference function U(S, s) where S is the set of statements in a knowledge base and s ∈ S Somewhat arbitrarily –Utility ranges from 0 to 1 –The total utility of all statements in S should equal 1 4 Copyright 2012 Raytheon BBN Technologies

Preference Functions Ideally, users would provide a preference function suitable for a given context –Difficult to extract or derive Need a default preference function when nothing more specific is available We selected inverse frequency as the default Motivations –Surprise in previous research on Semantic Information Theory –Term frequency-inverse document frequency in Information Retrieval systems 5 Copyright 2012 Raytheon BBN Technologies

RDF Utility We consider each URI and literal to be a symbol We compute the utility of a statement by averaging the inverse frequencies of its subject, predicate, and object components and then normalizing the results 6 Copyright 2012 Raytheon BBN Technologies

Inference Statements can be used to infer other statements –We want to quantify this by computing the inference contribution of each of these statements Statements can have different utilities in different KBs –We’re particularly interested in the initial (ground) KB and its deductive closure The total inference contribution is 1 – the utility of each of the ground statements in the deductive closure 7 Copyright 2012 Raytheon BBN Technologies

Framework An experiment consists of –A set of statements (KB) –An inference procedure – we used RDF Schema –A preference function – we used inverse frequency –A statement ranking function, which uses various computed values Implemented using Jena, its rule based reasoner, and its Derivation interface We accumulate utility in the deductive closure as statements are transmitted and inferred –Generate a transcript and a cumulative utility graph –An experiment can be summarized by its average cumulative utility 8 Copyright 2012 Raytheon BBN Technologies

Data Sets POTUS – Wikipedia information about Presidents of the United States FOAF – My FOAF profile + FOAF vocabulary Cascade – Discussed later Data sets and code are available at Copyright 2012 Raytheon BBN Technologies

Ranking Gold standard: Ranking by utility in the deductive closure + inference contribution Inference contribution is rather difficult and expensive to compute –Most reasoners provide 1 justification, not all Also tried several heuristics –Utility in the initial KB –Utility in the deductive closure –Random T box, then random A box Base case: random 10 Copyright 2012 Raytheon BBN Technologies

Results of Different Ranking Functions Cumulative utility for 262 statements in the POTUS data set 11 Copyright 2012 Raytheon BBN Technologies

Observations We can effectively order statements to increase or maximize average cumulative utility Using inverse frequency –Inferred RDFS statements are of lower utility –Matches intuitions and practice regarding rdf:Resource, etc. Ranking based on simpler heuristics appears promising –More research is needed 12 Copyright 2012 Raytheon BBN Technologies

Cascade Data :. :A rdfs:subClassOf :B. :B rdfs:subClassOf :C. :C rdfs:subClassOf :D. :D rdfs:subClassOf :E. :a rdf:type :A. Possible to analyze all 5! = 120 possible permutations What order do you think is best? 13 Copyright 2012 Raytheon BBN Technologies

Cascade Data Set (2) Average Cumulative Utility for all 120 permutations of cascade statements 14 Copyright 2012 Raytheon BBN Technologies

Cascade Data Set (3) Statements 0. :D rdfs:subClassOf :E. 1.:B rdfs:subClassOf :C. 2.:C rdfs:subClassOf :D. 3.:a rdf:type :A. 4.:A rdfs:subClassOf :B. Best results: average cumulative utility.639 –01423 –04123 –04213 –10423 –24013 –40123 –40213 – Copyright 2012 Raytheon BBN Technologies

Contributions Introducing utility into the Semantic Web Quantifying inference A new problem An evaluation framework 16 Copyright 2012 Raytheon BBN Technologies

Future Directions Incorporating user-defined preferences Employing more sophisticated inference (e.g. OWL RL) Working with (much) larger data sets Generalizing our framework into a toolkit Considering bits required to encode messages Addressing multi-party situations with different preference functions Modeling information fusion 17 Copyright 2012 Raytheon BBN Technologies