Ontology-Based Free-Form Query Processing for the Semantic Web by Mark Vickers Supported by:

Slides:



Advertisements
Similar presentations
QA-LaSIE Components The question document and each candidate answer document pass through all nine components of the QA-LaSIE system in the order shown.
Advertisements

David W. Embley Brigham Young University Provo, Utah, USA WoK: A Web of Knowledge.
Ontologies for multilingual extraction Deryle W. Lonsdale David W. Embley Stephen W. Liddle Supported by the.
1 Ontology Based Extraction of RDF Data from the World Wide Web Tim Chartrand A Thesis Proposal Research Supported By NSF.
A Framework for Extraction Plans and Heuristics in an Ontology-Based Data-Extraction System Alan Wessman Brigham Young University MS Thesis Defense Based.
Domain-Independent Data Extraction: Person Names Carl Christensen and Deryle Lonsdale Brigham Young University
HyKSS: A Multiple Ontology Approach to Hybrid Search Andrew Zitzelberger Brigham Young University MS Thesis Proposal.
ISP 433/533 Week 2 IR Models.
Schema Matching and Data Extraction over HTML Tables Cui Tao Data Extraction Research Group Department of Computer Science Brigham Young University supported.
Supporting Queries with Imprecise Constraints Ullas Nambiar Dept. of Computer Science University of California, Davis Subbarao Kambhampati Dept. of Computer.
A Framework for Pay-as-you-go Extraction Ontology Based Information Retrieval Andrew Zitzelberger.
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
Data Frames Version 3 Proposal. Data Frames Version 2 Year matches [2] constant { extract "\d{2}"; context "([^\$\d]|^)\d{2}[^,\dkK]"; } 0.5, { extract.
Visual Web Information Extraction With Lixto Robert Baumgartner Sergio Flesca Georg Gottlob.
Ontology-Based Free-Form Query Processing for the Semantic Web Thesis proposal by Mark Vickers.
ER 2002BYU Data Extraction Group Automatically Extracting Ontologically Specified Data from HTML Tables with Unknown Structure David W. Embley, Cui Tao,
Ontology-Based Information Extraction and Structuring Stephen W. Liddle † School of Accountancy and Information Systems Brigham Young University Douglas.
From OSM-L to JAVA Cui Tao Yihong Ding. Overview of OSM.
Annotating Documents for the Semantic Web Using Data-Extraction Ontologies Dissertation Proposal Yihong Ding.
By ANDREW ZITZELBERGER A Framework for Extraction Ontology Based Information Management.
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
1 Extracting RDF Data from Unstructured Sources Based on an RDF Target Schema Tim Chartrand Research Supported By NSF.
Ontology-Based Free-Form Query Processing for the Semantic Web Mark Vickers Brigham Young University MS Thesis Defense Supported by:
Semantic Web Queries by Mark Vickers Funded by NSF.
Answering Imprecise Queries over Autonomous Web Databases Ullas Nambiar Dept. of Computer Science University of California, Davis Subbarao Kambhampati.
Semantic Understanding An Approach Based on Information-Extraction Ontologies David W. Embley Brigham Young University.
Semantic Understanding An Approach Based on Information-Extraction Ontologies David W. Embley Brigham Young University.
Data Frame Augmentation of Free Form Queries for Constraint Based Document Filtering Andrew Zitzelberger.
1 Cui Tao PhD Dissertation Defense Ontology Generation, Information Harvesting and Semantic Annotation For Machine-Generated Web Pages.
Enhance legal retrieval applications with an automatically induced knowledge base Ka Kan Lo.
Automatic Creation and Simplified Querying of Semantic Web Content An Approach Based on Information-Extraction Ontologies Yihong Ding, David W. Embley,
Thesis Proposal Mini-Ontology GeneratOr (MOGO) Mini-Ontology Generation from Canonicalized Tables Stephen Lynn Data Extraction Research Group Department.
Result presentation. Search Interface Input and output functionality – helping the user to formulate complex queries – presenting the results in an intelligent.
AQUAINT Kickoff Meeting – December 2001 Integrating Robust Semantics, Event Detection, Information Fusion, and Summarization for Multimedia Question Answering.
Faculty of Informatics and Information Technologies Slovak University of Technology Personalized Navigation in the Semantic Web Michal Tvarožek Mentor:
Organizing Information Digitally Norm Friesen. Overview General properties of digital information Relational: tabular & linked Object-Oriented: inheritance.
Cross-Language Hybrid Keyword and Semantic Search David W. Embley, Stephen W. Liddle, Deryle W. Lonsdale, Joseph S. Park, Andrew Zitzelberger Brigham Young.
“How much context do you need?” An experiment about context size in Interactive Cross-language Question Answering B. Navarro, L. Moreno-Monteagudo, E.
Interoperability in Information Schemas Ruben Mendes Orientador: Prof. José Borbinha MEIC-Tagus Instituto Superior Técnico.
AnswerBus Question Answering System Zhiping Zheng School of Information, University of Michigan HLT 2002.
Question Answering.  Goal  Automatically answer questions submitted by humans in a natural language form  Approaches  Rely on techniques from diverse.
When Experts Agree: Using Non-Affiliated Experts To Rank Popular Topics Meital Aizen.
NLP And The Semantic Web Dainis Kiusals COMS E6125 Spring 2010.
Chapter 13 Query Processing Melissa Jamili CS 157B November 11, 2004.
An Aspect of the NSF CDI InitiativeNSF CDI: Cyber-Enabled Discovery and Innovation.
XP New Perspectives on The Internet, Sixth Edition— Comprehensive Tutorial 3 1 Searching the Web Using Search Engines and Directories Effectively Tutorial.
Question Answering over Implicitly Structured Web Content
Collocations and Information Management Applications Gregor Erbach Saarland University Saarbrücken.
P2P Concept Search Fausto Giunchiglia Uladzimir Kharkevich S.R.H Noori April 21st, 2009, Madrid, Spain.
Publication Spider Wang Xuan 07/14/2006. What is publication spider Gathering publication pages Using focused crawling With the help of Search Engine.
Google’s Deep-Web Crawl By Jayant Madhavan, David Ko, Lucja Kot, Vignesh Ganapathy, Alex Rasmussen, and Alon Halevy August 30, 2008 Speaker : Sahana Chiwane.
Faculty of Informatics and Information Technologies Slovak University of Technology Personalized Navigation in the Semantic Web Michal Tvarožek Mentor:
Research Topics/Areas. Adapting search to Users Advertising and ad targeting Aggregation of Results Community and Context Aware Search Community-based.
Ontology-Based Computing Kenneth Baclawski Northeastern University and Jarg.
Issues in Ontology-based Information integration By Zhan Cui, Dean Jones and Paul O’Brien.
An Aspect of the NSF CDI Initiative CDI: Cyber-Enabled Discovery and Innovation.
Ontology-Based Free-Form Query Processing for the Semantic Web Mark Vickers Brigham Young University MS Thesis Defense Supported by:
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
1 Question Answering and Logistics. 2 Class Logistics  Comments on proposals will be returned next week and may be available as early as Monday  Look.
Relevance Feedback Prof. Marti Hearst SIMS 202, Lecture 24.
David W. Embley Brigham Young University Provo, Utah, USA.
Semantic Interoperability in GIS N. L. Sarda Suman Somavarapu.
Designing Cross-Language Information Retrieval System using various Techniques of Query Expansion and Indexing for Improved Performance  Hello everyone,
Learning Usage of English KWICly with WebLEAP/DSR
Kenneth Baclawski et. al. PSB /11/7 Sa-Im Shin
Cross-language Information Retrieval
Ontology Evolution: A Methodological Overview
David W. Embley Brigham Young University Provo, Utah, USA
CS246: Information Retrieval
Information Retrieval and Web Design
Presentation transcript:

Ontology-Based Free-Form Query Processing for the Semantic Web by Mark Vickers Supported by:

The Problem Searching the web for an answer to a question is hard. Searching the web for an answer to a question is hard. Returns documents (usually too many) Returns documents (usually too many) Can it instead return just the right answers? Can it instead return just the right answers? Semantic web Semantic web Proposed ontology-based framework for making information machine-readable Proposed ontology-based framework for making information machine-readable Better access to information Better access to information How should semantic web be searched? How should semantic web be searched?

Solution: AskOntos – a Query System for the Semantic Web Allows free-form queries Allows free-form queries Processes queries using information extraction Processes queries using information extraction Returns tables of extracted values Returns tables of extracted values

Extraction Ontologies Object sets Relationship sets Participation constraints Lexical Non-lexical Primary object set Aggregation Generalization/Specialization

Extraction Ontologies Value Expression: \s*[$]\s*(\d{1,3})*(\.\d{2})? Key Word Phrase Left Context: $ Data Frame: Internal Representation: float Value Phrase Key Word Expression: ([Pp]rice)|([Cc]ost)| … Operation Phrase Operator: > Expression: (more\s*than)|(more\s*costly)|…

Query AskOntos Ontology Matching Form XQuery Answer Extraction Ontology Repository Extracted Values AskOntos Overview Extracted Query Values Extracted Data Web

Step 1. Parse Query “Find me the and of all s – I want a ”pricemileagere d Nissan1998or newer >= Operator

Step 2. Find Corresponding Ontology Similarity value: 6 Similarity value: 2 >= Operator “Find me the price and mileage of all red Nissans – I want a 1998 or newer”

Step 3. Formulate XQuery Expression Conjunctive queries run over selected ontology’s extracted values Conjunctive queries run over selected ontology’s extracted values 7 Nissan MakeIns YearIns red ColorIns

Value-phrase-matching words determine conditions Value-phrase-matching words determine conditions Conditions: Conditions: Color = “red” Color = “red” Make = “Nissan” Make = “Nissan” Year >= 1998 Year >= 1998 >= Operator Step 3. Formulate XQuery Expression

1: for $doc in document("file:///c:/ontos/owlLib/Car.OWL")/rdf:RDF 2: for $Record in $doc/owl:Thing 3: 4: let $id := "CarIns") 5: let $Color := $id)]/car:ColorValue/text() 6: let $Make := $id)]/car:MakeValue/text() 7: let $Year := $id)]/car:YearValue/text() 8: let $Price := $id)]/car:PriceValue/text() 9: let $Mileage := $id)]/car:MileageValue/text() 10: 11: where($Color="red" or empty($Color)) and 12: ($Make="Nissan" or empty($Make)) and 13: ($Year>="1998" or empty($Year)) 14: return 15: {$Price} 16: {$Mileage} 17: {$Color} 18: {$Make} 19: {$Year} 20: For each owl:Thing Get the instance ID and extracted values Check conditions Return values Step 3. Formulate XQuery Expression

Step 4. Run XQuery Expression Over Ontology’s Extracted Data Uses Qexo 1.7, GNU’s XQuery engine for Java Uses Qexo 1.7, GNU’s XQuery engine for Java Use XSLT to transform results to HTML table Use XSLT to transform results to HTML table

Evaluation of AskOntos Measure success by: Measure success by: Ability to match query to correct ontology Ability to match query to correct ontology Ability to translate free-form queries into formal queries Ability to translate free-form queries into formal queries We create: We create: Extraction ontologies for: car ads, diamonds, … Extraction ontologies for: car ads, diamonds, … Queries for preliminary evaluation: 10 Conjunctive queries for car ads Queries for preliminary evaluation: 10 Conjunctive queries for car ads Future work: do more evaluation Future work: do more evaluation

Query Translation Metrics “Find me the price and mileage of all red Nissans – I want a 1998 or newer.” Human conversion for $doc in document("file:///.../Car.OWL")/rdf:RDF for $Record in $doc/owl:Thing … where($Color="red" or empty($Color)) and ($Make="Nissan" or empty($Make)) and ($Year="1998" or empty($Year)) return {$Price} {$Color} {$Make} {$Year} Automated conversion PrecisionRecall PROJECT100%80% SELECT88%88% PROJECT: {Price,Color, Make, Year} SELECT: {(Color,=,“red”), (Make,=,“Nissan”), (Year,=,“1998”)} PROJECT: {Price, Mileage,Color, Make, Year} SELECT: {(Color,=,“red”), (Make,=,“Nissan”), (Year,>=,“1998”)}

Preliminary Results PrecisionRecall PROJECT100%95% SELECT78%76% 1. Find me a 1994 red Nissan for $ Find me the price and mileage of all red Nissans – I want a 1998 or newer. 3. Find me a black Ford for under $ it should be a 1990 or newer and have less than 120K miles on it. 4. Show me the year of all chevy corvettes for less than $25, I want the year, price, mileage, and color of all Toyota Camrys 6. What 2002 cars cost less than $9,000? 7. I want a 1998 or newer Ford for $10,000 or less 8. I want the year of any Honda with at least 200K miles on it 9. What colors can I get a camry in between I want to see all 2001 Toyota 4 Runners with less than 100K miles, that are blue, and have AC  8. I want the year of any Honda with at least 200K miles on it  6. What 2002 cars cost less than $9,000?

Conclusion/Contributions AskOntos AskOntos Is a free-form query system for the semantic web Is a free-form query system for the semantic web Applies information extraction for query processing Applies information extraction for query processing Answers questions with extracted data values Answers questions with extracted data values Contributions Contributions Web queries that use semantic annotations Web queries that use semantic annotations Web queries returning answers from extracted data Web queries returning answers from extracted data Processing free-form queries using ontologies Processing free-form queries using ontologies

TREC 2004 QA Question Topics

Related Research AskOntos NLIDB(60’s-Now) Syntactic analysis Syntactic analysis No structural analysis No structural analysis Portability to new domains Portability to new domains Design new ontology Design new ontology Bernstein et al. (2005) Subset of English (ACE) Subset of English (ACE) Conjuntive queries Conjuntive queries AQUA (2004) Target: Single domain environment Target: Single domain environment Target: Semantic web Target: Semantic web Part of speech recognition Part of speech recognition No recognition No recognition Returns passages Returns passages Returns extracted values Returns extracted values SHOE (2000) Form-based interface Form-based interface Free-form interface Free-form interface QUEST (1999) Graph-based interface Graph-based interface Free-from interface Free-from interface

Evaluating the Framework AgeFuneralDateViewingRelationship/RelativeName RecallPrecisionRecallPrecisionRecallPrecisionRecallPrecision New Ontos 60%50%68%76%80%63%74%43% Legacy Ontos 57%38%63%75%93%18%73%41% Four of eighteen object sets shown above. Data from Salt Lake Tribune and Arizona Daily Star Input:  Obituaries ontology  25 obituaries from two newspapers

Scaling to the Web Ontologies crawl and harvest web pages Ontologies crawl and harvest web pages Ontologies extract values from pages Ontologies extract values from pages Ontologies indexed (??) Ontologies indexed (??) Queries extracted by relevant ontologies Queries extracted by relevant ontologies Rely on Google-like technology Rely on Google-like technology