AT, 26.02.2003 1 Anja Theobald University of the Saarland, Germany An Ontology for Domain-oriented Semantic.

Slides:



Advertisements
Similar presentations
Ciro Cattuto, Dominik Benz, Andreas Hotho, Gerd Stumme Presented by Smitashree Choudhury.
Advertisements

Search in Source Code Based on Identifying Popular Fragments Eduard Kuric and Mária Bieliková Faculty of Informatics and Information.
Chapter 5: Introduction to Information Retrieval
Improved TF-IDF Ranker
Astronomy How would you define this term?. Astronomy The study of everything beyond Earth!
 Andisheh Keikha Ryerson University Ebrahim Bagheri Ryerson University May 7 th
Building and Analyzing Social Networks Web Data and Semantics in Social Network Applications Dr. Bhavani Thuraisingham February 15, 2013.
A Framework for Ontology-Based Knowledge Management System
T.Sharon - A.Frank 1 Internet Resources Discovery (IRD) Classic Information Retrieval (IR)
Query Operations: Automatic Local Analysis. Introduction Difficulty of formulating user queries –Insufficient knowledge of the collection –Insufficient.
Image Search Presented by: Samantha Mahindrakar Diti Gandhi.
1 CS 430 / INFO 430 Information Retrieval Lecture 12 Probabilistic Information Retrieval.
1 CS 430 / INFO 430 Information Retrieval Lecture 12 Probabilistic Information Retrieval.
1 Configurable Indexing and Ranking for XML Information Retrieval Shaorong Liu, Qinghua Zou and Wesley W. Chu UCLA Computer Science Department {sliu, zou,
Experiments on Using Semantic Distances Between Words in Image Caption Retrieval Presenter: Cosmin Adrian Bejan Alan F. Smeaton and Ian Quigley School.
Semantic (Language) Models: Robustness, Structure & Beyond Thomas Hofmann Department of Computer Science Brown University Chief Scientist.
Disambiguation Algorithm for People Search on the Web Dmitri V. Kalashnikov, Sharad Mehrotra, Zhaoqi Chen, Rabia Nuray-Turan, Naveen Ashish For questions.
Vector Space Model CS 652 Information Extraction and Integration.
Chapter 19: Information Retrieval
Ranking by Odds Ratio A Probability Model Approach let be a Boolean random variable: document d is relevant to query q otherwise Consider document d as.
1 Intelligent Crawling Junghoo Cho Hector Garcia-Molina Stanford InfoLab.
Touring the Night Sky. What is Astronomy? Astronomy is the branch of science that studies objects beyond Earth. These “objects” include the Sun, billions.
CS344: Introduction to Artificial Intelligence Vishal Vachhani M.Tech, CSE Lecture 34-35: CLIR and Ranking in IR.
CONTI’2008, 5-6 June 2008, TIMISOARA 1 Towards a digital content management system Gheorghe Sebestyen-Pal, Tünde Bálint, Bogdan Moscaliuc, Agnes Sebestyen-Pal.
Indexing Knowledge Daniel Vasicek 2014 March 27 Introduction Basic topic is : All Human Knowledge Who Cares? Simple Examples.
INTRODUCTION TO ASTRONOMY SNC 1D. ASTRONOMY Astronomy is the study of the universe and its contents. Ex. examining the atmosphere of Mars People who watch.
1 Chapter 19: Information Retrieval Chapter 19: Information Retrieval Relevance Ranking Using Terms Relevance Using Hyperlinks Synonyms., Homonyms,
Computational Linguistics WTLAB ( Web Technology Laboratory ) Mohsen Kamyar.
Information Retrieval in Folksonomies Nikos Sarkas Social Information Systems Seminar DCS, University of Toronto, Winter 2007.
1 BINGO! and Daffodil: Personalized Exploration of Digital Libraries and Web Sources Martin Theobald Max-Planck-Institut für Informatik Claus-Peter Klas.
Thanks to Bill Arms, Marti Hearst Documents. Last time Size of information –Continues to grow IR an old field, goes back to the ‘40s IR iterative process.
Xiaoying Gao Computer Science Victoria University of Wellington Intelligent Agents COMP 423.
RCDL Conference, Petrozavodsk, Russia Context-Based Retrieval in Digital Libraries: Approach and Technological Framework Kurt Sandkuhl, Alexander Smirnov,
Dimitrios Skoutas Alkis Simitsis
Prepared by: Mahmoud Rafeek Al-Farra College of Science & Technology Dep. Of Computer Science & IT BCs of Information Technology Data Mining
10/22/2015ACM WIDM'20051 Semantic Similarity Methods in WordNet and Their Application to Information Retrieval on the Web Giannis Varelas Epimenidis Voutsakis.
Chapter 6: Information Retrieval and Web Search
Mining fuzzy domain ontology based on concept Vector from wikipedia category network.
Introduction to Digital Libraries hussein suleman uct cs honours 2003.
Mining Topic-Specific Concepts and Definitions on the Web Bing Liu, etc KDD03 CS591CXZ CS591CXZ Web mining: Lexical relationship mining.
Personalized Interaction With Semantic Information Portals Eric Schwarzkopf DFKI
Algorithmic Detection of Semantic Similarity WWW 2005.
© 2004 Chris Staff CSAW’04 University of Malta of 15 Expanding Query Terms in Context Chris Staff and Robert Muscat Department of.
Clustering C.Watters CS6403.
Digital libraries and web- based information systems Mohsen Kamyar.
Clustering More than Two Million Biomedical Publications Comparing the Accuracies of Nine Text-Based Similarity Approaches Boyack et al. (2011). PLoS ONE.
CIS 530 Lecture 2 From frequency to meaning: vector space models of semantics.
© Copyright 2008 STI INNSBRUCK A Semantic Model of Selective Dissemination of Information for Digital Libraries.
2/10/2016Semantic Similarity1 Semantic Similarity Methods in WordNet and Their Application to Information Retrieval on the Web Giannis Varelas Epimenidis.
Semantic Grounding of Tag Relatedness in Social Bookmarking Systems Ciro Cattuto, Dominik Benz, Andreas Hotho, Gerd Stumme ISWC 2008 Hyewon Lim January.
The Development of a search engine & Comparison according to algorithms Sung-soo Kim The final report.
Schema-Free XQuery Based on the work of: Yanyao Li, Cong Yu and H.V.Jagadish From the University of Michigan From the University of Michigan Presented.
Xiaoying Gao Computer Science Victoria University of Wellington COMP307 NLP 4 Information Retrieval.
Contextual Text Cube Model and Aggregation Operator for Text OLAP
Integrated Departmental Information Service IDIS provides integration in three aspects Integrate relational querying and text retrieval Integrate search.
Semantic Interoperability in GIS N. L. Sarda Suman Somavarapu.
Semantic search-based image annotation Petra Budíková, FI MU CEMI meeting, Plzeň,
Database System Concepts, 5th Ed. ©Sang Ho Lee Chapter 19: Information Retrieval.
Visual Information Retrieval
Web News Sentence Searching Using Linguistic Graph Similarity
Kenneth Baclawski et. al. PSB /11/7 Sa-Im Shin
Associative Query Answering via Query Feature Similarity
Information Retrieval on the World Wide Web
Information Retrieval
How to publish in a format that enhances literature-based discovery?
Citation-based Extraction of Core Contents from Biomedical Articles
Magnet & /facet Zheng Liang
Semantic Similarity Methods in WordNet and their Application to Information Retrieval on the Web Yizhe Ge.
Giannis Varelas Epimenidis Voutsakis Paraskevi Raftopoulou
Topic: Semantic Text Mining
Presentation transcript:

AT, Anja Theobald University of the Saarland, Germany An Ontology for Domain-oriented Semantic Similarity Search On XML Data (BTW) February 25 – 28, 2003 Leipzig, Germany

AT, Motivation Query on Web Data:  Ranking based on content data and structure (XML,…)  Grouping results by their topics  Using Ontologies for similarity search movie astronomy sports

AT, Outline 5. Similarity of Ontology Nodes 2. Ontologies - a Linguistic Challenge 3. Graph-based Ontology 4. Quantification: Edge Weights 6. Ontology-based Query Processing 0. Why we need Ranked Retrieval and Ontologies? 1. XXL Search Engine

AT, XXL Search Engine Visual XXL WWW Crawler Path Indexer Content Indexer Name Ontology Indexer Content Ontology Indexer EPI ECI NOI COI Query Processor EPI Handler ECI Handler Name Ontology Handler Content Ontology Handler XXL Query: SELECT * FROM INDEX WHERE #.~universe AS U AND U.#.~appearance AS A AND U.#.S ~ „star“ … XML Document sun …light and heat… … … …

AT, Ontologies – a linguistic challenge  ontology:...representational vocabulary of words including hier- archical relationships and associative relationships between these words [Gruber93]... symbolized stands for refers to sense:...a celestial body of hot gases... word: star object:

AT, Word – Sense – Synset  synset(s) = { w | (w,s)  U}  U = {(w,s) | w  Σ *, s  S: word w has sense s}  words w  Σ* + word senses + synonym relationship

AT, Disambiguation: Synset – Category  synset(s) = { w | (w,s)  U}// U = {(w,s) | word w has sense s} + hypernym relationship sense s: synset(s): sense 1: (astronomy) a celestial body of hot gases… star sense 4: a plane figure with 5 or more points… star  category(s) = { synset(s‘) | synset(s‘) is hypernym of synset(s)} celestial body, heavenly body natural object object, physical object entity, physical thing plane figure, 2-dim. figure figure abstraction shape, form attribute

AT, Disambiguation: Synset – Category sense s: synset(s): entity, physical thing object, physical object natural object celestial body, heavenly body sense 1: (astronomy) a celestial body of hot gases… star  synset(s) = { w | (w,s)  U}// U = {(w,s) | word w has sense s} + hypernym relationship plane figure, 2-dim. figure figure abstraction shape, form attribute sense 4: a plane figure with 5 or more points… star  category(s) = { synset(s‘) | synset(s‘) is hypernym of synset(s)}

AT, Example Ontology entity, physical thing [entity, physical thing] food [substance, matter] milk [foodstuff,...] cows‘milk [milk] group, grouping [group, grouping] galaxy,... [collection,...] milky way [galaxy,...] natural object [object,...] sun [star] universe, cosmos [collection,...] star [celestial body,...] Beta Centauri [star] [0. 71] abstraction [abstraction] star [plane figure, 2-dim figure] hexagram [star] [0.83] [0.94]

AT, Graph-based Ontology  Ontology G=(V,E) x = (synset(s), category(s))  V e = (x,y, type, weight)  E  Construction:  Use:  word:... extracted from a document  category, type:... extracted from an existing thesaurus (interchangable!!!)  weight:... expresses semantic similarity of connected words  sim:... expresses semantic similarity of ontology nodes

AT, Quantification: Edge Weight  semantic similarity of connected synsets according to their concepts  vector space measures / probabilistic measures galaxy, extragalactic nebula [collection,aggregation,accumulation,assemblage] star [celestial body,heavenly body] sun [star]  DICE coefficient:…using web search engines for word frequencies… Y := (cel  heav)  (star) X := (coll  …  ass)  (galaxy  extr…) X  Y := X  Y [0.172] [0.113]

AT, Similarity of Ontology Nodes entity [entity] cows‘ milk [milk] milk [liquid] protein [macromolecule] group [group] galaxy [collection] milky way [galaxy] sun [star] universe [collection] star [celestial body] Beta Centauri [star] [0.2] natural object [object] [0.6][0.5] [0.8] [0.1] [0.3] [0.6] sim(milky way, sun) |p|=3: 3/ / /3 0.8 = 1.2

AT, Similarity of Ontology Nodes entity [entity] cows‘ milk [milk] milk [liquid] protein [macromolecule] group [group] galaxy [collection] milky way [galaxy] sun [star] universe [collection] star [celestial body] Beta Centauri [star] [0.2] natural object [object] [0.6][0.5] [0.8] [0.1] [0.3] [0.6] sim(milky way, sun) |p|=3: 3/ / /3 0.8 = 1.23/ / /3 0.6 = 1.3

AT, Similarity of Ontology Nodes entity [entity] cows‘ milk [milk] milk [liquid] protein [macromolecule] group [group] galaxy [collection] milky way [galaxy] sun [star] universe [collection] star [celestial body] Beta Centauri [star] [0.2] natural object [object] [0.6][0.5] [0.8] [0.1] [0.3] [0.6] sim(milky way, sun) |p|=3: 3/ / /3 0.8 = 1.23/ / /3 0.6 = 1.3 sim(milky way, sun) = 0.42 sim(milky way, cows‘ milk) = 0.2

AT, Ontology-based Query Processing  XXL Query:... WHERE #.~universe AS U AND U.#.~appearance AS A AND U.#.S ~ „star“  XXL Query Representation: ~universe ~appearance % % ~ “star”  XML Documents: … sun …light and heat… … … …

AT, Ontology-based Query Processing  XXL Query:... WHERE #.~universe AS U AND U.#.~appearance AS A AND U.#.S ~ „star“ sim(universe, galaxy) sim(star, sun) *  tfidf (sun) 0.43  XXL Query Representation: ~universe ~appearance % % ~ “star” 1.0 sim(app, app) 1.0  XML Data Graph: galaxy object “…light and heat…” description sun appearance location history

AT, Ontology-based Query Processing  XXL Query:... WHERE #.~universe AS U AND U.#.~appearance AS A AND U.#.S ~ „star“ sim(universe, galaxy) sim(star, sun) *  tfidf (sun) 0.43  XXL Query Representation: ~universe ~appearance % % ~ “star” 1.0 sim(app, app) 1.0  XML Data Graph: galaxy object “…light and heat…” description sun appearance location history  (result graph) = 0.4

AT, ENDE - Vielen Dank! Gibt es etwa noch Fragen?