Concept Description Vectors and the 20 Questions Game

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

Semantic Memory for Avatars in Cyberspace Julian Szymański, Tomasz Sarnatowicz, Włodzisław Duch Department of Informatics, Nicolaus Copernicus University,
Schema Matching and Query Rewriting in Ontology-based Data Integration Zdeňka Linková ICS AS CR Advisor: Július Štuller.
Document Clustering Content: 1.Document Clustering Essentials. 2.Text Clustering Architecture 3.Preprocessing 4.Different Document Models 1.Probabilistic.
A Linguistic Approach for Semantic Web Service Discovery International Symposium on Management Intelligent Systems 2012 (IS-MiS 2012) July 13, 2012 Jordy.
PIM Platform Free text search. When you type in the search field a suggestion tool helps you to find a concept from the ontology.
Considering Semantic similarity using ontologies in the realm of web search Name: siming sun UNI: SS3489.
Concept Description Vectors and the 20 Questions Game Włodzisław Duch Tomasz Sarnatowicz Julian Szymański.
IR & Metadata. Metadata Didn’t we already talk about this? We discussed what metadata is and its types –Data about data –Descriptive metadata is external.
Semantic Memory Architecture for Knowledge Acquisition and Management Włodzisław Duch Julian Szymański.
Gimme’ The Context: Context- driven Automatic Semantic Annotation with CPANKOW Philipp Cimiano et al.
Modern Information Retrieval Chapter 2 Modeling. Can keywords be used to represent a document or a query? keywords as query and matching as query processing.
Semantic Memory Knowledge Acquisition Through Active Dialogues Włodzisław Duch, Julian Szymański The knowledge representation using relations between concepts.
Information retrieval Finding relevant data using irrelevant keys Example: database of photographic images sorted by number, date. DBMS: Well structured.
Comments on Guillaume Pitel: “Using bilingual LSA for FrameNet annotation of French text from generic resources” Gerd Fliedner Computational Linguistics.
Semantic Memory Architecture for Knowledge Acquisition and Management Julian Szymański & Włodzisław Duch Department of Electronic, Telecommunication and.
IR Models: Review Vector Model and Probabilistic.
Representation of hypertext documents based on terms, links and text compressibility Julian Szymański Department of Computer Systems Architecture, Gdańsk.
Mrs. Maninder Kaur 1Maninder Kaur
Lecture 10 – Semantic Networks 1 Two questions about knowledge of the world 1.How are concepts stored? We have already considered prototype and other models.
People Fractions. Problem 1 of 20 Answer 1 = 10 Problem 2 of 20 Answer 2 = 5.
Knowledge Representation & Acquisition for Large-Scale Semantic Memory Julian Szymański Dept. of Electronic, Telecommunication & Informatics, Gdańsk University.
CSI315CSI315 Web Development Technologies Continued.
1 Introduction to Modeling Languages Striving for Engineering Precision in Information Systems Jim Carpenter Bureau of Labor Statistics, and President,
Knowledge and Memory: How we conceptualize information.
Steps Toward an AGI Roadmap Włodek Duch ( Google: W. Duch) AGI, Memphis, 1-2 March 2007 Roadmaps: A Ten Year Roadmap to Machines with Common Sense (Push.
Clustering User Queries of a Search Engine Ji-Rong Wen, Jian-YunNie & Hon-Jian Zhang.
Ontology-Based Information Extraction: Current Approaches.
Information Retrieval by means of Vector Space Model of Document Representation and Cascade Neural Networks Igor Mokriš, Lenka Skovajsová Institute of.
Information Retrieval and Web Search Cross Language Information Retrieval Instructor: Rada Mihalcea Class web page:
Ontologies and Lexical Semantic Networks, Their Editing and Browsing Pavel Smrž and Martin Povolný Faculty of Informatics,
June 5, 2006University of Trento1 Latent Semantic Indexing for the Routing Problem Doctorate course “Web Information Retrieval” PhD Student Irina Veredina.
Comparing and Ranking Documents Once our search engine has retrieved a set of documents, we may want to Rank them by relevance –Which are the best fit.
Particle Swarm Optimization Speaker: Lin, Wei-Kai
Progress Report (Concept Extraction) Presented by: Mohsen Kamyar.
Shridhar Bhalerao CMSC 601 Finding Implicit Relations in the Semantic Web.
Digital Libraries1 David Rashty. Digital Libraries2 “A library is an arsenal of liberty” Anonymous.
1 Information Retrieval LECTURE 1 : Introduction.
Knowledge based Question Answering System Anurag Gautam Harshit Maheshwari.
Acquisition of Categorized Named Entities for Web Search Marius Pasca Google Inc. from Conference on Information and Knowledge Management (CIKM) ’04.
A Patent Document Retrieval System Addressing Both Semantic and Syntactic Properties Liang Chen*,Naoyuki Tokuda+, Hisahiro Adachi+ *University of Northern.
Question Classification using Support Vector Machine Dell Zhang National University of Singapore Wee Sun Lee National University of Singapore SIGIR2003.
BioHealth Informatics Group 1 Techniques for segmenting large description logic ontologies Julian Seidenberg, Alan Rector
Overview of Statistical NLP IR Group Meeting March 7, 2006.
Selected Semantic Web UMBC CoBrA – Context Broker Architecture  Using OWL to define ontologies for context modeling and reasoning  Taking.
Integrated Departmental Information Service IDIS provides integration in three aspects Integrate relational querying and text retrieval Integrate search.
Semantic search-based image annotation Petra Budíková, FI MU CEMI meeting, Plzeň,
GoRelations: an Intuitive Query System for DBPedia Lushan Han and Tim Finin 15 November 2011
From Frequency to Meaning: Vector Space Models of Semantics
Automatic Writing Evaluation
Large Scale Search: Inverted Index, etc.
Lexicons, Concept Networks, and Ontologies
ece 627 intelligent web: ontology and beyond
11.30 – Semantic job search- showcase
Sorting Data are arranged according to their values.
Intro to Recursion.
Machine Learning in Natural Language Processing
Tomasz Maszczyk and Włodzisław Duch Department of Informatics,
Small is Again Beautiful in Description Logics
Sorting Data are arranged according to their values.
Semantic Memory for Avatars in Cyberspace
Automatic Detection of Causal Relations for Question Answering
Ying Dai Faculty of software and information science,
Designing Algorithms for Multiplication of Fractions
CS246: Information Retrieval
IES 511 Machine Learning Dr. Türker İnce (Lecture notes by Prof. T. M
Publisher : TRANSACTIONS ON NETWORKING Author : Haoyu Song, Jonathan S
Chaitali Gupta, Madhusudhan Govindaraju
Semantic and Declarative Technologies AIT Budapest
A Semantic Peer-to-Peer Overlay for Web Services Discovery
Presentation transcript:

Concept Description Vectors and the 20 Questions Game Włodzisław Duch Tomasz Sarnatowicz Julian Szymański

Permanent container for general knowledge Semantic Memory Permanent container for general knowledge

Hierarchical Model Collins & Quillian, 1969

Semantic network Collins & Loftus, 1975

Semantic Memory

All the concepts and keywords create a Semantic Space All the concepts and keywords create a semantic matrix

Concept Description Vectors CDV – a vector of properties describing a single concept Most of elements are 0’s – sparse vector

Data Sources I Machine readable dictionaries and ontologies: Wordnet ConceptNet Sumo/Milo ontology

Data Sources II Dictionaries data retrieval On-line sources Approach Merriam Webster Wordnet (gloss) MSN Encarta Approach Word morphing Phrases extraction (with POS tagger) Statistical analysis

Data access Binary dictionary search 220 = 1048576 Binary search – not acceptable in complex semantical applications Narrowing concept space by subsequent queries

20 Questions Game Algorithm p(keyword=vi) is fraction of concepts for which the keyword has value vi Candidate concepts O(A) are selected according to: O(A) = {i; |CDVi-A| is minimal} where CDVi is a vector for concept i and A is a partial vector of retrieved answers

Word puzzles 20Q game reversed Concept – known Keywords – the ones that would lead to the concept