Semantic Memory Architecture for Knowledge Acquisition and Management Włodzisław Duch Julian Szymański.

Slides:



Advertisements
Similar presentations
Semantic Memory for Avatars in Cyberspace Julian Szymański, Tomasz Sarnatowicz, Włodzisław Duch Department of Informatics, Nicolaus Copernicus University,
Advertisements

Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
Spelling Correction for Search Engine Queries Bruno Martins, Mario J. Silva In Proceedings of EsTAL-04, España for Natural Language Processing Presenter:
Semantic Access to Data from the Web Raquel Trillo *, Laura Po +, Sergio Ilarri *, Sonia Bergamaschi + and E. Mena * 1st International Workshop on Interoperability.
Creating a Similarity Graph from WordNet
Representation/organization in LTM Typical empirical testing paradigm: propositional verification task – rt to rose is flower, vs. rose is plant. Set Theoretical.
IS530 Lesson 12 Boolean vs. Statistical Retrieval Systems.
Introduction Information Management systems are designed to retrieve information efficiently. Such systems typically provide an interface in which users.
Concept Description Vectors and the 20 Questions Game Włodzisław Duch Tomasz Sarnatowicz Julian Szymański.
 How many pages does it search?  How does it access all those pages?  How does it give us an answer so quickly?  How does it give us such accurate.
T.Sharon - A.Frank 1 Internet Resources Discovery (IRD) Classic Information Retrieval (IR)
Internet Resources Discovery (IRD) Search Engines Quality.
Knowing Semantic memory.
Modern Information Retrieval Chapter 2 Modeling. Can keywords be used to represent a document or a query? keywords as query and matching as query processing.
Computer comunication B Information retrieval. Information retrieval: introduction 1 This topic addresses the question on how it is possible to find relevant.
Chapter Seven The Network Approach: Mind as a Web.
Semantic Memory Knowledge Acquisition Through Active Dialogues Włodzisław Duch, Julian Szymański The knowledge representation using relations between concepts.
Knowledge information that is gained and retained what someone has acquired and learned organized in some way into our memory.
Semantic Memory Architecture for Knowledge Acquisition and Management Julian Szymański & Włodzisław Duch Department of Electronic, Telecommunication and.
Overview of Long-Term Memory laura leventhal. Reference Chapter 14 Chapter 14.
Representation of hypertext documents based on terms, links and text compressibility Julian Szymański Department of Computer Systems Architecture, Gdańsk.
Xiaomeng Su & Jon Atle Gulla Dept. of Computer and Information Science Norwegian University of Science and Technology Trondheim Norway June 2004 Semantic.
Internet Research, Second Edition- Illustrated 1 Internet Research: Unit A Searching the Internet Effectively.
Knowledge Representation & Acquisition for Large-Scale Semantic Memory Julian Szymański Dept. of Electronic, Telecommunication & Informatics, Gdańsk University.
MediaEval Workshop 2011 Pisa, Italy 1-2 September 2011.
General Knowledge Dr. Claudia J. Stanny EXP 4507 Memory & Cognition Spring 2009.
Steps Toward an AGI Roadmap Włodek Duch ( Google: W. Duch) AGI, Memphis, 1-2 March 2007 Roadmaps: A Ten Year Roadmap to Machines with Common Sense (Push.
Aardvark Anatomy of a Large-Scale Social Search Engine.
By : Garima Indurkhya Jay Parikh Shraddha Herlekar Vikrant Naik.
Search Engines & Search Engine Optimization (SEO).
Introduction CSE 1310 – Introduction to Computers and Programming Vassilis Athitsos University of Texas at Arlington 1.
ITIS 1210 Introduction to Web-Based Information Systems Chapter 27 How Internet Searching Works.
Annotating Words using WordNet Semantic Glosses Julian Szymański Department of Computer Systems Architecture, Faculty of Electronics, Telecommunications.
Planning a search strategy.  A search strategy may be broadly defined as a conscious approach to decision making to solve a problem or achieve an objective.
Universit at Dortmund, LS VIII
Knowledge Representation CPTR 314. The need of a Good Representation  The representation that is used to represent a problem is very important  The.
Module 7 Lesson 1.
Memory for General Knowledge and Concepts and Categorization
XP New Perspectives on The Internet, Sixth Edition— Comprehensive Tutorial 3 1 Searching the Web Using Search Engines and Directories Effectively Tutorial.
M Machine Learning F# and Accord.net. Alena Dzenisenka Software architect at Luxoft Poland Member of F# Software Foundation Board of Trustees Researcher.
1 Automatic Classification of Bookmarked Web Pages Chris Staff Second Talk February 2007.
LATENT SEMANTIC INDEXING Hande Zırtıloğlu Levent Altunyurt.
Search Engine Architecture
Enhancing Cluster Labeling Using Wikipedia David Carmel, Haggai Roitman, Naama Zwerdling IBM Research Lab (SIGIR’09) Date: 11/09/2009 Speaker: Cho, Chin.
Memory systems What kinds of things do we know? Is all memory the same? Types of memory systems Semantic and episodic memory.
Understanding User’s Query Intent with Wikipedia G 여 승 후.
Next Generation Search Engines Ehsun Daroodi 1 Feb, 2003.
Flickr Tag Recommendation based on Collective Knowledge BÖrkur SigurbjÖnsson, Roelof van Zwol Yahoo! Research WWW Summarized and presented.
Internet Research – Illustrated, Fourth Edition Unit A.
1 Information Retrieval LECTURE 1 : Introduction.
Web Information Retrieval Prof. Alessandro Agostini 1 Context in Web Search Steve Lawrence Speaker: Antonella Delmestri IEEE Data Engineering Bulletin.
Knowledge based Question Answering System Anurag Gautam Harshit Maheshwari.
Dichotomous Keys! Chapter 2.1 (pages 52-53).
Long-Term Memory Information not lost from STM is then passed to LTM Information not lost from STM is then passed to LTMCapacity Virtually unlimited Duration.
A code-centric cluster-based approach for searching online support forums for programmers Christopher Scaffidi, Christopher Chambers, Sheela Surisetty.
Verbal Representation of Knowledge
Computational Intelligence: Methods and Applications Lecture 26 Density estimation, Expectation Maximization. Włodzisław Duch Dept. of Informatics, UMK.
INFORMATION RETRIEVAL MEASUREMENT OF RELEVANCE EFFECTIVENESS 1Adrienn Skrop.
Tutoring & Help Systems Deepthi Bollu for CSE495 10/31/2003.
Crawling When the Google visit your website for the purpose of tracking, Google does this with help of machine, known as web crawler, spider, Google bot,
Search Engine Optimization
Visual Information Retrieval
Designing Cross-Language Information Retrieval System using various Techniques of Query Expansion and Indexing for Improved Performance  Hello everyone,
Search Engine Architecture
Concept Description Vectors and the 20 Questions Game
Data Mining Chapter 6 Search Engines
Identify Different Chinese People with Identical Names on the Web
Search Engine Architecture
Information Retrieval and Web Design
The Network Approach: Mind as a Web
Presentation transcript:

Semantic Memory Architecture for Knowledge Acquisition and Management Włodzisław Duch Julian Szymański

Semantic Memory Endel Tulving „Episodic and Semantic Memory” 1972 Semantic memory refers to the memory of meanings and understandings. It stores koncept-based, generic, context-free knowledge. One of types of long-term memory. Together with episodic memory make up the category of declarative memory. (the others are episodic and procedural) Semantic memory includes generalized knowledge that does not involve memory of a specific event. Pernament container for general knowledge (facts, ideas, words, problem solving)

Hierarchical Model Collins & Quillian, 1969

Semantic network Collins & Loftus, 1975

Knowledge representation wCRK

Interactive semantic space

Concept Description Vectors Cobra is_aanimal is_abeast is_abeing is_abrute is_acreature is_aentity is_afauna is_aobject is_aorganism is_areptile is_aserpent is_asnake is_avertebrate hasbelly hasbody part hascell haschest hascosta hasdigit hasface hashead hasrib hastail hasthorax

Semantic Space exploration Binary dictionary search Binary dictionary search 2 20 = Binary search – not acceptable in complex semantical applications Binary search – not acceptable in complex semantical applications Semantic space can be search using context – based algorithm. Similar to word game. Semantic space can be search using context – based algorithm. Similar to word game. Concept space narrowed by subsequent user answers Concept space narrowed by subsequent user answers

20 questions game algorithm, where p(keyword=v i ) is fraction of concepts for which the keyword has value v i Subspace of candidate concepts O(A) are selected according to: O(A) = {i; d=|CDV i -ANSW| is minimal},where CDV i is a vector for i-concept and ANSW is a partial vector of retrieved answers ● we can deal with user mistakes choosing d > minimal

Data aquisition How to obtain semantic data? How to obtain semantic data? Wordnet Wordnet Relations for Semantic category: animal Relations for Semantic category: animal 7543 objects and 1696 features 7543 objects and 1696 features Truncated using word popularity rank: Truncated using word popularity rank:  IC – information content is an amount of appearances of the particular word in WordNet descriptions  GR - GoogleRank is an amount of web pages returned by Google search engine for a given word  BNC are the words statistics taken from British National Norpus. - Semantic Space reduced to 889 objects and 420 features

Active learning Data from wordnet: Data from wordnet: Not complete Not complete Not common sence Not common sence Sometimes specialised concepts Sometimes specialised concepts Basic dialogs for obtaining new relations Basic dialogs for obtaining new relations I give up. Tell me what did you think of? I give up. Tell me what did you think of? Tell me what is characteristic for ? Tell me what is characteristic for ? Knowledge correction : Knowledge correction :, where: W 0 – initial weight, initial knowledge ANS – answer given by user N – amount of answers β - parametr for indicating importance initial knowledge

The game Giraffe: Giraffe: [is vertebrate] Y,[is mammal] Y, [has hoof] Y, [is equine] N, [is bovine] N, [is deer] N, [is swine] N, [has horn] N, [has horn] N,[is sheep] N,[is antelope] N,[is bison] N. System correctly guess concept giraffe. - Yuppi i’ve won! Let’s talk about giraffe. Tell me what is characteristic for giraffe? After entering keyword. Semantic memory is reorganised, and ready to play new games. Lion: Lion: [is vertebrate] Y,[is mammal] Y, [has hoof] N, [has paw] Y, [is canine] N,[is cat] Y, [is wildcat] Y The different way for organizing concept lion in WordNet taxonomy, causes the game goes in wrong way and system fails guess this concept: [is leopard] N,[is painter] N,[is puma] N,[is lynx] N, [is leopard] N,[is painter] N,[is puma] N,[is lynx] N, [is lynx] N. I give up. What it was? Lion … [is lynx] N. I give up. What it was? Lion … After giving right answer system reorganizes its knowledge and next game for searching concept lion is finished with success: [is vertebrate] Y, [is mammal] Y, [has hoof] N, [has paw] Y, [is canine] N, [is cat] Y,[is wildcat] Y, [is leopard] N, [is canine] N, [is cat] Y,[is wildcat] Y, [is leopard] N, [has mane] Y, I guess it is lion.

Avatar

Experimental results How many games do we need do clarify semantic space? How many games do we need do clarify semantic space? proportion failed games N f performed to achieve first success. proportion failed games N f performed to achieve first success. The semantic memory error: The semantic memory error: where N s is amount of the games finished with success and N is total games amount, for searching first 10 concepts were 0.22 How it changes during learning process? How it changes during learning process? Avg Density features / object Avg Density features / object

Thank you