Kohonen Mapping and Text Semantics Xia Lin College of Information Science and Technology Drexel University.

Slides:



Advertisements
Similar presentations
INFO624 - Week 2 Models of Information Retrieval Dr. Xia Lin Associate Professor College of Information Science and Technology Drexel University.
Advertisements

November 18, 2010Neural Networks Lecture 18: Applications of SOMs 1 Assignment #3 Question 2 Regarding your cascade correlation projects, here are a few.
Un Supervised Learning & Self Organizing Maps. Un Supervised Competitive Learning In Hebbian networks, all neurons can fire at the same time Competitive.
Self Organizing Maps. This presentation is based on: SOM’s are invented by Teuvo Kohonen. They represent multidimensional.
T.Sharon - A.Frank 1 Internet Resources Discovery (IRD) Classic Information Retrieval (IR)
Search and Retrieval: More on Term Weighting and Document Ranking Prof. Marti Hearst SIMS 202, Lecture 22.
Adaptive Offset Subspace Self- Organizing Map with an Application to Handwritten Digit Recognition Huicheng Zheng, Pádraig Cunningham and Alexey Tsymbal.
Lecture 16 Graphs and Matrices in Practice Eigenvalue and Eigenvector Shang-Hua Teng.
Image Search Presented by: Samantha Mahindrakar Diti Gandhi.
Visualizating the Non-Visual: Spatial Analysis and Interaction with Information from Text Documents J.A. Wise, J.J. Thomas, K. Pennock, D. Lantrip, M.
CONTENT BASED FACE RECOGNITION Ankur Jain 01D05007 Pranshu Sharma Prashant Baronia 01D05005 Swapnil Zarekar 01D05001 Under the guidance of Prof.
Vector Space Information Retrieval Using Concept Projection Presented by Zhiguo Li
Ameriranikistan Muhammad Ahmad Kyle Huston Farhad Majdeteimouri Dan Mackin.
Lecture 21 SVD and Latent Semantic Indexing and Dimensional Reduction
SLIDE 1IS 240 – Spring 2007 Prof. Ray Larson University of California, Berkeley School of Information Tuesday and Thursday 10:30 am - 12:00.
Information retrieval Finding relevant data using irrelevant keys Example: database of photographic images sorted by number, date. DBMS: Well structured.
1 CS 430 / INFO 430 Information Retrieval Lecture 9 Latent Semantic Indexing.
Wise, Thomas, Pennock, Lantrip, Pottier, Schur, and Crow
Lecture 09 Clustering-based Learning
Projection methods in chemistry Autumn 2011 By: Atefe Malek.khatabi M. Daszykowski, B. Walczak, D.L. Massart* Chemometrics and Intelligent Laboratory.
Document (Text) Visualization Mao Lin Huang. Paper Outline Introduction Visualizing text Visualization transformations: from text to pictures Examples.
Computational Intelligence: Methods and Applications Lecture 11 Multi-Dimensional Scaling & SOM Włodzisław Duch Dept. of Informatics, UMK Google: W Duch.
Latent Semantic Analysis Hongning Wang VS model in practice Document and query are represented by term vectors – Terms are not necessarily orthogonal.
KOHONEN SELF ORGANISING MAP SEMINAR BY M.V.MAHENDRAN., Reg no: III SEM, M.E., Control And Instrumentation Engg.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology Advisor : Dr. Hsu Student : Sheng-Hsuan Wang Department.
Self-organizing Maps Kevin Pang. Goal Research SOMs Research SOMs Create an introductory tutorial on the algorithm Create an introductory tutorial on.
AuthorLink: Instant Author Co-Citation Mapping for Online Searching Xia Lin Howard D. White Jan Buzydlowski Drexel University Philadelphia,
Artificial Neural Networks Dr. Abdul Basit Siddiqui Assistant Professor FURC.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology Graph self-organizing maps for cyclic and unbounded graphs.
Self organizing maps 1 iCSC2014, Juan López González, University of Oviedo Self organizing maps A visualization technique with data dimension reduction.
Co-Cited Author Maps as Interfaces to Digital Libraries: Kohonen and PFNet Displays for the Humanities Howard D. White Jan Buzydlowski Xia Lin College.
Community Architectures for Network Information Systems
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology Adaptive nonlinear manifolds and their applications to pattern.
A Scalable Self-organizing Map Algorithm for Textual Classification: A Neural Network Approach to Thesaurus Generation Dmitri G. Roussinov Department of.
NEURAL NETWORKS FOR DATA MINING
Glasgow 02/02/04 NN k networks for content-based image retrieval Daniel Heesch.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology A Taxonomy of Similarity Mechanisms for Case-Based Reasoning.
Prepared by: Mahmoud Rafeek Al-Farra College of Science & Technology Dep. Of Computer Science & IT BCs of Information Technology Data Mining
Self Organization of a Massive Document Collection Advisor : Dr. Hsu Graduate : Sheng-Hsuan Wang Author : Teuvo Kohonen et al.
Information Visualization: Ten Years in Review Xia Lin Drexel University.
IEEE Int'l Symposium on Signal Processing and its Applications 1 An Unsupervised Learning Approach to Content-Based Image Retrieval Yixin Chen & James.
Self Organizing Feature Map CS570 인공지능 이대성 Computer Science KAIST.
Media Arts and Technology Graduate Program UC Santa Barbara MAT 259 Visualizing Information Winter 2006George Legrady1 MAT 259 Visualizing Information.
Indexing Mathematical Abstracts by Metadata and Ontology IMA Workshop, April 26-27, 2004 Su-Shing Chen, University of Florida
TreeSOM :Cluster analysis in the self- organizing map Neural Networks 19 (2006) Special Issue Reporter 張欽隆 D
Intelligent Database Systems Lab N.Y.U.S.T. I. M. Externally growing self-organizing maps and its application to database visualization and exploration.
Clustering More than Two Million Biomedical Publications Comparing the Accuracies of Nine Text-Based Similarity Approaches Boyack et al. (2011). PLoS ONE.
Digital Libraries1 David Rashty. Digital Libraries2 “A library is an arsenal of liberty” Anonymous.
Visual Data Analysis By: Anna Samouris, Tyerell Warren, LaVonda Suggs & Ashley Turner.
Neural Networks Demystified by Louise Francis Francis Analytics and Actuarial Data Mining, Inc.
1 CS 430: Information Discovery Lecture 11 Latent Semantic Indexing.
Intelligent Database Systems Lab N.Y.U.S.T. I. M. Mining massive document collections by the WEBSOM method Presenter : Yu-hui Huang Authors :Krista Lagus,
Semiconductors, BP&A Planning, DREAM PLAN IDEA IMPLEMENTATION.
Citation-Based Retrieval for Scholarly Publications 指導教授:郭建明 學生:蘇文正 M
Self-organizing maps applied to information retrieval of dissertations and theses from BDTD-UFPE Bruno Pinheiro Renato Correa
Unsupervised learning: simple competitive learning Biological background: Neurons are wired topographically, nearby neurons connect to nearby neurons.
Semantic Interoperability for Geographic Information Systems Tobun Dorbin Ng Artificial Intelligence Lab The University of Arizona.
INFO Week 7 Indexing and Searching Dr. Xia Lin Assistant Professor College of Information Science and Technology Drexel University.
Uczenie konkurencyjne: przykłady zastosowań Wykład 7 Włodzisław Duch Uniwersytet Mikołaja Kopernika Google: W. Duch.
A Self-organizing Semantic Map for Information Retrieval Xia Lin, Dagobert Soergel, Gary Marchionini presented by Yi-Ting.
Self-Organizing Network Model (SOM) Session 11
Proceedings of Infoviz’95
Special Topics in Data Mining Applications Focus on: Text Mining
Unsupervised Learning and Neural Networks
Lecture 22 Clustering (3).
Visualizing Document Collections
cs5984: Information Visualization Chris North
Feature mapping: Self-organizing Maps
Artificial Neural Networks
Presentation transcript:

Kohonen Mapping and Text Semantics Xia Lin College of Information Science and Technology Drexel University

Ten years ago Lin, X., Soergel, D., & Marchionini, G. A self-organizing semantic map for information retrieval. SIGIR’89, pp Applied Kohonen’s mapping to a small document set: – 140 documents – 25 indexing words

A Semantic Map for 140 AI Documents Citation database retrieval Intelligent library search online application Machine learningknowledge expert systems natural process language research network others

Features of the Semantic Map Reveal frequencies and distribution of underlying data. Preserve metric relationships “as faithfully as possible” while mapping from high- dimensional data to a two-dimensional display Display co-occurrence structures through its neighborhood structures.

Why do you prefer using Self- organizing Map (SOM) to textual information? The power of abstraction The feature of self-organization The format of output -- rich information for display

Information Abstraction SOM utilizes statistical information of text in a unique way – Both individual data and their inter- relationships are represented. – Learning takes place gradually To tolerate uncertainty/fuzziness in the input data – It represents large amount of data economically Similar to the way the brain processes/stores information?

Information Organization SOM uses the input data to make a random network become an organized network. – Each piece of information will find its own identity (the best place) on the map. – All the related information should be organized together. – A compromise or enforcement of both “individual responsibilities” and “social responsibilities.”

Information Visualization SOM’s output is an associative network that can be used to implement various interactive functions of the interface – A good overview of underlying data – A variety of topologic structures Sizes of groups, distances, weights of vectors, patterns of inputs, etc. – A space of both documents and terms Effective use all the space of the two-dimensional area.

How much semantics are represented in Kohonen’s map? It’s an open question. Understanding can be gained through comparisons and applications.

dove hen duck goose owl hawk eagle cat fox wolf tiger lion cow dog horse zerba (a) Hierarchical cluster fox wolf eagle cat dog tiger lion horse zebra cow owl hawk hen goose duck dove (c) Kohonen's feature map dog goose dove hen tiger hawk owl wolf duck lion eagle fox cat horse zebra cow (b) Principal component analysis

graph tree minor survey time response user computer interface human EPS M3 M4 M2 M1 C2 C5 C1 C3 C4 system (a) Display of the Latent semantic indexing result System User Human Interface Graph Tree C4 C1 C3 C5 C2 M1 M2 M3 M4 (d) Document and term map by the feature map computer Response time EPS minors survey

Visual SiteMap