cs5984: Information Visualization Chris North

Slides:



Advertisements
Similar presentations
ENV Envisioning Information Lecture 6 – Document Visualization Ken Brodlie
Advertisements

INFO624 - Week 2 Models of Information Retrieval Dr. Xia Lin Associate Professor College of Information Science and Technology Drexel University.
Multi-Dimensional Data Visualization
Visualization Taxonomies and Techniques Text: Documents and Collections University of Texas – Pan American CSCI 6361, Spring 2014.
Self Organization of a Massive Document Collection
Web search results clustering Web search results clustering is a version of document clustering, but… Billions of pages Constantly changing Data mainly.
Search and Retrieval: More on Term Weighting and Document Ranking Prof. Marti Hearst SIMS 202, Lecture 22.
9/18/2001Information Organization and Retrieval Vector Representation, Term Weights and Clustering (continued) Ray Larson & Warren Sack University of California,
1 CS 430 / INFO 430 Information Retrieval Lecture 15 Usability 3.
Information Retrieval Concerned with the: Representation of Storage of Organization of, and Access to Information items.
Debates: Overview+Detail vs. Focus+Context 2-D vs. 3-D cs5984: Information Visualization Chris North.
2-D: Focus+Context cs5984: Information Visualization Chris North.
IAT Text ______________________________________________________________________________________ SCHOOL OF INTERACTIVE ARTS + TECHNOLOGY [SIAT]
Thanks to Bill Arms, Marti Hearst Documents. Last time Size of information –Continues to grow IR an old field, goes back to the ‘40s IR iterative process.
Construction cs5984: Information Visualization Chris North.
Information Retrieval Models - 1 Boolean. Introduction IR systems usually adopt index terms to process queries Index terms:  A keyword or group of selected.
Kohonen Mapping and Text Semantics Xia Lin College of Information Science and Technology Drexel University.
Xiaoying Gao Computer Science Victoria University of Wellington Intelligent Agents COMP 423.
Introduction to Digital Libraries hussein suleman uct cs honours 2003.
Document Collections cs5984: Information Visualization Chris North.
IAT Text ______________________________________________________________________________________ SCHOOL OF INTERACTIVE ARTS + TECHNOLOGY [SIAT]
Visual Overview Strategies cs5984: Information Visualization Chris North.
Intelligent Database Systems Lab N.Y.U.S.T. I. M. Mining massive document collections by the WEBSOM method Presenter : Yu-hui Huang Authors :Krista Lagus,
New Literacies By: Brandon Creswell, Charles Zhang LRC 320.
Xiaoying Gao Computer Science Victoria University of Wellington COMP307 NLP 4 Information Retrieval.
A Self-organizing Semantic Map for Information Retrieval Xia Lin, Dagobert Soergel, Gary Marchionini presented by Yi-Ting.
Your Poster Title Goes Here
cs5984: Information Visualization Chris North
Guangbing Yang Presentation for Xerox Docushare Symposium in 2011
CSE5544 Final Project Interactive Visualization Tool(s) for IEEE Vis Publication Exploration and Analysis Team Name: Publication Miner Team Members:
CSE5544 Final Project Interactive Visualization Tool(s) for IEEE Vis Publication Exploration and Analysis Team Name: Publication Miner Team Members:
A Context Sensitive Searching and Ranking
Multi-Dimensional Data Visualization 2
Personalized Social Image Recommendation
cs5984: Information Visualization Chris North
cs5984: Information Visualization Chris North
Professor John Canny Spring 2003
Text Visualization Lecture 11
cs5984: Information Visualization Chris North
cs5984: Information Visualization Chris North
cs5984: Information Visualization Chris North
cs5984: Information Visualization Chris North
Visualizing Document Collections
cs5984: Information Visualization Chris North
#11 Ch.2.3 Thinking Maps.
cs5984: Information Visualization Chris North
cs5984: Information Visualization Chris North
#17 Ch.2 Thinking Maps.
cs5984: Information Visualization Chris North
cs5984: Information Visualization Chris North
Magnet & /facet Zheng Liang
INFORMATION VISUALIZATION (CS 5984) PRESENTATION
Your Poster Title Goes Here
Your Poster Title Goes Here
Your Poster Title Goes Here
Your Poster Title Goes Here
Your Poster Title Goes Here
PCS CONTENT ON CMS.
y = mx + b y – y1 = m (x – x1) Topic: Writing Equations given two
Your Poster Title Goes Here
cs5984: Information Visualization Chris North
Your Poster Title Goes Here
Your Poster Title Goes Here
Multi-Dimensional Data Visualization 3
cs5984: Information Visualization Chris North
Vector Models for IR Gerald Salton, Cornell SMART System
cs5984: Information Visualization Chris North
Your Poster Title Goes Here
Your Poster Title Goes Here
Presentation transcript:

cs5984: Information Visualization Chris North Document Collections cs5984: Information Visualization Chris North

Structured document collections Data Spaces Multi-dimensional 1d 2d 3d Trees Networks Structured document collections

Document Collections Unstructured document collections Examples: Focus on Full Text Examples: acm dig lib, ieee Encyclopedia on cdrom Web search engine Tasks: search by keywords Browse by topics Contents Size Related documents Document sections

Goal Create a “map” of the document collection Similar documents near Dissimilar document far “Grocery store” concept

Vectorization Aardvark 1 2 0 Banana 2 1 0 Chris 0 0 3 … Doc1 Doc2 Doc3 Aardvark 1 2 0 Banana 2 1 0 Chris 0 0 3 … Similarity of docs = Mathematical comparison of direction of doc vectors

Map Layout documents’ n-D vectors onto 2-D map Kohonen feature map (self-organizing map) Neural network iterates to layout the documents (similar to spring model for graph layout) Concept -> Color & keyword # docs in concept -> size Concept similarity -> x,y Documents -> dots

Xia Lin

Web http://websom.hut.fi/websom/ http://maps.map.net/start

Today Wise, “Themescapes”, book pg 442 maulik, chris r

Assignment Read for Tues Read for Thurs Homework #3: due today Hearst, “Tilebars”, web umer, ashwini Read for Thurs Fox, “Envision”, web aejaaz, ravi Homework #3: due today Mid-Project status report: due Tues