cs5984: Information Visualization Chris North

Slides:



Advertisements
Similar presentations
All Rights Reserved, Copyright © FUJITSU LABORATORIES LTD An approach to KNOW-WHO using RDF Nobuyuki Igata, Hiroshi Tsuda, Isamu Watanabe and Kunio.
Advertisements

ENV Envisioning Information Lecture 6 – Document Visualization Ken Brodlie
INFO624 - Week 2 Models of Information Retrieval Dr. Xia Lin Associate Professor College of Information Science and Technology Drexel University.
Modern information retrieval Modelling. Introduction IR systems usually adopt index terms to process queries IR systems usually adopt index terms to process.
Structured Encryption and Controlled Disclosure Melissa Chase Seny Kamara Microsoft Research Asiacrypt '10 1.
Ranked Retrieval INST 734 Module 3 Doug Oard. Agenda  Ranked retrieval Similarity-based ranking Probability-based ranking.
To See, or Not to See—Is That the Query? Robert R. Korfhage Dept. of Information Science University of Pittsburgh 1991 Reviewed by Yi-Bu Chen LIS 551 Information.
Information Retrieval Visualization CPSC 533c Class Presentation Qixing Zheng March 22, 2004.
Search and Retrieval: More on Term Weighting and Document Ranking Prof. Marti Hearst SIMS 202, Lecture 22.
Interfaces for Retrieval Results. Information Retrieval Activities Selecting a collection –Talked about last class –Lists, overviews, wizards, automatic.
Query Operations: Automatic Local Analysis. Introduction Difficulty of formulating user queries –Insufficient knowledge of the collection –Insufficient.
© Anselm Spoerri Lecture 10 Visual Tools for Text Retrieval (cont.)
1 CS 430 / INFO 430 Information Retrieval Lecture 15 Usability 3.
Information Retrieval Concerned with the: Representation of Storage of Organization of, and Access to Information items.
Vector Space Model CS 652 Information Extraction and Integration.
Ranking by Odds Ratio A Probability Model Approach let be a Boolean random variable: document d is relevant to query q otherwise Consider document d as.
Memoplex Browser: Searching and Browsing in Semantic Networks CPSC 533C - Project Update Yoel Lanir.
1/16 Final project: Web Page Classification By: Xiaodong Wang Yanhua Wang Haitang Wang University of Cincinnati.
2-D: Focus+Context cs5984: Information Visualization Chris North.
Managing Large RDF Graphs (Infinite Graph) Vaibhav Khadilkar Department of Computer Science, The University of Texas at Dallas FEARLESS engineering.
IAT Text ______________________________________________________________________________________ SCHOOL OF INTERACTIVE ARTS + TECHNOLOGY [SIAT]
Trees (Hierarchical Information) cs5984: Information Visualization Chris North.
Web Document Clustering By Sang-Cheol Seok. 1.Introduction: Web document clustering? Why ? Two results for the same query ‘amazon’ Google : currently.
UOS 1 Ontology Based Personalized Search Zhang Tao The University of Seoul.
Social Networking Algorithms related sections to read in Networked Life: 2.1,
CSM06 Information Retrieval Lecture 6: Visualising the Results Set Dr Andrew Salway
L8 - March 28, 2006copyright Thomas Pole , all rights reserved 1 Lecture 8: Software Asset Management and Text Ch. 5: Software Factories, (Review)
University of Malta CSA3080: Lecture 4 © Chris Staff 1 of 14 CSA3080: Adaptive Hypertext Systems I Dr. Christopher Staff Department.
Document Collections cs5984: Information Visualization Chris North.
Media Arts and Technology Graduate Program UC Santa Barbara MAT 259 Visualizing Information Winter 2006George Legrady1 MAT 259 Visualizing Information.
1 CS 430: Information Discovery Lecture 25 Cluster Analysis 2 Thesaurus Construction.
Semantic Wordfication of Document Collections Presenter: Yingyu Wu.
Interaction LBSC 734 Module 4 Doug Oard. Agenda Where interaction fits Query formulation Selection part 1: Snippets  Selection part 2: Result sets Examination.
IAT Text ______________________________________________________________________________________ SCHOOL OF INTERACTIVE ARTS + TECHNOLOGY [SIAT]
Chapter 8 Evaluating Search Engine. Evaluation n Evaluation is key to building effective and efficient search engines  Measurement usually carried out.
© Anselm Spoerri Lecture 10 Search Visualization –“Search User Interfaces” by Hearst –Visual Query Formulation –Search Result Visualization –InfoCrystal.
3-D cs5984: Information Visualization Chris North.
User Interfaces for Information Access Prof. Marti Hearst SIMS 202, Lecture 26.
CS 440 Database Management Systems Web Data Management 1.
1 CS 430 / INFO 430: Information Retrieval Lecture 20 Web Search 2.
Multiple Views cs5984: Information Visualization Chris North.
SIMS 202, Marti Hearst Final Review Prof. Marti Hearst SIMS 202.
cs5984: Information Visualization Chris North
A Context Sensitive Searching and Ranking
Search Engine Architecture
cs5984: Information Visualization Chris North
cs5984: Information Visualization Chris North
Text Visualization Lecture 11
Visualization of Web Search Results in 3D
The Anatomy of a Large-Scale Hypertextual Web Search Engine
cs5984: Information Visualization Chris North
cs5984: Information Visualization Chris North
cs5984: Information Visualization Chris North
cs5984: Information Visualization Chris North
Visualizing Document Collections
cs5984: Information Visualization Chris North
cs5984: Information Visualization Chris North
Document Clustering Matt Hughes.
cs5984: Information Visualization Chris North
cs5984: Information Visualization Chris North
International Marketing and Output Database Conference 2005
Magnet & /facet Zheng Liang
INFORMATION VISUALIZATION (CS 5984) PRESENTATION
Search Engine Architecture
cs5984: Information Visualization Chris North
Information Retrieval and Web Design
Information Retrieval and Web Design
CS276 Information Retrieval and Web Search
Multi-Dimensional Data Visualization 3
cs5984: Information Visualization Chris North
Presentation transcript:

cs5984: Information Visualization Chris North Document Collections 2 cs5984: Information Visualization Chris North

Approaches Clustering (last time) Themescapes, … Network Keyword

Clustering With Full text Galaxy of News pg 452

Clustering Good: Bad: Map of collection Major themes and sizes Relationships between themes Scales up Bad: Where to locate documents with multiple themes? Both mountains, between mountains, …? Relationships between documents, within documents? Algorithm becomes (too) critical

Network Show inter-relationships Matrix or Complete Graph Similarity measure between all pairs of docs Threshold level Salton, pg 413

Variations Docs + Paragraphs Themes

Network Better for smaller, more detailed map Scale up: Network visualization Good: Can see more complex relationships between/within documents Can act like hyperlinks! Bad: Finding specific documents Scale up difficult

Combination: Thinkmap http://www.thinkmap.com/article.cfm?articleID=38

Keyword Search engine, keyword query “Information Retrieval” Rank ordered list “Information Retrieval”

Today Hearst, “Tilebars”, web umer, ashwini

VIBE Korfhage, http://www.pitt.edu/~korfhage/interfaces.html Documents located between query keywords using spring model

VR-VIBE

InfoCrystal Spoerri, pg 140 Venn Diagram, all possible combinations A&B&C&D A&C&D C&B C

Keyword Good: Bad: Reduces the browsing space Map according to user’s interests Bad: What keywords do I use? What about other related documents that don’t use these keywords? No initial overview Mega-hit, zero-hit problem

Assignment Mid-Project status report: due today Read for Thurs Fox, “Envision”, web, video aejaaz, ravi

Upcoming Weeks I’m at CHI all next week Tues: Go to VE, SciViz lab: Torg 3050 Bowman, Kriz, Kelso Thurs: McCrickard Read for Tues Apr 10 DeFanti, “Scientific Visualization”, pg 39 Sayle, “Rasmol”, web Yuying, ?