Published by Κρέων Φραγκούδης. Modified over 5 years ago.
1
Document Collections
cs5984: Information Visualization
Chris North
2
Data Spaces
  Multi-dimensional
  1D
  2D
  3D
  Trees
  Networks
  Structured document collections
3
Document Collections
Unstructured document collections
Focus on full text
Examples:
  ACM Digital Library, IEEE
  Encyclopedia on CD-ROM
  Web search engine
Tasks:
  Search by keywords
  Browse by topics
  Contents
  Size
  Related documents
  Document sections
4
Goal
Create a “map” of the document collection
  Similar documents near
  Dissimilar documents far
“Grocery store” concept
5
Vectorization
          Doc1  Doc2  Doc3
Aardvark     1     2     0
Banana       2     1     0
Chris        0     0     3
…
Similarity of docs = mathematical comparison of direction of doc vectors
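The slide does not name the comparison it has in mind. One common way to compare the direction of two document vectors is cosine similarity, and the sketch below assumes that measure. It reuses the term counts from the Vectorization slide; the function name `cosine_similarity` and the variable names are illustrative only.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two term-count vectors (assumed measure)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty document has no direction to compare
    return dot / (norm_a * norm_b)


# Each document is a column of the term table: counts for (Aardvark, Banana, Chris).
doc1 = [1, 2, 0]
doc2 = [2, 1, 0]
doc3 = [0, 0, 3]

sim_12 = cosine_similarity(doc1, doc2)  # about 0.8: the two share terms
sim_13 = cosine_similarity(doc1, doc3)  # 0.0: no terms in common
```

Because only direction matters, a long and a short document with the same term proportions score as identical, which is usually what a similarity map wants.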
6
Map
Lay out documents’ n-D vectors onto a 2-D map
Kohonen feature map (self-organizing map)
  Neural network iterates to lay out the documents (similar to spring model for graph layout)
Concept -> color & keyword
# docs in concept -> size
Concept similarity -> x, y
Documents -> dots
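The iteration a Kohonen map performs can be sketched as follows. This is a minimal, textbook-style self-organizing map written for illustration, not the implementation behind the systems shown in the lecture. The grid size, learning rate, decay schedule, and all function names are assumptions, and the three document vectors are the ones from the Vectorization slide.

```python
import math
import random


def best_matching_unit(weights, vec):
    """Return (row, col) of the grid cell whose weight vector is closest to vec."""
    best, best_dist = None, math.inf
    for r, row in enumerate(weights):
        for c, w in enumerate(row):
            dist = sum((wi - vi) ** 2 for wi, vi in zip(w, vec))
            if dist < best_dist:
                best, best_dist = (r, c), dist
    return best


def train_som(docs, rows=3, cols=3, epochs=200, seed=0):
    """Train a rows x cols map on n-D document vectors (assumed parameters)."""
    rng = random.Random(seed)
    dim = len(docs[0])
    weights = [[[rng.random() for _ in range(dim)] for _ in range(cols)]
               for _ in range(rows)]
    for t in range(epochs):
        frac = 1.0 - t / epochs                  # shrinks toward 0 over time
        rate = 0.5 * frac                        # learning rate
        radius = 1.0 + (max(rows, cols) / 2.0) * frac
        for vec in docs:
            br, bc = best_matching_unit(weights, vec)
            # Pull the winning cell and its grid neighbours toward the document.
            for r in range(rows):
                for c in range(cols):
                    grid_dist2 = (r - br) ** 2 + (c - bc) ** 2
                    influence = math.exp(-grid_dist2 / (2.0 * radius ** 2))
                    w = weights[r][c]
                    for i in range(dim):
                        w[i] += rate * influence * (vec[i] - w[i])
    return weights


def normalize(vec):
    """Scale to unit length so only the vector's direction matters."""
    norm = math.sqrt(sum(v * v for v in vec))
    return [v / norm for v in vec] if norm else list(vec)


docs = {"Doc1": [1, 2, 0], "Doc2": [2, 1, 0], "Doc3": [0, 0, 3]}
unit_docs = {name: normalize(vec) for name, vec in docs.items()}
weights = train_som(list(unit_docs.values()))

# Each document becomes a dot at the grid position of its best-matching cell.
positions = {name: best_matching_unit(weights, vec)
             for name, vec in unit_docs.items()}
```

Neighbouring cells are updated together, so cells that end up close on the grid hold similar weight vectors. That is what lets grid position stand in for concept similarity on the finished map.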
7
Xia Lin
10
Web
11
Today
Wise, “Themescapes”, book pg 442
  maulik, chris r
12
Assignment
Read for Tues
  Hearst, “Tilebars”, web
  umer, ashwini
Read for Thurs
  Fox, “Envision”, web
  aejaaz, ravi
Homework #3: due today
Mid-Project status report: due Tues