Visualization Taxonomies and Techniques Text: Documents and Collections University of Texas – Pan American CSCI 6361, Spring 2014.


Text: Documents and Collections Overview
Visualizations for
–Document sets
–Words & sentences
–Analysis metrics
–Concepts and themes
Recall, sensemaking
–Gaining a better understanding of the facts at hand in order to take some next steps
–(Better definitions in VA lecture)
Information visualization can help make a large document collection more understandable, more rapidly

Overviews of Documents: Document Cards
Provide a quick browsing and overview UI, maybe especially useful for small screens
Paper has useful detail about text processing
Document Cards
–Strobelt et al. TVCG (InfoVis) ‘09
–Compact visual representation of a document
–Show key terms and important images

Document Cards Video (no sound)

Representation
Layout algorithm searches for empty rectangles of space in which to place terms and images
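The empty-space search mentioned on the Representation slide can be illustrated with a minimal sketch. This is not the Document Cards layout algorithm itself: it assumes a coarse occupancy grid and a simple first-fit scan, and the names `find_empty_slot` and `place` are hypothetical.

```python
def find_empty_slot(occupied, width, height):
    """Return (row, col) of the first free width x height block, or None."""
    rows, cols = len(occupied), len(occupied[0])
    for r in range(rows - height + 1):
        for c in range(cols - width + 1):
            if all(not occupied[r + dr][c + dc]
                   for dr in range(height) for dc in range(width)):
                return r, c
    return None


def place(occupied, width, height):
    """Mark the first fitting block as occupied and return its position."""
    slot = find_empty_slot(occupied, width, height)
    if slot is not None:
        r, c = slot
        for dr in range(height):
            for dc in range(width):
                occupied[r + dr][c + dc] = True
    return slot


# Hypothetical card divided into a 4 x 6 grid of cells (assumption).
grid = [[False] * 6 for _ in range(4)]
first = place(grid, 3, 2)    # e.g. an image, 3 cells wide and 2 tall
second = place(grid, 3, 2)   # next item goes into the remaining free space
third = place(grid, 4, 3)    # too large for what is left, so None
```

A first-fit scan is the simplest possible policy; a real layout would also have to weigh item importance and aspect ratio.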

Interaction
Hover over non-image space shows abstract in tooltip
Hover over image and see caption as tooltip
Click on page number to get full page
Click on image goes to page containing it
Clicking on a term highlights it in overview and all tooltips


Bohemian Bookshelf Video

Bohemian Bookshelf
Serendipitous browsing
Video
–Thudt et al. CHI ‘12

Th
Visualize one’s history
–With whom and when has a person corresponded
–What words were used
Answer questions like:
–What sorts of things do I (the owner of the archive) talk about with each of my contacts?
–How do my conversations with one person differ from those with other people?

Th
Text analysis to seed visualization
Monthly & yearly words

Th Query UI

PaperLens Video, but no sound

PaperLens
Focus on academic papers
Visualize doc metadata such as author, keywords, date, …
Multiple tightly-coupled views
Analytics questions
Effective in answering questions regarding:
–Patterns such as frequency of authors and papers cited
–Themes
–Trends such as number of papers published in a topic area over time
–Correlations between authors, topics and citations

PaperLens
a) Popularity of topic
b) Selected authors
c) Author list
d) Degrees of separation of links
e) Paper list
f) Year-by-year top ten cited papers/authors – can be sorted by topic

NetLens

More Document Info
Highlight entities within documents
–People, places, organizations
Document summaries
Document similarity and clustering
Document sentiment

Jigsaw

Targeting sense-making scenarios
–Visual analytics
Variety of visualizations ranging from word-specific, to entity connections, to document clusters
Primary focus is on entity-document and entity-entity connections
Search capability coupled with interactive exploration
–Stasko, Görg, & Liu Information Visualization ‘08
Will see video after Visual Analytics section

Jigsaw Document view

Jigsaw List View Entities listed by type

Jigsaw Document Cluster View Entities listed by type

Jigsaw Document Grid View Here showing sentiment analysis of docs

Jigsaw Video

Kohonen’s Feature Maps Organizing Document Collections
AKA Self-Organizing Maps (SOMs)
–Map complex, non-linear relationships between high-dimensional data items into simple geometric relationships on a 2-D display
Uses neural network techniques
–Lin Visualization ‘92
Different colored areas correspond to different concepts in the collection
Size of an area corresponds to its relative importance in the set
Neighboring regions indicate commonalities in concepts
Dots in regions can represent documents
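To make the self-organizing map idea concrete, here is a minimal training sketch. It is a generic textbook-style SOM, not the implementation from the cited work: the grid size, the linear decay schedules, the Gaussian neighborhood and the toy term vectors are all assumptions, and `train_som` and `best_matching_unit` are hypothetical names.

```python
import math
import random


def best_matching_unit(weights, v):
    """Grid cell whose weight vector is closest (squared Euclidean) to v."""
    return min(weights,
               key=lambda cell: sum((a - b) ** 2 for a, b in zip(weights[cell], v)))


def train_som(vectors, rows, cols, epochs=50, seed=0):
    """Train a tiny SOM; returns a dict {(row, col): weight vector}."""
    rng = random.Random(seed)
    dim = len(vectors[0])
    weights = {(r, c): [rng.random() for _ in range(dim)]
               for r in range(rows) for c in range(cols)}
    for epoch in range(epochs):
        frac = epoch / epochs
        rate = 0.5 * (1.0 - frac)                             # learning rate decays
        radius = max(rows, cols) / 2.0 * (1.0 - frac) + 0.5   # neighborhood shrinks
        for v in vectors:
            bmu = best_matching_unit(weights, v)
            for cell, w in weights.items():
                grid_dist2 = (cell[0] - bmu[0]) ** 2 + (cell[1] - bmu[1]) ** 2
                influence = math.exp(-grid_dist2 / (2.0 * radius ** 2))
                # Pull the cell toward the input, more strongly near the BMU.
                for i in range(dim):
                    w[i] += rate * influence * (v[i] - w[i])
    return weights


# Toy "documents" as 4-term weight vectors (made-up values for illustration).
vectors = [[1.0, 1.0, 0.0, 0.0], [0.9, 1.0, 0.1, 0.0],
           [0.0, 0.0, 1.0, 1.0], [0.0, 0.1, 0.9, 1.0]]
som = train_som(vectors, rows=3, cols=3)
cells = [best_matching_unit(som, v) for v in vectors]  # map position per document
```

Each document ends up at the grid cell of its best matching unit, and documents with similar term vectors tend to land in the same or neighboring cells, which is what lets regions of the 2-D display stand for concepts.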

Kohonen’s Feature Maps Organizing Document Collections

Work at PNNL

Group has developed a number of visualization techniques for document collections
–Galaxies
–Themescapes
–ThemeRiver
–...

Galaxies
Presentation of documents where similar ones cluster together
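The slide does not say how similarity between documents is measured. One common choice, assumed here purely for illustration, is cosine similarity over word counts; `cosine_similarity` is a hypothetical helper and the sample texts are made up.

```python
import math
from collections import Counter


def cosine_similarity(text_a, text_b):
    """Cosine of the angle between the word-count vectors of two texts."""
    a = Counter(text_a.lower().split())
    b = Counter(text_b.lower().split())
    dot = sum(a[w] * b[w] for w in a)  # Counter returns 0 for missing words
    norm = (math.sqrt(sum(v * v for v in a.values()))
            * math.sqrt(sum(v * v for v in b.values())))
    return dot / norm if norm else 0.0


doc_a = "text visualization of document collections"
doc_b = "visualization of large document sets"
doc_c = "neural network training"

sim_ab = cosine_similarity(doc_a, doc_b)  # shares several words, so positive
sim_ac = cosine_similarity(doc_a, doc_c)  # shares no words, so zero
```

A layout that places highly similar pairs close together and dissimilar pairs far apart would then produce the clusters described above.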

Themescapes
Self-organizing maps didn’t reflect density of regions all that well
Use 3D representation, and have height represent density or number of documents in region
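The height-as-density idea can be sketched as a simple count of documents per grid cell. This assumes the documents already have 2-D positions in the unit square (for example from some projection); `density_grid` and the sample positions are hypothetical, and a real terrain would normally be smoothed.

```python
def density_grid(points, rows, cols):
    """Count points (x, y in [0, 1]) per grid cell; counts act as terrain height."""
    grid = [[0] * cols for _ in range(rows)]
    for x, y in points:
        r = min(int(y * rows), rows - 1)   # clamp so y == 1.0 stays in range
        c = min(int(x * cols), cols - 1)
        grid[r][c] += 1
    return grid


# Hypothetical document positions (assumption).
positions = [(0.1, 0.1), (0.15, 0.2), (0.2, 0.1), (0.8, 0.9)]
heights = density_grid(positions, 2, 2)
# heights == [[3, 0], [0, 1]]: a tall peak where three documents coincide
```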

WebTheme

Spire Video

Maps of Science
Visualize the relationships of areas of science, emerging research disciplines, the impact of particular researchers or institutions, etc.
Often use documents as the “input data”

Maps of Science Book and web site

Map of Science

Map of Science

Map of Science
Science-Related Wikipedia Activity

Temporal Issues
Semantic map gives no indication of the chronology of documents
Can we show themes and how they rise or fall over time?
ThemeRiver

Time flows from left to right
Each band/current is a topic or theme
Width of band is “strength” of that topic in documents at that time
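The band-width encoding can be sketched in a few lines. This is an illustrative simplification, not the ThemeRiver implementation: it assumes each theme is a single keyword, measures "strength" as a raw keyword count per time slot, and stacks the bands around a central axis. The function names and sample documents are hypothetical.

```python
from collections import Counter


def theme_strengths(docs, themes):
    """docs: list of (time_slot, text). Returns {slot: {theme: count}}."""
    strengths = {}
    for slot, text in docs:
        words = Counter(text.lower().split())
        row = strengths.setdefault(slot, {t: 0 for t in themes})
        for t in themes:
            row[t] += words[t]
    return strengths


def band_layout(strengths, themes):
    """Stack bands per time slot: {slot: [(theme, lower, upper), ...]}."""
    layout = {}
    for slot in sorted(strengths):
        total = sum(strengths[slot].values())
        y = -total / 2.0                 # center the river on a horizontal axis
        bands = []
        for t in themes:
            width = strengths[slot][t]   # band width = theme strength
            bands.append((t, y, y + width))
            y += width
        layout[slot] = bands
    return layout


docs = [(1990, "oil prices oil embargo"),
        (1990, "election results"),
        (1991, "election campaign election debate oil")]
themes = ["oil", "election"]
layout = band_layout(theme_strengths(docs, themes), themes)
# 1990: the "oil" band is twice as wide as "election"; in 1991 the ratio flips.
```

Drawing each theme's lower and upper edges as smooth curves across the time slots gives the river-like bands.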

Topic Modeling
High-interest topic in text analysis and visualization
Latent Dirichlet Allocation
–Unsupervised learning
–Produces “topics” evident throughout doc collection, each modeled by sets of words/terms
–Describes how each document contributes to each topic
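As a rough illustration of what Latent Dirichlet Allocation produces, the sketch below runs collapsed Gibbs sampling on a toy corpus. Collapsed Gibbs sampling is one standard inference method for LDA, chosen here for brevity; the hyperparameters, iteration count, the function name `lda_gibbs` and the toy documents are assumptions.

```python
import random


def lda_gibbs(docs, num_topics, iterations=200, alpha=0.1, beta=0.01, seed=0):
    """Collapsed Gibbs sampling for LDA on tokenized docs.

    Returns (doc_topic, topic_word): per-document topic proportions and
    per-topic word probabilities.
    """
    rng = random.Random(seed)
    vocab = sorted({w for doc in docs for w in doc})
    vocab_size = len(vocab)
    doc_topic_counts = [[0] * num_topics for _ in docs]
    topic_word_counts = [{w: 0 for w in vocab} for _ in range(num_topics)]
    topic_totals = [0] * num_topics
    assignments = []
    for d, doc in enumerate(docs):          # random initial topic per token
        zs = []
        for w in doc:
            z = rng.randrange(num_topics)
            zs.append(z)
            doc_topic_counts[d][z] += 1
            topic_word_counts[z][w] += 1
            topic_totals[z] += 1
        assignments.append(zs)

    for _ in range(iterations):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                z = assignments[d][i]       # remove the token's current topic
                doc_topic_counts[d][z] -= 1
                topic_word_counts[z][w] -= 1
                topic_totals[z] -= 1
                weights = [
                    (doc_topic_counts[d][k] + alpha)
                    * (topic_word_counts[k][w] + beta)
                    / (topic_totals[k] + vocab_size * beta)
                    for k in range(num_topics)
                ]
                z = rng.choices(range(num_topics), weights=weights)[0]
                assignments[d][i] = z       # resample and restore the counts
                doc_topic_counts[d][z] += 1
                topic_word_counts[z][w] += 1
                topic_totals[z] += 1

    doc_topic = [
        [(c + alpha) / (len(doc) + num_topics * alpha) for c in counts]
        for doc, counts in zip(docs, doc_topic_counts)
    ]
    topic_word = [
        {w: (counts[w] + beta) / (topic_totals[k] + vocab_size * beta) for w in vocab}
        for k, counts in enumerate(topic_word_counts)
    ]
    return doc_topic, topic_word


docs = [["river", "bank", "water"], ["money", "bank", "loan"],
        ["water", "river", "stream"], ["loan", "money", "credit"]]
doc_topic, topic_word = lda_gibbs(docs, num_topics=2)
```

`topic_word[k]` is the word distribution that models topic `k`, and `doc_topic[d]` gives the topic mixture of document `d`.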

TIARA
Keeps basic ThemeRiver metaphor
–Liu et al. CIKM ‘09, KDD ‘10, VAST ‘10
Embed word clouds into bands to tell more about what is in each
Magnifier lens for getting more details
Uses Latent Dirichlet Allocation to do text analysis and summarization

TIARA

TIARA Features
Lens shows senders & receivers
Documents containing “cotable”

TextFlow
Showing how topics merge and split
–Cui et al. TVCG (InfoVis) ‘11

ParallelTopics Dou et al VAST ‘11

End.

Sources
Stasko, J. (2013). CS Information Visualization at Georgia Tech
Hearst, M. (2009). Search User Interfaces
–Ch. 10: Information Visualization for Search Interfaces: 10.3, 10.4, 10.9, 10.10
–Ch. 11: Information Visualization for Text Analysis
F. Viegas, M. Wattenberg, "Tag Clouds and the Case for Vernacular Visualization", interactions, Vol. 15, No. 4, Jul-Aug 2008, pp

Video Resources
TileBars (file) 5+ min
“document cards” video (vimeo) – no sound
Parallel Tag Clouds (long)
FeatureLens (long – find spots)
FacetAtlas minutes http://vimeo.com/
:50 http://in-spire.pnnl.gov/
–Galaxy, etc.
Jigsaw (file) – VISUAL ANALYTICS EXAMPLE
PaperLens (file)
FeatureLens (file)
Mani-Wordle - Koh et al TVCG (InfoVis) ‘10 -- video