Published by Aaron Schmidt. Modified over 11 years ago.
1 Visualizing Information: Using WebTheme to Visualize Internet Search Results
Karen Buxton and Mary Frances Lembo
The Value of Information: American Society for Information Science, Pacific Northwest Chapter Fall Meeting, Sept. 20-21, 2002
PNNL-SA-36456
2 Presentation Overview
  Brief Overview of Information Visualization
  Introduction to WebTheme
  Preparing a WebTheme Query
  Exploring a Dataset
  Question & Answer
3 Information Visualization
What is an information visualization?
  A visual representation of data that lets the user navigate large datasets more quickly and gain additional insight into the data.
What types of data can be used?
  Text, image data, numerical data, etc.
4 How Is Information Visualization Used? Battlefield Awareness Business Intelligence Enterprise Knowledge Management Environmental Security Intellectual Asset Management Intelligence Analysis Law Enforcement Market Assessment Medical Informatics Medical Research Nuclear Non-Proliferation Research Program Management Science and Technology Scanning Translingual Text Analysis
5 Information Visualization at PNNL
  Analyzes large volumes of text
  Displays related documents and themes as star clusters and terrain maps
SPIRE
Related Technologies
  WebTheme
  Galaxies
  ThemeView
  Correlation Tool
  Starlight
6 What is WebTheme?
  Web-enabled version of SPIRE
  Harvests data from the World Wide Web by using search terms or by following links derived from user-specified URLs
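The slides do not say how WebTheme follows links, so the following is only a minimal sketch of the general idea of deriving links from a page, using the Python standard library. The names `LinkCollector` and `extract_links` are hypothetical and are not part of WebTheme.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkCollector(HTMLParser):
    """Collects the href targets of <a> tags, resolved against a base URL."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        for name, value in attrs:
            if name == "href" and value:
                # Turn relative links into absolute URLs.
                self.links.append(urljoin(self.base_url, value))


def extract_links(html, base_url):
    """Return the absolute URLs linked from one HTML page (illustrative only)."""
    collector = LinkCollector(base_url)
    collector.feed(html)
    return collector.links
```

A harvester built on this idea would fetch each user-specified URL, call `extract_links` on the returned page, and queue the results for download up to some depth limit.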
7 Licensing
Government Agencies (NOT Contractors)
  WebTheme use agreement available at no cost!
  Installation and training agreement available for a fee
Non-Governmental Organizations
  Negotiate a contract
  Recommend installation and training agreement
8 Why Use WebTheme?
  Investigate and characterize websites
  Investigate a new technology
  Find key players in a particular field
  Find opportunities for collaboration
9 Using WebTheme
Preparing a WebTheme Query
  Planning a Query
  Creating a Dataset
Exploring a Dataset
  Using WebTheme Tools
  Exploring a Galaxy
  Exploring ThemeView
11 Planning a WebTheme Query
Decide How to Gather the Data
Experiment with Search Engine Queries
  Google
  AltaVista
  Examine Search Results
  Modify Query If Needed
Exploration of the Site or Links
  URL List
12 WebTheme
13 Create a New Data Set
14 Create a Search Query or Follow a URL List?
15 Harvest Settings
16 Advanced Options for Harvesting
17 Filters
18 Processing
19 Galaxies Layout
White Dots = Documents
Location Has Meaning
  Proximity
  Distance
Degree of Thematic Concentration
  Topic Strength & Number of Documents
Galaxies Clouds = ThemeView Mountains
Note Instructions at Bottom of Window
20 WebTheme Toolbar
21 Exploring Galaxies
  Cluster Centroids: Click on Centroid Circle to See Cluster Terms
  Thematic Labels Indicate Dominant Themes in Clouds
22 Viewing Document Titles
Select
  Click + S icon
  Drag Select to Choose a Group of Documents
View Document Titles
  Click + Ab icon
  Click on dots to reveal titles
23 Viewing Documents
  Document Viewer
  Search for Words in a Document
  View in Browser
24 Link Mode
  Must Turn on Link Mode BEFORE Processing
  Arrows Indicate Links from One Page to Another
  Circle Indicates No Links from Page
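As a rough illustration of what the arrows and circles in Link Mode encode, the sketch below splits a small link table into directed edges (arrows) and pages with no outgoing links (circles). The `links` mapping and the `link_view` function are hypothetical; the slides do not describe how WebTheme stores link data.

```python
def link_view(links):
    """Split a page -> outgoing-links mapping into arrows and circles.

    arrows:  (source, target) pairs, one per link from one page to another
    circles: pages that have no outgoing links
    """
    arrows = sorted(
        (source, target)
        for source, targets in links.items()
        for target in targets
    )
    circles = sorted(page for page, targets in links.items() if not targets)
    return arrows, circles


# Hypothetical harvested pages and the pages they link to.
links = {
    "home.html": {"tools.html", "contact.html"},
    "tools.html": {"home.html"},
    "contact.html": set(),
}
arrows, circles = link_view(links)
```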
26 Probe Tool
  To Use, Select the Probe Button ( + P)
  Left Click to Probe Region
  Shows a Graphical Representation of Topics at Designated Location
  Value Indicates Relative Topic Strength
27 Gisting Tool
To Use
  Select Documents to Gist
  Click the Gist Button (% )
Shows Top 50 Topics in Selected Documents:
  Identify Terms of Interest
  Copy Terms to Clipboard Window
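The slides do not explain how the Gisting Tool ranks topics. As a stand-in, the sketch below ranks terms in the selected documents by raw frequency and keeps the top 50. The `gist` function and its small stopword list are assumptions made for illustration, not WebTheme's method.

```python
import re
from collections import Counter

# A deliberately tiny stopword list, assumed for this example only.
STOPWORDS = frozenset({"a", "an", "and", "in", "of", "the", "to"})


def gist(documents, top_n=50):
    """Return the top_n most frequent non-stopword terms across documents."""
    counts = Counter()
    for text in documents:
        words = re.findall(r"[a-z]+", text.lower())
        counts.update(word for word in words if word not in STOPWORDS)
    return counts.most_common(top_n)


selected = [
    "The solar panel and the solar cell",
    "Solar energy in cells",
]
top_terms = gist(selected)
```

Each entry in `top_terms` is a `(term, count)` pair, so terms of interest can be picked out and copied into a follow-up query.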
29 Query Tool
Word Query
  Selects Documents that Contain All Query Words
  Click Group Results to create a set that contains search terms
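The "contain all query words" rule is a Boolean AND over the query terms. A minimal sketch, assuming documents are held in a dictionary of ID to text (the `word_query` and `tokenize` names are hypothetical):

```python
import re


def tokenize(text):
    """Lowercase the text and return its set of alphanumeric words."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def word_query(documents, query):
    """Return the IDs of documents that contain every word in the query.

    Note: an empty query matches every document, because the empty set
    is a subset of any set.
    """
    terms = tokenize(query)
    return {
        doc_id
        for doc_id, text in documents.items()
        if terms <= tokenize(text)
    }


documents = {
    "a": "Information visualization at PNNL",
    "b": "Visualization tools",
    "c": "PNNL information systems",
}
matches = word_query(documents, "information pnnl")
```

The returned set of IDs plays the role of the group created by Group Results.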
30 Query Tool
Query By Example
  Looks for text similar to the example
  Determines Location of Greatest Term Strength
  Use Slider to Increase Number Selected
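The slides do not name the similarity measure behind Query By Example. One common way to find text similar to an example is cosine similarity over term counts, sketched below; the `threshold` parameter stands in for the slider, so lowering it selects more documents. All function names are hypothetical, and the sketch does not attempt the "location of greatest term strength" step.

```python
import math
import re
from collections import Counter


def vectorize(text):
    """Represent text as a bag of lowercase word counts."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two term-count vectors (0.0 if either is empty)."""
    dot = sum(a[term] * b[term] for term in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def query_by_example(documents, example, threshold=0.5):
    """Return document IDs at least `threshold` similar to the example, best first."""
    example_vector = vectorize(example)
    scored = [
        (cosine(example_vector, vectorize(text)), doc_id)
        for doc_id, text in documents.items()
    ]
    return [doc_id for score, doc_id in sorted(scored, reverse=True) if score >= threshold]
```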
31 Group Tool
Create subsets
  Retrieved from Query
  Selected in Galaxy
Dots Change Color to Reflect Group Membership
Combine Sets
  Select
  Disjunction
  Intersection
  Union
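Combining groups maps naturally onto set operations. The sketch below shows union and intersection with Python sets. The slide does not define what "Disjunction" means in WebTheme, so the `difference` and `symmetric_difference` modes are only possible readings, included as assumptions; `combine_groups` is a hypothetical name.

```python
def combine_groups(a, b, mode):
    """Combine two groups of document IDs (illustrative only)."""
    if mode == "union":
        return a | b  # documents in either group
    if mode == "intersection":
        return a & b  # documents in both groups
    if mode == "difference":
        return a - b  # assumed reading: in the first group but not the second
    if mode == "symmetric_difference":
        return a ^ b  # assumed reading: in exactly one of the two groups
    raise ValueError(f"unknown mode: {mode}")


# Hypothetical groups: one retrieved from a query, one selected in the Galaxy.
from_query = {"doc1", "doc2", "doc3"}
from_galaxy = {"doc3", "doc4"}
both = combine_groups(from_query, from_galaxy, "intersection")
```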
32 ThemeView
33 When to Reprocess
You Get Too Many Clusters that Are Too Similar
  Reduce Number of Clusters Requested
You Get Big Clusters with Too Many Unrelated Documents
  Increase Number of Clusters Requested
34 Questions?
Mary Frances Lembo, mf.lembo@pnl.gov
Karen Buxton, karen.buxton@pnl.gov
Battelle, U.S. Department of Energy, Pacific Northwest National Laboratory