1 Scalable Visual Comparison of Biological Trees and Sequences Tamara Munzner University of British Columbia Department of Computer Science Imager.

Slides:



Advertisements
Similar presentations
1 TreeJuxtaposer side by side comparison of evolutionary trees.
Advertisements

1 TreeJuxtaposer: Scalable Tree Comparison using Focus+Context with Guaranteed Visibility Tamara Munzner Univ. British Columbia François Guimbretière Univ.
SequenceJuxtaposer: Fluid Navigation For Large-Scale Sequence Comparison in Context James Slack *, Kristian Hildebrand *†, Tamara Munzner * and Katherine.
H3: Laying Out Large Directed Graphs in 3D Hyperbolic Space Tamara Munzner, Stanford University.
Ray Tracing CMSC 635. Basic idea How many intersections?  Pixels  ~10 3 to ~10 7  Rays per Pixel  1 to ~10  Primitives  ~10 to ~10 7  Every ray.
Scalable Visualization with Accordion Drawing Tamara Munzner University of British Columbia Department of Computer Science joint work with James Slack,
Presented by Konstantinos Georgiadis. Abstract This method extends the Hierarchical Radiosity approach for environments whose geometry and surface attributes.
Small Displays Nicole Arksey Information Visualization December 5, 2005 My new kitty, Erwin.
1 Scalable Visual Comparison of Biological Trees and Sequences Tamara Munzner University of British Columbia Department of Computer Science Imager.
1 Information Visualization at UBC Tamara Munzner University of British Columbia.
MDSteer: Steerable and Progressive Multidimensional Scaling Matt Williams and Tamara Munzner University of British Columbia Imager Lab.
University of British Columbia Department of Computer Science Tamara Munzner Interactive Visualization of Evolutionary Trees and Gene Sequences February.
1 Information Visualization with Accordion Drawing Tamara Munzner University of British Columbia.
University of British Columbia Department of Computer Science Tamara Munzner Visualization: From Pixels to Insight March 3, 2007 UBC CS TechTrek.
Introduction to Data-driven Animation Jinxiang Chai Computer Science and Engineering Texas A&M University.
1 Visualization Process and Collaboration Tamara Munzner Department of Computer Science University of British Columbia
1 Presented by Jean-Daniel Fekete. 2  Motivation  Mélange [Elmqvist 2008] Multiple Focus Regions.
Presented by Marlene Shehadeh Advanced Topics in Computer Vision ( ) Winter
Composite Rectilinear Deformation for Stretch and Squish Navigation James Slack, Tamara Munzner University of British Columbia November 1, 2006 Imager.
Evaluation Adam Bodnar CPSC 533C Monday, April 5, 2004.
“Occlusion” Prepared by: Shreya Rawal 1. Extending Distortion Viewing from 2D to 3D S. Carpendale, D. J. Cowperthwaite and F. David Fracchia (1997) 2.
Tree-Maps: A Space-Filling Approach to the Visualization of Hierarchical Information Structures Brian Johnson Ben Shneiderman (HCIL TR 91-06) Steve Betten.
Live Re-orderable Accordion Drawing (LiveRAC) Peter McLachlan, Tamara Munzner Eleftherios Koutsofios, Stephen North AT&T Research Symposium August, 2007.
Robust Real-time Object Detection by Paul Viola and Michael Jones ICCV 2001 Workshop on Statistical and Computation Theories of Vision Presentation by.
School of Computer Science and Software Engineering A Networked Virtual Environment Communications Model using Priority Updating Monash University Yang-Wai.
Large Scale Radial Graph Drawing Cody Robson. Graph Exploration Radial Layout: Constrains graph to rings User can shift focus to avoid panning.
The Table Lens: Merging Graphical and Symbolic Representations in an Interactive Focus + Context Visualization for Tabular Information R. Rao and S. K.
1 Scalable Visual Comparison of Biological Trees and Sequences Tamara Munzner University of British Columbia Department of Computer Science Imager.
DEPARTMENT OF COMPUTER SCIENCE SOFTWARE ENGINEERING, GRAPHICS, AND VISUALIZATION RESEARCH GROUP 15th International Conference on Information Visualisation.
1 Biology Design Studies comparative functional genomics –Pathline comparative genomics (synteny) –MizBee gene expression + interaction network –Cerebral.
SpaceTree: Supporting Exploration in Large Node Link Tree, Design Evolution and Empirical Evaluation Catherine Plaisant, Jesse Grosjean, Benjamin B.Bederson.
Partitioned Rendering Infrastructure with Stable Accordion Navigation James Slack MSc Thesis Presentation April 4, 2005.
1 PRISAD: A Partitioned Rendering Infrastructure for Scalable Accordion Drawing James Slack, Kristian Hildebrand, Tamara Munzner University of British.
Tree Structures (Hierarchical Information) cs5764: Information Visualization Chris North.
Evaluation of Viewport Size and Curvature of Large, High-Resolution Displays Lauren Shupp, Robert Ball, John Booker, Beth Yost, Chris North Virginia Polytechnic.
Navigating and Browsing 3D Models in 3DLIB Hesham Anan, Kurt Maly, Mohammad Zubair Computer Science Dept. Old Dominion University, Norfolk, VA, (anan,
By LaBRI – INRIA Information Visualization Team. Tulip 2010 – version Tulip is an information visualization framework dedicated to the analysis.
Pad++ A Zooming Graphical Sketchpad for Exploring Alternative Interface Physics Benjamin B. Bederson, James D. Hollan, Ken Perlin, Jonathan Meyer, David.
Accelerated Focused Crawling Through Online Relevance Feedback Soumen Chakrabarti, IIT Bombay Kunal Punera, IIT Bombay Mallela Subramanyam, UT Austin.
A Focus+Context Technique Based on Hyperbolic Geometry for Visualizing Large Hierarchies. John Lamping, Ramana Rao, and Peter Pirolli Xerox Palo Alto Research.
The X-Tree An Index Structure for High Dimensional Data Stefan Berchtold, Daniel A Keim, Hans Peter Kriegel Institute of Computer Science Munich, Germany.
IAT 814 Trees Chapter 3.2 of Spence ______________________________________________________________________________________ SCHOOL OF INTERACTIVE ARTS +
CHAPTER TEN AUTHORING.
IAT Overview + Detail Focus + Context ______________________________________________________________________________________ SCHOOL OF INTERACTIVE.
Intuitive Database Query System, Zooming Query Results Previews Drawing upon existing literature on zooming interface technology, intuitive navigation.

Real-time Graphics for VR Chapter 23. What is it about? In this part of the course we will look at how to render images given the constrains of VR: –we.
Interactive Visualizations for Biodiversity Information Bongshin Lee Researcher Visualization and Interaction Research Group Microsoft Research Bongshin.
Distributed Information Systems. Motivation ● To understand the problems that Web services try to solve it is helpful to understand how distributed information.
Fall 2002CS/PSY Information Visualization 2 Case Study: Portraying Hierarchies Visualizing hierarchies  Variety of techniques Traditional tree views,
3D Viewers Two main uses: –Detector/event exploration – interactivity priority (15fps min). –Generate presentation material (still/movie renders) – quality.
Jinwook Seo Pad++: A Zoomable Graphical Sketchpad For Exploring Alternative Interface Physics Benjamin B. Bederson, James D. Hollan, Ken Perlin, Jonathan.
Pad++: A Zooming Graphical Interface for Exploring Alternate Interface Physics Presented By: Daniel Loewus-Deitch.
1D & 2D Spaces for Representing Data Mao Lin Huang.
Pad++: A Zoomable Graphical User Interface. Motivations View information at multiple scales –Semantic zooming Tap into natural spatial ways of thinking.
Coherent Hierarchical Culling: Hardware Occlusion Queries Made Useful Jiri Bittner 1, Michael Wimmer 1, Harald Piringer 2, Werner Purgathofer 1 1 Vienna.
1 An Efficient Optimal Leaf Ordering for Hierarchical Clustering in Microarray Gene Expression Data Analysis Jianting Zhang Le Gruenwald School of Computer.
1 A Methodology for automatic retrieval of similarly shaped machinable components Mark Ascher - Dept of ECE.
CHC ++: Coherent Hierarchical Culling Revisited Oliver Mattausch, Jiří Bittner, Michael Wimmer Institute of Computer Graphics and Algorithms Vienna University.
3D Object Representations 2009, Fall. Introduction What is CG?  Imaging : Representing 2D images  Modeling : Representing 3D objects  Rendering : Constructing.
Concept Relationship Editor: A visual interface to support the assertion of synonymy relationships between taxonomic classifications Paul Craig & Jessie.
An Evaluation of Pan & Zoom and Rubber Sheet Navigation with and without an Overview Dmitry Nekrasovski, Adam Bodnar, Joanna McGrenere, François Guimbretière,
Human Computer Interaction Lecture 21 User Support
Real-Time Soft Shadows with Adaptive Light Source Sampling
Human Computer Interaction Lecture 21,22 User Support
3D Object Representations
Hierarchies (Trees) Definition Examples
cs5764: Information Visualization Chris North
Scalable Visual Comparison of Biological Trees and Sequences
François Guimbretière
Presentation transcript:

1 Scalable Visual Comparison of Biological Trees and Sequences Tamara Munzner University of British Columbia Department of Computer Science Imager

2 Outline TreeJuxtaposer –tree comparison Accordion Drawing –information visualization technique SequenceJuxtaposer –sequence comparison PRISAD –generic accordion drawing framework Evaluation –comparing AD to pan/zoom, with/without overview

3 Phylogenetic/Evolutionary Tree M Meegaskumbura et al., Science 298:379 (2002)

4 Common Dataset Size Today M Meegaskumbura et al., Science 298:379 (2002)

5 Future Goal: 10M Node Tree of Life David Hillis, Science 300:1687 (2003) Plants Protists Fungi Animals You are here

6 Paper Comparison: Multiple Trees focus context

7 TreeJuxtaposer side by side comparison of evolutionary trees –[video] –software downloadable from [TreeJuxtaposer: Scalable Tree Comparison using Focus+Context with Guaranteed Visibility. Tamara Munzner, François Guimbretière, Serdar Tasiran, Li Zhang, Yunhong Zhou. Proc SIGGRAPH 2003]

8 Related Work: Tree Browsing general –Cone Trees [Robertson et al 91] –Hyperbolic Trees [Lamping 94] –H3 [Munzner 97] –Hierarchical Clustering Explorer [Seo & Shneiderman 02] –SpaceTree [Plaisant et al 02] –DOI Tree [Card and Nation 02] phylogenetic trees –TreeWiz [Rost and Bornberg-Bauer 02] –TaxonTree [Lee et al 04]

9 Related Work: Comparison tree comparison –RF distance [Robinson and Foulds 81] –perfect node matching [Day 85] visual tree comparison –creation/deletion only [Chi and Card 99] –leaves only [Graham and Kennedy 01] subsequent work –DoubleTree [Parr et al 04]

10 TJ Contributions first interactive tree comparison system –automatic structural difference computation scalable to large datasets –250,000 to 500,000 total nodes –all preprocessing subquadratic –all realtime rendering sublinear items to render >> number of available pixels scalable to large displays (4000 x 2000) introduced accordion drawing

11 Outline TreeJuxtaposer –tree comparison Accordion Drawing –information visualization technique SequenceJuxtaposer –sequence comparison PRISAD –generic accordion drawing framework Evaluation –comparing AD to pan/zoom, with/without overview

12 Accordion Drawing rubber-sheet navigation –stretch out part of surface, the rest squishes –borders nailed down –Focus+Context technique integrated overview, details –old idea [Sarkar et al 93], [Robertson et al 91] guaranteed visibility –marks always visible –important for scalability –new idea [Munzner et al 03]

13 Guaranteed Visibility marks are always visible –regions of interest shown with color highlights –search results, structural differences, user specified easy with small datasets

14 Guaranteed Visibility Challenges hard with larger datasets reasons a mark could be invisible

15 Guaranteed Visibility Challenges hard with larger datasets reasons a mark could be invisible –outside the window AD solution: constrained navigation

16 Guaranteed Visibility Challenges hard with larger datasets reasons a mark could be invisible –outside the window AD solution: constrained navigation –underneath other marks AD solution: avoid 3D

17 Guaranteed Visibility Challenges hard with larger datasets reasons a mark could be invisible –outside the window AD solution: constrained navigation –underneath other marks AD solution: avoid 3D –smaller than a pixel AD solution: smart culling

18 Guaranteed Visibility: Small Items Naïve culling may not draw all marked items GVno GV Guaranteed visibility of marks No guaranteed visibility

19 Guaranteed Visibility: Small Items Naïve culling may not draw all marked items GVno GV Guaranteed visibility of marks No guaranteed visibility

20 Guaranteed Visibility Rationale relief from exhaustive exploration –missed marks lead to false conclusions –hard to determine completion –tedious, error-prone compelling reason for Focus+Context –controversy: does distortion help or hurt? –strong rationale for comparison infrastructure needed for efficient computation

21 Related Work multiscale zooming –Pad++ [Bederson and Hollan 94] multiscale visibility –space-scale diagrams [Furnas & Bederson 95] –effective view navigation [Furnas 97] –critical zones [Jul and Furnas 98]

22 Outline TreeJuxtaposer –tree comparison Accordion Drawing –information visualization technique SequenceJuxtaposer –sequence comparison PRISAD –generic accordion drawing framework Evaluation –comparing AD to pan/zoom, with/without overview

23 Genomic Sequences multiple aligned sequences of DNA investigate benefits of accordion drawing –showing multiple focus areas in context –smooth transitions between states –guaranteed visibility for globally visible landmarks now commonly browsed with web apps –zoom and pan with abrupt jumps

24 Related Work web based, database driven, multiple tracks –Ensembl [Hubbard 02] –UCSC Genome Browser [Kent 02] –NCBI [Wheeler 02] client side approaches –Artemis [Rutherford et al 00] –BARD [Spell et al 03] –PhyloVISTA [Shah et al 03]

25 SequenceJuxtaposer side by side comparison of multiple aligned gene sequences [video], software downloadable from [SequenceJuxtaposer: Fluid Navigation For Large-Scale Sequence Comparison In Context. James Slack, Kristian Hildebrand, Tamara Munzner, and Katherine St. John. Proc. German Conference on Bioinformatics 2004]

26 Searching search for motifs –protein/codon search –regular expressions supported results marked with guaranteed visibility

27 Differences explore differences between aligned pairs –slider controls difference threshold in realtime –standard difference algorithm, not novel results marked with guaranteed visibility

28 SJ Contributions fluid tree comparison system –showing multiple focus areas in context –guaranteed visibility of marked areas thresholded differences, search results scalable to large datasets –2M nucleotides –all realtime rendering sublinear

29 Outline TreeJuxtaposer –tree comparison Accordion Drawing –information visualization technique SequenceJuxtaposer –sequence comparison PRISAD –generic accordion drawing framework Evaluation –comparing AD to pan/zoom, with/without overview

30 Scaling Up: TJC/TJC-Q TJC: 15M nodes –no quadtree –picking with new hardware feature requires HW multiple render target support TJC-Q: 5M nodes –lightweight quadtree for picking support both support tree browsing only –no comparison data structures [Scalable, Robust Visualization of Large Trees Dale Beermann, Tamara Munzner, Greg Humphreys. Proc. EuroVis 2005]

31 Generic Infrastructure: PRISAD generic AD infrastructure PRITree is TreeJuxtaposer using PRISAD PRISeq is SequenceJuxtaposer using PRISAD efficiency –faster rendering: minimize overdrawing –smaller memory footprint correctness –rendering with no gaps: eliminate overculling [Partitioned Rendering Infrastructure for Scalable Accordion Drawing. James Slack, Kristian Hildebrand, and Tamara Munzner. Proc. InfoVis 2005 extended version: Information Visualization, to appear]

32 Navigation generic navigation infrastructure –application independent –uses deformable grid –split lines grid lines define object boundaries –horizontal and vertical separate independently movable

33 Split Line Hierarchy data structure supports navigation, picking, drawing two interpretations –linear ordering –hierarchical subdivision ABCDEF

34 PRISAD Architecture world-space discretization preprocessing initializing data structures placing geometry screen-space rendering frame updating analyzing navigation state drawing geometry

35 Partitioning partition object set into bite-sized ranges –using current split line screen-space positions required for every frame –subdivision stops if region smaller than 1 pixel or if range contains only 1 object [1,2] [3,4] [5] { [1,2], [3,4], [5] } Queue of ranges

36 Seeding reordering range queue result from partition –marked regions get priority in queue drawn first to provide landmarks [1,2] [3,4] [5] { [1,2], [3,4], [5] } { [3,4], [5], [1,2] } ordered queue

37 Drawing Single Range each enqueued object range drawn according to application geometry –selection for trees –aggregation for sequences

38 PRITree Range Drawing select suitable leaf in each range draw path from leaf to the root –ascent-based tree drawing –efficiency: minimize overdrawing only draw one path per range [3,4] { [3,4], [5], [1,2] }

39 Rendering Dense Regions –correctness: eliminate overculling bad leaf choices would result in misleading gaps –efficiency: maximize partition size to reduce rendering too much reduction would result in gaps Intended renderingPartition size too big

40 Rendering Dense Regions –correctness: eliminate overculling bad leaf choices would result in misleading gaps –efficiency: maximize partition size to reduce rendering too much reduction would result in gaps Intended renderingPartition size too big

41 PRITree Skeleton guaranteed visibility of marked subtrees during progressive rendering first frame: one path per marked group full scene: entire marked subtrees

42 PRISeq Range Drawing: Aggregation aggregate range to select box color for each sequence –random select to break ties AACC ATTT [1,4] A T TTT C T

43 PRISeq Range Drawing collect identical nucleotides in column –form single box to represent identical objects attach to split line hierarchy cache lazy evaluation draw vertical column A T T T A { A:[1,1], T:[2,3] } 1 2 3

44 PRITree Rendering Time Performance TreeJuxtaposer renders all nodes for star trees branching factor k leads to O(k) performance

45 PRITree Rendering Time Performance TreeJuxtaposer renders all nodes for star trees branching factor k leads to O(k) performance

46 PRITree Rendering Time Performance InfoVis 2003 Contest dataset 5x rendering speedup

47 PRITree Rendering Time Performance a closer look at the fastest rendering times

48 PRITree Rendering Time Performance

49 PRITree handles 4 million nodes in under 0.4 seconds TreeJuxtaposer takes twice as long to render 1 million nodes Detailed Rendering Time Performance

50 Detailed Rendering Time Performance TreeJuxtaposer valley from overculling

51 Memory Performance linear memory usage for both applications 4-5x more efficient for synthetic datasets

52 Performance Comparison PRITree vs. TreeJuxtaposer –detailed benchmarks against identical TJ functionality 5x faster, 8x smaller footprint handles over 4M node trees PRISeq vs. SequenceJuxtaposer –15x faster rendering, 20x smaller memory size –44 species * 17K nucleotides = 770K items –6400 species * 6400 nucleotides = 40M items

53 PRISAD Contributions infrastructure for efficient, correct, and generic accordion drawing efficient and correct rendering –screen-space partitioning tightly bounds overdrawing and eliminates overculling first generic AD infrastructure –PRITree renders 5x faster than TJ –PRISeq renders 20x larger datasets than SJ future work –editing support

54 Outline TreeJuxtaposer –tree comparison Accordion Drawing –information visualization technique SequenceJuxtaposer –sequence comparison PRISAD –generic accordion drawing framework Evaluation –comparing AD to pan/zoom, with/without overview

55 Evaluation evaluate RSN navigation technique –compare to conventional pan/zoom clarify utility of overviews for navigation –why add overview to F+C? Need evidence to support or refute common InfoVis assumption regarding usefulness of overviews [An Evaluation of Pan & Zoom and Rubber Sheet Navigation with and without an Overview. Dmitry Nekrasovski, Adam Bodnar, Joanna McGrenere, François Guimbretière, and Tamara Munzner. Proc. SIGCHI 06.

56 Conventional Pan & Zoom (PZN) navigation via panning (translation) and zooming (uniform scale changes) easy to lose context and become lost Selecting region to zoomZooming result

57 Overviews separate global view of the dataset maintain contextual awareness force attention split between views

58 Rubber Sheet Navigation (RSN) Focus + Context technique stretching and squishing rubber sheet metaphor maintain contextual awareness in single view Selecting region to zoomZooming result

59 Previous Findings Mixed mixed results for navigation and overviews speed: F+C faster than PZN [Schaffer et al., 1996; Gutwin and Skopik, 2003] accuracy: PZN more accurate than F+C [Hornbaek and Frokjaer, 2001; Gutwin and Fedak, 2004] preference: Overviews generally preferred [Beard and Walker, 1990; Plaisant et al., 2002]

60 Dataset Motivating domain: evolutionary biology –large datasets, clear tasks –require understanding topological structure at different places and scales 5,918 node binary tree –Leaves are species, internal nodes are ancestors

61 Task Generalized version requiring no specialized knowledge of evolutionary trees (no labels) Compare topological distance between marked nodes Requires multiple navigation actions to complete Several instances isomorphic in difficulty

62 Experiment Interfaces Common visual representation and interaction model –Lacking in majority of previous evaluations Common set of navigation actions Guarantee visibility of areas of interest

63 RSN

64 PZN

65 RSN + Overview

66 PZN + Overview

67 Guaranteed Visibility PZN –Implemented in PZN similarly to Halo [Baudisch et al., 2003] RSN –Implicit as areas of interest compressed along bounds of display Sub-pixel marked regions always drawn using PRISAD framework [Slack et al., 2005]

68 Hypotheses H1 - RSN performs better than PZN independent of overview presence H2 - For RSN, presence of overview does not result in better performance H3 - For PZN, presence of overview results in better performance

69 Design 2 (navigation, between) x 2 (presence of overview, between) x 7 (blocks, within) Each block contained 5 randomized trials 40 subjects, each randomly assigned to each interface

70 Procedure and Measures Training protocols used to train subjects in effective strategies to solve task Subjects completed 35 trials (7 blocks x 5 trials), each isomorphic in difficulty Completion time, navigation actions, resets, errors, and subjective NASA-TLX workload

71 Results - Navigation PZN outperformed RSN (p < 0.001) Learning effect shows performance plateau Subjects using PZN performed fewer navigation actions and fewer resets Subjects using PZN reported less mental demand (p < 0.05)

72 Results – Presence of Overview No effect on any performance measure Subjects using overviews reported less physical demand and more enjoyment (p < 0.05)

73 Summary of Results H1 - RSN performs better than PZN independent of overview presence No – PZN outperformed RSN H2 - For RSN, presence of overview does not result in better performance Yes – No effect of overview on performance H3 - For PZN, presence of overview results in better performance No – No effect of overview on performance

74 Discussion – Navigation Performance differences cannot be ascribed to unfamiliarity with the techniques Design guidelines for PZN extensively studied, but not so for F+C or RSN

75 Discussion – Overviews Overviews for PZN and RSN: –No performance benefits –Preference for overview Overview may act as cognitive cushion –Provide subjective but not performance benefits Guaranteed visibility may provide same benefits as overviews

76 Evaluation Conclusions First evaluation comparing PZN and RSN techniques with and without an overview Performance: –PZN faster and more accurate than RSN Preference: –Overviews preferred, but no performance benefits

77 Other Projects Focus+Context evaluation –low-level visual search and visual memory graph drawing –TopoLayout: multi-level decomposition and layout using topological features dimensionality reduction –MDSteer: progressive and steerable MDS papers, talks, videos available from