Download presentation
Presentation is loading. Please wait.
Published byGladys Casey Modified over 9 years ago
1
IntroDefinitionSizeComplexityWrap-up 1/54 Individual Big Data Visual Analytics: Challenges and Opportunities Remco Chang and Eli Brown Tufts University
2
IntroDefinitionSizeComplexityWrap-up 2/54 Individual Talk Outline Visual Analytics + Big Data: 1.What is Big Data Visual Analytics? Definition and Problem Statement 2.How to Visualize Large Amounts of Data? 3.Tufts Research on Individual Differences 4.How to Visualize High Dimensional Data?
3
IntroDefinitionSizeComplexityWrap-up 3/54 Individual 1. What is Big Data Visual Analytics? A Definition and Problem Statement
4
IntroDefinitionSizeComplexityWrap-up 4/54 Individual Defining Big Data for Visual Analytics Let’s say that I have a billion data items, is that Big Data? What if: – These data items only have two attributes (e.g., latitude, longitude)? – If I transpose this dataset such that I have two rows of data, but with a billion attributes?
5
IntroDefinitionSizeComplexityWrap-up 5/54 Individual Defining Big Data for Visual Analytics Big Data is NOT just about the size of your data For the purpose of this talk, let’s talk about Big Data in the following way: – Size: The number of rows (n) Assume the amount of data cannot fit into a desktop computer’s memory – Complexity: The number of attributes (k) Assume (k > 2)
6
IntroDefinitionSizeComplexityWrap-up 6/54 Individual Problem Statements Considering the two together is too difficult, so we’ll tackle the two issues independently for now Our goal is to visualize (large| complex) data sets while: – Maintaining interactivity: rendering at 10 fps – Allowing for operations on the data (zoom, pivot, etc)
7
IntroDefinitionSizeComplexityWrap-up 7/54 Individual 2. How to Visualize Large Amount of Data?
8
IntroDefinitionSizeComplexityWrap-up 8/54 Individual Problem Statement Visualization on a Commodity Hardware Large Data in a Data Warehouse
9
IntroDefinitionSizeComplexityWrap-up 9/54 Individual Problem Statement Constraint: Data is too big to fit into the memory or hard drive of the personal computer – Note: Ignoring various database technologies (OLAP, Column-Store, No-SQL, Array-Based, etc) Classic Computer Science Problem… What are some previous techniques? – Truncate (sample, filter) – Resolution reduction (“blurring”, image zooming) – Stream (think Netflix, Hulu) – Pre-fetch (think open world 3D video games)
10
IntroDefinitionSizeComplexityWrap-up 10/54 Individual Pros and Cons: Truncate Truncate (sample, filter) – Pros: Easy to implement; efficient; scalable – Cons: Sampling is often data- or task-dependent Sampling Algorithm
11
IntroDefinitionSizeComplexityWrap-up 11/54 Individual Pros and Cons: Resolution Reduction Resolution reduction (“blurring”) – Pros: Allows hierarchical navigations – Cons: Fine details are often lost, not all data types can be easily blurred (order-invariant data)
12
IntroDefinitionSizeComplexityWrap-up 12/54 Individual Pros and Cons: Streaming Stream [Fisher et al. CHI 2012] – Pros: Query can be terminated at any time – Cons: It is inefficient on the database end t = 1 second t = 5 minute Fisher et al., Trust Me, I'm Partially Right: Incremental Visualization Lets Analysts Explore Large Datasets Faster. CHI 2012
13
IntroDefinitionSizeComplexityWrap-up 13/54 Individual Pros and Cons: Pre-Fetch Pre-fetch – Pros: Seamless to the user – Cons: Predicting the future is kind of hard Possible in 3D games because of limited degrees of freedom http://www.youtube.com/watch?v=n27NLuc44Lk
14
IntroDefinitionSizeComplexityWrap-up 14/54 Individual Pros and Cons: Pre-Fetch Pre-fetch in Visual Analytics [Chan, Hanrahan, 2008 VAST] – Limit the types of operations a user can do – Allows interactive analysis of over a billion data points Chan et al.,. Maintaining Interactivity While Exploring Massive Time Series. IEEE VAST 2008
15
IntroDefinitionSizeComplexityWrap-up 15/54 Individual Research at Tufts: User-Centric Pre-Fetching Joint work with Caroline Ziemkiewicz, Alvitta Ottley
16
IntroDefinitionSizeComplexityWrap-up 16/54 Individual Motivation
17
IntroDefinitionSizeComplexityWrap-up 17/54 Individual Individual Differences and Interaction Pattern Existing research shows that all the following factors affect how someone uses a visualization: – Spatial Ability – Cognitive Workload/Mental Demand – Personality – Experience (novice vs. expert) – Emotional State – Perceptual Speed – … and more
18
IntroDefinitionSizeComplexityWrap-up 18/54 Individual Preliminary Study – Novice v. Expert Novice vs. Expert financial experts use of the WireVis system when searching for fraud – Novice exhibited “breadth-first-search” behaviors – Experts exhibited “depth-first-search” behaviors Our next step is to use Machine Learning methods to distinguish a user by analyzing their interactions in real-time
19
IntroDefinitionSizeComplexityWrap-up 19/54 Individual Preliminary Study – Locus of Control Identified the personality factor, Locus of Control (LOC), as a predictor for how a user interacts with the following visualizations:
20
IntroDefinitionSizeComplexityWrap-up 20/54 Individual Results When with list view compared to containment view, internal LOC users are: – faster (by 70%) – more accurate (by 34%) Only for complex (inferential) tasks The speed improvement is about 2 minutes (116 seconds) R. Chang et al., How Locus of Control Influences Compatibility with Visualization Style, IEEE VAST 2011. R. Chang et al., How Visualization Layout Relates to Locus of Control and Other Personality Factors. TVCG 2012. To Appear.
21
IntroDefinitionSizeComplexityWrap-up 21/54 Individual Cognitive / Affective Priming
22
IntroDefinitionSizeComplexityWrap-up 22/54 Individual LOC Priming Visual Form List-View Containment Performance Poor Good Internal LOC External LOC Average ->Internal Average LOC R. Chang et al., Poster: Priming locus of control to affect performance. VAST Poster 2012.
23
IntroDefinitionSizeComplexityWrap-up 23/54 Individual Affective Priming on Visual Judgment R. Chang et al., Influencing Visual Judgment Through Affective Priming, CHI 2013. To Appear
24
IntroDefinitionSizeComplexityWrap-up 24/54 Individual Affective Priming on Visual Judgment R. Chang et al., Influencing Visual Judgment Through Affective Priming, CHI 2013. To Appear
25
IntroDefinitionSizeComplexityWrap-up 25/54 Individual Preliminary Study – Using Brain Sensing (fNIRS) Functional Near-Infrared Spectroscopy a lightweight brain sensing technique measures mental demand (working memory) R. Chang et al., Using fNIRS Brain Sensing to Evaluate Information Visualization Interfaces. CHI 2013. To Appear
26
IntroDefinitionSizeComplexityWrap-up 26/54 Individual This is Your Brain on Bar graphs and Pie Charts 3-back test
27
IntroDefinitionSizeComplexityWrap-up 27/54 Individual Quick Summary Pre-Fetching is a promising approach for supporting interactive visual analysis of large amounts of data Our “User-Centric” approach is three-pronged: – Understand the user’s cognitive “traits” (e.g., LOC, Numeracy, Spatial Ability, etc.) – Understand the user’s cognitive “states” (Cognitive Load, Affect, etc.) – Alter the user’s behavior by influencing cognitive traits and states through priming
28
IntroDefinitionSizeComplexityWrap-up 28/54 Individual 3. How to Visualize Complex (High-Dimensional) Data?
29
IntroDefinitionSizeComplexityWrap-up 29/54 Individual Why is This Problem Hard? You can only see 2D because Your monitor is 2D In other words: you can show at most 2 dimensional data. Everything else is a hack.
30
IntroDefinitionSizeComplexityWrap-up 30/54 Individual Ways to Visualize k-Dimensional Data Two primary ways to do this “hack” – Divide up the 2D screen into multiple 2D regions Showing no correlation between dimensions Showing k-1 correlations Showing all pair-wise correlations – Project k-Dimensional Data into 2D 3D to 2D k-D projection
31
IntroDefinitionSizeComplexityWrap-up 31/54 Individual Ways to Visualize k-Dimensional Data Divide up the 2D screen into multiple 2D regions – Showing no correlation between dimensions – Showing k-1 correlations – Showing all pair-wise correlations Project k-Dimensional Data into 2D – 3D to 2D – k-D projection
32
IntroDefinitionSizeComplexityWrap-up 32/54 Individual Ways to Visualize k-Dimensional Data Divide up the 2D screen into multiple 2D regions – Showing no correlation between dimensions – Showing k-1 correlations – Showing all pair-wise correlations Project k-Dimensional Data into 2D – 3D to 2D – k-D projection Parallel Coordinates
33
IntroDefinitionSizeComplexityWrap-up 33/54 Individual Ways to Visualize k-Dimensional Data Divide up the 2D screen into multiple 2D regions – Showing no correlation between dimensions – Showing k-1 correlations – Showing all pair-wise correlations Project k-Dimensional Data into 2D – 3D to 2D – k-D projection Scatterplot Matrix
34
IntroDefinitionSizeComplexityWrap-up 34/54 Individual Ways to Visualize k-Dimensional Data Divide up the 2D screen into multiple 2D regions – Showing no correlation between dimensions – Showing k-1 correlations – Showing all pair-wise correlations Project k-Dimensional Data into 2D – 3D to 2D – k-D projection
35
IntroDefinitionSizeComplexityWrap-up 35/54 Individual Ways to Visualize k-Dimensional Data Divide up the 2D screen into multiple 2D regions – Showing no correlation between dimensions – Showing k-1 correlations – Showing all pair-wise correlations Project k-Dimensional Data into 2D – 3D to 2D – k-D projection
36
IntroDefinitionSizeComplexityWrap-up 36/54 Individual Ways to Visualize k-Dimensional Data Divide up the 2D screen into multiple 2D regions – Showing no correlation between dimensions – Showing k-1 correlations – Showing all pair-wise correlations Project k-Dimensional Data into 2D – 3D to 2D – k-D projection Example Projection Methods: (Dimension Reduction) PCA MDS LDA LLE Many others! Usually, try to preserve distances in 2D as they exist in k-D
37
IntroDefinitionSizeComplexityWrap-up 37/54 Individual What We Have Done (at Tufts) We like projection methods because it is more scalable than the “divide the screen” methods iPCA – does interaction help understanding high dimensional data? – Demo Dis-Function – are interactions in 2D meaningful (recoverable) in k-D? – Switch to Eli
38
IntroDefinitionSizeComplexityWrap-up 38/54 Individual Summary
39
IntroDefinitionSizeComplexityWrap-up 39/54 Individual Summary Visual Analytics + Big Data: 1.Definition of Big Data Visual Analytics (Large | Complex) Data Analysis 2.How to Visualize Large Amounts Data? Pre-Fetching using individual differences and priming 3.How to Visualize High Dimensional Data? nD to 2D Projection Translating interactions from 2D to nD
40
IntroDefinitionSizeComplexityWrap-up 40/54 Individual
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.