ProvenanceIntroPersonalityPrimingDist FuncWrap-up 1/52 User-Centric Visual Analytics Remco Chang Tufts University.

Slides:



Advertisements
Similar presentations
1/26Remco Chang – Dagstuhl 14 Analyzing User Interactions for Data and User Modeling Remco Chang Assistant Professor Tufts University.
Advertisements

ProvenanceIntroLOCCog StateDist FuncWrap-up 1/52 User-Centric Visual Analytics Remco Chang Tufts University.
Principal Component Analysis Based on L1-Norm Maximization Nojun Kwak IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008.
Experiments and Variables
INTRODUCTION TO MODELING
LECTURE 10: ANALYTIC PROVENANCE April 6, 2015 COMP Topics in Visual Analytics Note: slide deck adapted from R. Chang.
Data Mining Methodology 1. Why have a Methodology  Don’t want to learn things that aren’t true May not represent any underlying reality ○ Spurious correlation.
Chapter 1 Introduction to Modeling DECISION MODELING WITH MICROSOFT EXCEL Copyright 2001 Prentice Hall.
EvaluationIntroVis/GfxInteractionWrap-up Thinking Interactively with Visualizations Remco Chang UNC Charlotte Charlotte Visualization Center.
FTP Biostatistics II Model parameter estimations: Confronting models with measurements.
Minnesota State Community and Technical College Critical Thinking Assignment Example and Assessment.
Chapter 1 Introduction to Modeling DECISION MODELING WITH MICROSOFT EXCEL Copyright 2001 Prentice Hall Publishers and Ardith E. Baker.
Overarching Goal: Understand that computer models require the merging of mathematics and science. 1.Understand how computational reasoning can be infused.
VALTChessVA IntroAppsWrap-up 1/25 User-Centric Visual Analytics Remco Chang Tufts University Department of Computer Science.
Dist FuncIntroVAAppsATGWrap-up 1/25 Visual Analytics Research at Tufts Remco Chang Assistant Professor Tufts University.
ProvenanceIntroApplicationPersonalityDist FuncWrap-up 1/36 User-Centric Visual Analytics Remco Chang Tufts University Department of Computer Science.
Optimal Design Laboratory | University of Michigan, Ann Arbor 2011 Design Preference Elicitation Using Efficient Global Optimization Yi Ren Panos Y. Papalambros.
1/26Remco Chang – PNNL 14 Analyzing User Interactions for Data and User Modeling Remco Chang Assistant Professor Tufts University.
Two Technique Papers on High Dimensionality Allan Rempel December 5, 2005.
Chapter 11 Artificial Intelligence and Expert Systems.
Research to Reality William Ribarsky Remco Chang University of North Carolina at Charlotte.
The Architecture Design Process
CONTENT BASED FACE RECOGNITION Ankur Jain 01D05007 Pranshu Sharma Prashant Baronia 01D05005 Swapnil Zarekar 01D05001 Under the guidance of Prof.
Chapter 14 The Second Component: The Database.
Experimental design – (The Scientific Method)
© 2010 IBM Corporation © 2011 IBM Corporation September 6, 2012 NCDHHS FAMS Overview for Behavioral Health Managed Care Organizations.
1/30Remco Chang – SEAri Workshop 15 Big Data Visual Analytics: A User Centric Approach Remco Chang Assistant Professor Tufts University.
SizeIntroDefinitionComplexityTuftsWrap-up 1/54 Big Data Visual Analytics: Challenges and Opportunities Remco Chang Tufts University.
Conducting a User Study Human-Computer Interaction.
8th Grade Earth and Space Science
Dist FuncIntroPersonalityProvenanceGroupWrap-up 1/40 User-Centric Visual Analytics Remco Chang Tufts University.
IntroDefinitionSizeComplexityWrap-up 1/54 Individual Big Data Visual Analytics: Challenges and Opportunities Remco Chang and Eli Brown Tufts University.
VALTVA IntroAppsWrap-up 1/16 Interactive Data Analysis and Model Exploration: A Visual Analytics Approach Remco Chang Tufts University Department of Computer.
11 C H A P T E R Artificial Intelligence and Expert Systems.
Big Idea 1: The Practice of Science Description A: Scientific inquiry is a multifaceted activity; the processes of science include the formulation of scientifically.
What are your interactions doing for your visualization? Remco Chang UNC Charlotte Charlotte Visualization Center.
Scientific Inquiry & Skills
1/20 (Big Data Analytics for Everyone) Remco Chang Assistant Professor Department of Computer Science Tufts University Big Data Visual Analytics: A User-Centric.
Observation & Analysis. Observation Field Research In the fields of social science, psychology and medicine, amongst others, observational study is an.
Fundamentals of Information Systems, Third Edition2 Principles and Learning Objectives Artificial intelligence systems form a broad and diverse set of.
Disclosure risk when responding to queries with deterministic guarantees Krish Muralidhar University of Kentucky Rathindra Sarathy Oklahoma State University.
VISUAL ANALYTICS: VISUAL EXPLORATION, ANALYSIS, AND PRESENTATION OF LARGE COMPLEX DATA Remco Chang, PhD (Charlotte Visualization Center) (Tufts University)
The Scientific Method. The Scientific Method is an orderly process used to investigate the natural world.
VALTVA IntroAppsWrap-up 1/34 User-Centric Visual Analytics Remco Chang Tufts University Department of Computer Science.
Where did plants and animals come from? How did I come to be?
Lesson Overview Lesson Overview What Is Science? Lesson Overview 1.1 What Is Science?
Introduction to Earth Science Section 2 Section 2: Science as a Process Preview Key Ideas Behavior of Natural Systems Scientific Methods Scientific Measurements.
Science Skills. Observations Observation - noticeable facts; using senses to gather information ; data measured, collected, perceived, or noticed 2 types:
Lesson Overview Lesson Overview What Is Science? Lesson Overview 1.1 What Is Science?
ProvenanceIntroPersonalityPrimingDist FuncWrap-up 1/40 User-Centric Visual Analytics Remco Chang Tufts University.
1 Remco Chang – Dagstuhl 15 From vision science to data science: applying perception to problems in big data Remco Chang Assistant Professor Computer Science.
Lesson Overview Lesson Overview What Is Science? Lesson Overview 1.1 What Is Science?
Module 4: Systems Development Chapter 13: Investigation and Analysis.
Finite State Machines (FSM) OR Finite State Automation (FSA) - are models of the behaviors of a system or a complex object, with a limited number of defined.
LECTURE 12: ANALYTIC PROVENANCE November 16, 2015 SDS235: Visual Analytics Note: slide deck adapted from R. Chang.
Evaluating the Relationships between User Interaction and Financial Visual Analysis Dong Hyun Jeong, Wenwen Dou, Felesia Stukes, William Ribarsky, Heather.
Lesson Overview Lesson Overview What Is Science?.
Systems Development Lifecycle
IntroGoalCrowdPredictionWrap-up 1/26 Learning Debugging and Hacking the User Remco Chang Assistant Professor Tufts University.
THE SCIENTIFIC METHOD Murtaugh 1A Living Environment.
1 Design and evaluation methods: Objectives n Design life cycle: HF input and neglect n Levels of system design: Going beyond the interface n Sources of.
Lecture 15: Analytic Provenance
School of Computer Science & Engineering
Remco Chang Associate Professor Computer Science, Tufts University
Big Data Visual Analytics: Challenges and Opportunities
CSc4730/6730 Scientific Visualization
CSc4730/6730 Scientific Visualization
Introduction to Visual Analytics
Title of your experimental design
GhostLink: Latent Network Inference for Influence-aware Recommendation
Presentation transcript:

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 1/52 User-Centric Visual Analytics Remco Chang Tufts University

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 2/52 Human + Computer Human vs. Artificial Intelligence Garry Kasparov vs. Deep Blue (1997) – Computer takes a “brute force” approach without analysis – “As for how many moves ahead a grandmaster sees,” Kasparov concludes: “Just one, the best one” Artificial vs. Augmented Intelligence Hydra vs. Cyborgs (2005) – Grandmaster + 1 chess program > Hydra (equiv. of Deep Blue) – Amateur + 3 chess programs > Grandmaster + 1 chess program

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 3/52 Visual Analytics = Human + Computer Visual analytics is "the science of analytical reasoning facilitated by visual interactive interfaces.“ 1 By definition, it is a collaboration between human and computer to solve problems. 1. Thomas and Cook, “Illuminating the Path”, 2005.

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 4/52 Example: What Does (Wire) Fraud Look Like? Financial Institutions like Bank of America have legal responsibilities to report all suspicious wire transaction activities (money laundering, supporting terrorist activities, etc) Data size: approximately 200,000 transactions per day (73 million transactions per year) Problems: – Automated approach can only detect known patterns – Bad guys are smart: patterns are constantly changing – Data is messy: lack of international standards resulting in ambiguous data Current methods: – 10 analysts monitoring and analyzing all transactions – Using SQL queries and spreadsheet-like interfaces – Limited time scale (2 weeks)

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 5/52 WireVis: Financial Fraud Analysis In collaboration with Bank of America – Develop a visual analytical tool (WireVis) – Visualizes 7 million transactions over 1 year – Beta-deployed at WireWatch A great problem for visual analytics: – Ill-defined problem (how does one define fraud?) – Limited or no training data (patterns keep changing) – Requires human judgment in the end (involves law enforcement agencies) Design philosophy: “combating human intelligence requires better (augmented) human intelligence” R. Chang et al., Scalable and interactive visual analysis of financial wire transactions for fraud detection. Information Visualization,2008. R. Chang et al., Wirevis: Visualization of categorical, time-varying data from financial transactions. IEEE VAST, 2007.

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 6/52 WireVis: A Visual Analytics Approach Heatmap View (Accounts to Keywords Relationship) Strings and Beads (Relationships over Time) Search by Example (Find Similar Accounts) Keyword Network (Keyword Relationships)

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 7/52 Applications of Visual Analytics Political Simulation – Agent-based analysis – With DARPA Global Terrorism Database – With DHS Bridge Maintenance – With US DOT – Exploring inspection reports Biomechanical Motion – Interactive motion comparison R. Chang et al., Two Visualization Tools for Analysis of Agent-Based Simulations in Political Science. IEEE CG&A, 2012

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 8/52 Applications of Visual Analytics Where When Who What Original Data Evidence Box R. Chang et al., Investigative Visual Analysis of Global Terrorism, Journal of Computer Graphics Forum, Political Simulation – Agent-based analysis – With DARPA Global Terrorism Database – With DHS Bridge Maintenance – With US DOT – Exploring inspection reports Biomechanical Motion – Interactive motion comparison

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 9/52 Applications of Visual Analytics R. Chang et al., An Interactive Visual Analytics System for Bridge Management, Journal of Computer Graphics Forum, To Appear. Political Simulation – Agent-based analysis – With DARPA Global Terrorism Database – With DHS Bridge Maintenance – With US DOT – Exploring inspection reports Biomechanical Motion – Interactive motion comparison

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 10/52 Applications of Visual Analytics R. Chang et al., Interactive Coordinated Multiple-View Visualization of Biomechanical Motion Data, IEEE Vis (TVCG) Political Simulation – Agent-based analysis – With DARPA Global Terrorism Database – With DHS Bridge Maintenance – With US DOT – Exploring inspection reports Biomechanical Motion – Interactive motion comparison

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 11/52 Talk Outline Discuss 4 Visual Analytics problems from a User-Centric perspective: 1.One optimal visualization for every user? 2.Does the user always behave the same with a visualization? 3.Can a user’s reasoning process be recorded and stored? 4.Can such reasoning processes and knowledge be expressed quantitatively?

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 12/52 1. Is there an optimal visualization? How personality influences compatibility with visualization style

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 13/52 What’s the Best Visualization for You? Jürgensmann and Schulz, “Poster: A Visual Survey of Tree Visualization”. InfoVis, 2010.

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 14/52 What’s the Best Visualization for You? Intuitively, not everyone is created equal. – Our background, experience, and personality should affect how we perceive and understand information. So why should our visualizations be the same for all users?

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 15/52 Cognitive Profile Objective: to create personalized information visualizations based on individual differences Hypothesis: cognitive factors affect a person’s ability (speed and accuracy) in using different visualizations.

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 16/52 Experiment Procedure 4 visualizations on hierarchical visualization – From list-like view to containment view 250 participants using Amazon’s Mechanical Turk Questionnaire on “locus of control” (LOC) – Definition of LOC: the degree to which a person attributes outcomes to themselves (internal LOC) or to outside forces (external LOC) R. Chang et al., How Locus of Control Influences Compatibility with Visualization Style, IEEE VAST V1 V2 V3 V4

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 17/52 Results When with list view compared to containment view, internal LOC users are: – faster (by 70%) – more accurate (by 34%) Only for complex (inferential) tasks The speed improvement is about 2 minutes (116 seconds)

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 18/52 Conclusion Cognitive factors can affect how a user perceives and understands information from using a visualization The effect could be significant in terms of both efficiency and accuracy Design Implications: Personalized displays should take into account a user’s cognitive profile

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 19/52 2. WHAT?? Is the relationship between LOC and visual style coincidental or dependent?

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 20/52 What We Know About LOC and Visualization: Visual Form List-View (V1) Containment (V4) Performance Poor Good Internal LOC External LOCAverage LOC

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 21/52 We Also Know: Based on Psychology research, we know that locus of control can be temporarily affected through priming For example, to reduce locus of control (to make someone have a more external LOC) “We know that one of the things that influence how well you can do everyday tasks is the number of obstacles you face on a daily basis. If you are having a particularly bad day today, you may not do as well as you might on a day when everything goes as planned. Variability is a normal part of life and you might think you can’t do much about that aspect. In the space provided below, give 3 examples of times when you have felt out of control and unable to achieve something you set out to do. Each example must be at least 100 words long.”

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 22/52 Known Facts: 1.There is a relationship between LOC and use of visualization 2.LOC can be primed Research Question: – If we can affect the user’s LOC, will that affect their use of visualization? Hypothesis: – If yes, then the relationship between LOC and visualization style is dependent – If no, then we claim that LOC is a stable indicator of a user’s visualization style =>Publication! Research Question =>Publication!

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 23/52 LOC and Visualization Visual Form List-View (V1) Containment (V4) Performance Poor Good Internal LOC External LOCAverage LOC Condition 1: Make Internal LOC more like External LOC

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 24/52 LOC and Visualization Visual Form List-View (V1) Containment (V4) Performance Poor Good Internal LOC External LOC Average LOC Condition 2: Make External LOC more like Internal LOC

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 25/52 LOC and Visualization Visual Form List-View (V1) Containment (V4) Performance Poor Good Internal LOC External LOC Average LOC Condition 3: Make 50% of the Average LOC more like Internal LOC Condition 4: Make 50% of the Average LOC more like External LOC

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 26/52 Result Yes, users behaviors can be altered by priming their LOC! However, this is only true for: – Speed (less so for accuracy) – Only for complex tasks (inferential tasks)

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 27/52 Effects of Priming (Condition 3) Visual Form List-View (V1) Containment (V4) Performance Poor Good Internal LOC External LOC Average LOC Average -> External

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 28/52 Effects of Priming (Condition 4) Visual Form List-View (V1) Containment (V4) Performance Poor Good Internal LOC External LOC Average LOC Average ->Internal

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 29/52 Effects of Priming (Condition 1) Visual Form List-View (V1) Containment (V4) Performance Poor Good Internal LOC External LOC Average LOC Internal->External

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 30/52 Effects of Priming (Condition 2) Visual Form List-View (V1) Containment (V4) Performance Poor Good Internal LOC External LOCAverage LOC External -> Internal

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 31/52 Conclusion The relationship between Locus of Control and visualization style appears to be causal: by priming a user’s LOC, we an alter their behavior with a visualization in a deterministic manner. Future work: examine if the interaction patterns are different between the LOC groups. – Can train machine learning models to learn a personality profile based on interaction pattern. – Sell the software to Google! Implications to (a) evaluations of visualizations, and (b) designing visual interfaces.

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 32/52 3. What’s In a User’s Interactions? How much of a user’s reasoning can be recovered from the interaction log?

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 33/52 What is in a User’s Interactions? Types of Human-Visualization Interactions – Word editing (input heavy, little output) – Browsing, watching a movie (output heavy, little input) – Visual Analysis (closer to 50-50) Challenge: Can we capture and extract a user’s reasoning and intent through capturing a user’s interactions? VisualizationHuman Output Input Keyboard, Mouse, etc Images (monitor)

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 34/52 What is in a User’s Interactions? Goal: determine if a user’s reasoning and intent are reflected in a user’s interactions. Analysts Grad Students (Coders) Logged (semantic) Interactions Compare! (manually) Strategies Methods Findings Guesses of Analysts’ thinking WireVis Interaction-Log Vis

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 35/52 What’s in a User’s Interactions From this experiment, we find that interactions contains at least: – 60% of the (high level) strategies – 60% of the (mid level) methods – 79% of the (low level) findings R. Chang et al., Recovering Reasoning Process From User Interactions. CG&A, R. Chang et al., Evaluating the Relationship Between User Interaction and Financial Visual Analysis. VAST, 2009.

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 36/52 What’s in a User’s Interactions Why are these so much lower than others? – (recovering “methods” at about 15%) Only capturing a user’s interaction in this case is insufficient.

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 37/52 Conclusion A high percentage of a user’s reasoning and intent are reflected in a user’s interactions. Raises lots of question: (a) what is the upper- bound, (b) how to automate the process, (c) how to utilize the captured results This study is not exhaustive. It merely provides a sample point of what is possible. R. Chang et al., Analytic Provenance Panel at IEEE VisWeek R. Chang et al., Analytic Provenance Workshop at CHI. 2011

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 38/52 4. If Interaction Logs Contain Knowledge… Can domain knowledge be captured and represented quantitatively?

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 39/52 Find Distance Function, Hide Model Inference Observation: Domain experts do not know how to visualize their own data, but knows it when a visualization looks “wrong”. More importantly, they often know why it looks wrong

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 40/52 Working with Domain Experts Common practice: the visualization expert modifies the visualization and asks for the domain expert’s opinion. – Repeat cycle – …Publish results Question: why can’t the domain expert “fix” the visualization themselves by interacting with the visualization directly?

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 41/52 Direct Manipulation of Visualization We have developed a system that allows the expert to directly move the elements of the visualization to what they think is “right”. We start by “guessing” a distance function, and ask the user to move the points to the “right” place

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 42/52 Direct Manipulation of Visualization The process is repeated a few times… Until the expert is happy (or the visualization can not be improved further) The system outputs a new distance function!

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 43/52 Our Approach Data Distance Function (Θ 0 ) Principal Component Analysis We start with a standard high-D to 2D visualization method using Principal Component Analysis (PCA). – Input to PCA is a distance matrix – Meaning that we need to assume a distance function At t=0, the system assumes the weights to the distance function. We call these weights (Θ 0 ). The system creates a visualization Then the user updates the visualization… 2D Visualization (t=0)

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 44/52 Our Approach Data Distance Function (Θ 1 ) Principal Component Analysis At t=1, we look to update our model to (Θ 1 ) based on the layout that the user created. We notice that the data is immutable, the PCA cannot be inverted. But we could update the weights to the distance function. We use a standard gradient descent method to find a set of weights (Θ 1 ) that best satisfies the layout Then we repeat the process 2D Visualization (t=1)

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 45/52 Our Approach Data Distance Function (Θ 1 ) Principal Component Analysis At t=2, we want to use the newly-found set of weights (Θ 1 ) to create a new visualization. We do that by using (Θ 1 ) to compute the distance matrix, which feeds into PCA, and results in a new visualization layout. This process is iterated until the user finds a satisfactory layout, or the system cannot improve its answer any further. 2D Visualization (t=2)

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 46/52 Results Tells the domain expert what dimension of data they care about, and what dimensions are not useful!

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 47/52 Our Current Implementation Linear distance function: Optimization:

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 48/52 Conclusion With an appropriate projection model, it is possible to quantify a user’s interactions. In our system, we let the domain expert interact with a familiar representation of the data (scatter plot), and hides the ugly math (distance function) The system learns the weights of the distance function. The resulting function reflects the expert’s mental model of the dataset. Many machine learning algorithms require a valid distance function. We see our system being the “first step” to many visual analytics systems. R. Chang et al., Find Distance Function, Hide Model Inference. IEEE VAST Poster 2011

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 49/52 Summary

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 50/52 Summary While Visual Analytics have grown and is slowly finding its identity, There is still many open problems that need to be addressed. I propose that one research area that has largely been unexplored is in the understanding and supporting of the human user.

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 51/52 Summary 1.Is there a best visualization for each user? – Possibly, through understanding individual differences 2.Can the user’s behavior with a visualization be altered? – Yes, priming LOC affects a user’s behavior with a visualization 3.What is in a user’s interactions? – A great deal of a user’s reasoning process can be recovered through analyzing a user’s interactions 4.Can domain knowledge be externalized quantitatively? – Yes, given some assumptions about the visualization, a user can interactively externalize their knowledge quantitatively.

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 52/52

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 53/52 Backup Slides…

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 54/52 Surveyed 1,200+ papers from CHI, IUI, KDD, Vis, InfoVis, VAST Found 49 relating to human + computer collaboration Using a model of human and computer affordances, examined each of the projects to identify what “works” and what could be missing Human Complexity

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 55/52 Visual Judgment Cleveland and McGill study on perception of angle vs. position in statistical charts. (1984) Indicates that humans are better at judging length (in bar graph) than angles (in pie chart) Heer and Bostock extension to using Amazon’s Mechanical Turk (2010) Replicated Cleveland- McGill and show that Turk is feasible for perceptual experiments

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 56/52 Visual Judgment We introduced affective- priming to Heer-Bostock and found significance in how positively-primed subjects perform better in visual judgment. Priming was introduced through text (verbal priming). Uplifting and discouraging stories found on NY Times

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 57/52 fNIRS with Visualizations Bar graphs have been shown to be better than pie charts for visual judgment. Why are pie charts everywhere? – Increasing workload in n-back tests – Mental workload difference

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 58/52 Human + Computer: Dimension Reduction – Lost in Translation Dimension reduction using principle component analysis (PCA) Quick Refresher of PCA – Find most dominant eigenvectors as principle components – Data points are re-projected into the new coordinate system For reducing dimensionality For finding clusters For many (especially novices), PCA is easy to understand mathematically, but difficult to understand “semantically”. age height GPA 0.5*GPA + 0.2*age + 0.3*height = ?

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 59/52 Human + Computer: Exploring Dimension Reduction: iPCA R. Chang et al., iPCA: An Interactive System for PCA-based Visual Analytics. Computer Graphics Forum (Eurovis), 2009.

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 60/52 Human + Computer: Comparing iPCA to SAS/INSIGHT Results – Users seem to understand the intuition behind PCA better – A bit more accurate – Not faster – People don’t “give up” Overall preference – Using letter grades (A through F) with “A” representing excellent and F a failing grade. Problem is worse with non-linear dimension reduction A lot more work needs to be done…

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 61/52 4. How to Aggregate Multiple Analysis To Perform Group Analytics

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 62/52 Scaling Human Computation Problem Statement: Computing can be scaled (by adding more CPUs). Visualizations can be scaled (by adding more monitors). Can analysis be scaled by adding more humans? Assumption: Conventional wisdom says that humans cannot be scaled because of difficulty in communicating analytical reasoning efficiently.

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 63/52 Temporal Graph Research Proposal: We propose a Temporal Graph approach to model analytical trails. In a temporal graph, – Node = a unique state in the visual analysis trail. – Edge = a (temporal) transition from one state to another.

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 64/52 For Example: 2 analysts, A and B, each performed an analysis on the same data A0A1A2A3A4 A5 B0B1B2B3 B4

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 65/52 For Example: If A2 is the same as B1 (in that they represent the same analysis step)… A0A1 A2 A3A4 A5 B0 B1 B2B3 B4

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 66/52 For Example: We will merge the two nodes A0A1 A2 B1 A3A4 A5 B0B2B3 B4

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 67/52 For Example This process is repeated for all analysis trails across all analysts, and we could get a temporal graph that look like:

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 68/52 With a Temporal Graph… We can answer many questions. For example: – Given a particular outcome (a yellow states), is there a state that is the catalyst in which every subsequent analysis trail start from? the answer is yes: The red states are “points of no return” The green states are the “last decision points”

ProvenanceIntroPersonalityPrimingDist FuncWrap-up 69/52 Conclusion There are many benefits to posing analysis trails as a temporal graph problem. Mostly, the benefit comes from our ability to apply known graph algorithms. Incidentally, this temporal graph formulation can be applied to visualize and analyze other problems involving large state space. Poster to be presented at VAST 2011