VALTVA IntroAppsWrap-up 1/16 Interactive Data Analysis and Model Exploration: A Visual Analytics Approach Remco Chang Tufts University Department of Computer.

Slides:



Advertisements
Similar presentations
1/26Remco Chang – Dagstuhl 14 Analyzing User Interactions for Data and User Modeling Remco Chang Assistant Professor Tufts University.
Advertisements

1/54Remco Chang – LANL 14 Analyzing User Interactions for Data and User Modeling Remco Chang Assistant Professor Tufts University.
ProvenanceIntroLOCCog StateDist FuncWrap-up 1/52 User-Centric Visual Analytics Remco Chang Tufts University.
LECTURE 10: ANALYTIC PROVENANCE April 6, 2015 COMP Topics in Visual Analytics Note: slide deck adapted from R. Chang.
EvaluationIntroVis/GfxInteractionWrap-up Thinking Interactively with Visualizations Remco Chang UNC Charlotte Charlotte Visualization Center.
VALTChessVA IntroAppsWrap-up 1/25 User-Centric Visual Analytics Remco Chang Tufts University Department of Computer Science.
Dist FuncIntroVAAppsATGWrap-up 1/25 Visual Analytics Research at Tufts Remco Chang Assistant Professor Tufts University.
ProvenanceIntroApplicationPersonalityDist FuncWrap-up 1/36 User-Centric Visual Analytics Remco Chang Tufts University Department of Computer Science.
JOURNAL OF HUMAN–COMPUTER INTERACTION 2010 Ji Soo Yi, Niklas Elmqvist, and Seungyoon Lee.
Visual Analytics Research at WPI Dr. Matthew Ward and Dr. Elke Rundensteiner Computer Science Department.
WireVis Visualization of Categorical, Time-Varying Data From Financial Transactions Remco Chang, Mohammad Ghoniem, Robert Kosara, Bill Ribarsky, Jing Yang,
1/26Remco Chang – PNNL 14 Analyzing User Interactions for Data and User Modeling Remco Chang Assistant Professor Tufts University.
Research to Reality William Ribarsky Remco Chang University of North Carolina at Charlotte.
Live Re-orderable Accordion Drawing (LiveRAC) Peter McLachlan, Tamara Munzner Eleftherios Koutsofios, Stephen North AT&T Research Symposium August, 2007.
Chapter 14 The Second Component: The Database.
Introduction to Data Science Kamal Al Nasr, Matthew Hayes and Jean-Claude Pedjeu Computer Science and Mathematical Sciences College of Engineering Tennessee.
WPI Center for Research in Exploratory Data and Information Analysis From Data to Knowledge: Exploring Industrial, Scientific, and Commercial Databases.
CHAPTER 11 Managerial Support Systems. CHAPTER OUTLINE  Managers and Decision Making  Business Intelligence Systems  Data Visualization Technologies.
OLAM and Data Mining: Concepts and Techniques. Introduction Data explosion problem: –Automated data collection tools and mature database technology lead.
Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization.
Data Mining. 2 Models Created by Data Mining Linear Equations Rules Clusters Graphs Tree Structures Recurrent Patterns.
1/30Remco Chang – SEAri Workshop 15 Big Data Visual Analytics: A User Centric Approach Remco Chang Assistant Professor Tufts University.
Exploratory Data Analysis. Computing Science, University of Aberdeen2 Introduction Applying data mining (InfoVis as well) techniques requires gaining.
SizeIntroDefinitionComplexityTuftsWrap-up 1/54 Big Data Visual Analytics: Challenges and Opportunities Remco Chang Tufts University.
Information Design and Visualization
Data Management Turban, Aronson, and Liang Decision Support Systems and Intelligent Systems, Seventh Edition.
Tennessee Technological University1 The Scientific Importance of Big Data Xia Li Tennessee Technological University.
CIS 9002 Kannan Mohan Department of CIS Zicklin School of Business, Baruch College.
Dist FuncIntroPersonalityProvenanceGroupWrap-up 1/40 User-Centric Visual Analytics Remco Chang Tufts University.
IntroDefinitionSizeComplexityWrap-up 1/54 Individual Big Data Visual Analytics: Challenges and Opportunities Remco Chang and Eli Brown Tufts University.
J. Leskovec, A. Rajaraman, J. Ullman: Mining of Massive Datasets, 2.
Introduction GAM 376 Robin Burke Winter Outline Introductions Syllabus.
© 2010 Pearson Addison-Wesley. All rights reserved. Addison Wesley is an imprint of Designing the User Interface: Strategies for Effective Human-Computer.
What are your interactions doing for your visualization? Remco Chang UNC Charlotte Charlotte Visualization Center.
1/20 (Big Data Analytics for Everyone) Remco Chang Assistant Professor Department of Computer Science Tufts University Big Data Visual Analytics: A User-Centric.
VISUAL ANALYTICS: VISUAL EXPLORATION, ANALYSIS, AND PRESENTATION OF LARGE COMPLEX DATA Remco Chang, PhD (Charlotte Visualization Center) (Tufts University)
VALTVA IntroAppsWrap-up 1/34 User-Centric Visual Analytics Remco Chang Tufts University Department of Computer Science.
ProvenanceIntroPersonalityPrimingDist FuncWrap-up 1/52 User-Centric Visual Analytics Remco Chang Tufts University.
The Interplay Between Mathematics/Computation and Analytics Haesun Park Division of Computational Science and Engineering Georgia Institute of Technology.
ProvenanceIntroPersonalityPrimingDist FuncWrap-up 1/40 User-Centric Visual Analytics Remco Chang Tufts University.
Kansas State University Department of Computing and Information Sciences CIS 730: Introduction to Artificial Intelligence Friday, 14 November 2003 William.
LECTURE 16: (EVEN MORE) OPEN QUESTIONS IN VISUAL ANALYTICS December 9, 2015 SDS 235 Visual Analytics.
L&I SCI 110: Information science and information theory Instructor: Xiangming(Simon) Mu Sept. 9, 2004.
1 Remco Chang – Dagstuhl 15 From vision science to data science: applying perception to problems in big data Remco Chang Assistant Professor Computer Science.
1/41 Visualization and Analysis of Text Remco Chang, PhD Assistant Professor Department of Computer Science Tufts University December 17, 2010 Cologne,
LECTURE 12: ANALYTIC PROVENANCE November 16, 2015 SDS235: Visual Analytics Note: slide deck adapted from R. Chang.
Evaluating the Relationships between User Interaction and Financial Visual Analysis Dong Hyun Jeong, Wenwen Dou, Felesia Stukes, William Ribarsky, Heather.
Mining of Massive Datasets Edited based on Leskovec’s from
IntroGoalCrowdPredictionWrap-up 1/26 Learning Debugging and Hacking the User Remco Chang Assistant Professor Tufts University.
Machine Learning. Definition Machine learning is a subfield of computer science that evolved from the study of pattern recognition and computational.
1 Seattle University Master’s of Science in Business Analytics Key skills, learning outcomes, and a sample of jobs to apply for, or aim to qualify for,
BIG DATA. The information and the ability to store, analyze, and predict based on that information that is delivering a competitive advantage.
R EMCO C HANG | T UFTS U NIVERSITY 1/38 B IG D ATA V ISUAL A NALYTICS : A U SER -C ENTRIC A PPROACH Remco Chang Assistant Professor Computer Science, Tufts.
Usability and Human Factors Cognition and Human Performance Lecture c This material (Comp15_Unit3c) was developed by Columbia University, funded by the.
R EMCO C HANG | T UFTS U NIVERSITY 1/38 B IG D ATA V ISUAL A NALYTICS : A U SER -C ENTRIC A PPROACH Remco Chang Assistant Professor Computer Science, Tufts.
Overview of Artificial Intelligence (1) Artificial intelligence (AI) Computers with the ability to mimic or duplicate the functions of the human brain.
Big Data Visual Analytics: A User-Centric Approach
Lecture 15: Analytic Provenance
School of Computer Science & Engineering
Lecture 18: (even more) Open Problems
Remco Chang Associate Professor Computer Science, Tufts University
Introduction C.Eng 714 Spring 2010.
Big Data Visual Analytics: Challenges and Opportunities
به نام خدا Big Data and a New Look at Communication Networks Babak Khalaj Sharif University of Technology Department of Electrical Engineering.
Data Warehousing and Data Mining
CSc4730/6730 Scientific Visualization
Information Design and Visualization
Introduction to Visual Analytics
CHAPTER 7: Information Visualization
Big DATA.
Presentation transcript:

VALTVA IntroAppsWrap-up 1/16 Interactive Data Analysis and Model Exploration: A Visual Analytics Approach Remco Chang Tufts University Department of Computer Science

VALTVA IntroAppsWrap-up 2/16 Human + Computer Human vs. Artificial Intelligence Garry Kasparov vs. Deep Blue (1997) – Computer takes a “brute force” approach without analysis – “As for how many moves ahead a grandmaster sees,” Kasparov concludes: “Just one, the best one” Artificial vs. Augmented Intelligence Hydra vs. Cyborgs (2005) – Grandmaster + 1 chess program > Hydra (equiv. of Deep Blue) – Amateur + 3 chess programs > Grandmaster + 1 chess program

VALTVA IntroAppsWrap-up 3/16 Visual Analytics = Human + Computer Visual analytics is “the science of analytical reasoning facilitated by visual interactive interfaces.” 1 By definition, it is a collaboration between human and computer to solve problems. 1. Thomas and Cook, “Illuminating the Path”, 2005.

VALTVA IntroAppsWrap-up 4/16 Example: What Does (Wire) Fraud Look Like? Financial Institutions like Bank of America have legal responsibilities to report all suspicious wire transaction activities (money laundering, supporting terrorist activities, etc) Data size: approximately 200,000 transactions per day (73 million transactions per year) Problems: – Automated approach can only detect known patterns – Bad guys are smart: patterns are constantly changing – Data is messy: lack of international standards resulting in ambiguous data Current methods: – 10 analysts monitoring and analyzing all transactions – Using SQL queries and spreadsheet-like interfaces – Limited time scale (2 weeks)

VALTVA IntroAppsWrap-up 5/16 WireVis: Financial Fraud Analysis In collaboration with Bank of America – Develop a visual analytical tool (WireVis) – Visualizes 7 million transactions over 1 year – Beta-deployed at WireWatch Integrates an interactive visual interface with computation: – User-defined hierarchical clustering – “Search by example” – Etc Design philosophy: “combating human intelligence requires better (augmented) human intelligence” R. Chang et al., Scalable and interactive visual analysis of financial wire transactions for fraud detection. Information Visualization,2008. R. Chang et al., Wirevis: Visualization of categorical, time-varying data from financial transactions. IEEE VAST, 2007.

VALTVA IntroAppsWrap-up 6/16 WireVis: A Visual Analytics Approach Heatmap View (Accounts to Keywords Relationship) Strings and Beads (Relationships over Time) Search by Example (Find Similar Accounts) Keyword Network (Keyword Relationships)

VALTVA IntroAppsWrap-up 7/16 Applications of Visual Analytics Political Simulation – Agent-based analysis – With DARPA Global Terrorism Database – With DHS Bridge Maintenance – With US DOT – Exploring inspection reports Biomechanical Motion – Interactive motion comparison R. Chang et al., Two Visualization Tools for Analysis of Agent-Based Simulations in Political Science. IEEE CG&A, 2012

VALTVA IntroAppsWrap-up 8/16 Applications of Visual Analytics Where When Who What Original Data Evidence Box R. Chang et al., Investigative Visual Analysis of Global Terrorism, Journal of Computer Graphics Forum, Political Simulation – Agent-based analysis – With DARPA Global Terrorism Database – With DHS Bridge Maintenance – With US DOT – Exploring inspection reports Biomechanical Motion – Interactive motion comparison

VALTVA IntroAppsWrap-up 9/16 Applications of Visual Analytics R. Chang et al., An Interactive Visual Analytics System for Bridge Management, Journal of Computer Graphics Forum, To Appear. Political Simulation – Agent-based analysis – With DARPA Global Terrorism Database – With DHS Bridge Maintenance – With US DOT – Exploring inspection reports Biomechanical Motion – Interactive motion comparison

VALTVA IntroAppsWrap-up 10/16 Applications of Visual Analytics R. Chang et al., Interactive Coordinated Multiple-View Visualization of Biomechanical Motion Data, IEEE Vis (TVCG) Political Simulation – Agent-based analysis – With DARPA Global Terrorism Database – With DHS Bridge Maintenance – With US DOT – Exploring inspection reports Biomechanical Motion – Interactive motion comparison

VALTVA IntroAppsWrap-up 11/16 Interaction In these examples, one of the keys to making these systems effective is the use of high interactivity – Technically, this means about 12 frames per second (fps) – Perceptually, our eyes perceive 12+ fps as “responsive” and “smoothly animated” – Cognitively, 0.2 seconds is the amount of time our brain can hold sensory memory (the “after image effect”) In building VA systems, interactivity allows a user to: – “Externalize” memory – Perform analysis in an uninterrupted manner – Express domain knowledge

VALTVA IntroAppsWrap-up 12/16 Analyzing User’s Interactions: Do Interactions Contain Knowledge?

VALTVA IntroAppsWrap-up 13/16 What is in a User’s Interactions? Goal: determine if a user’s reasoning and intent are reflected in a user’s interactions. Analysts Grad Students (Coders) Logged (semantic) Interactions Compare! (manually) Strategies Methods Findings Guesses of Analysts’ thinking WireVis Interaction-Log Vis

VALTVA IntroAppsWrap-up 14/16 What’s in a User’s Interactions From this experiment, we find that interactions contains at least: – 60% of the (high level) strategies – 60% of the (mid level) methods – 79% of the (low level) findings R. Chang et al., Recovering Reasoning Process From User Interactions. CG&A, R. Chang et al., Evaluating the Relationship Between User Interaction and Financial Visual Analysis. VAST, 2009.

VALTVA IntroAppsWrap-up 15/16 Human + Computer Interaction allows the human to express domain knowledge Part of the purpose of this panel is to demonstrate to you that statistics (computing) + humans is much more powerful than statistics alone or human alone This can be achieved through well-designed Visual Analytics systems

VALTVA IntroAppsWrap-up 16/16 Final Thought… “The sexy job in the next 10 years will be statisticians,” said Hal Varian, chief economist at Google. “And I’m not kidding.” Yet data is merely the raw material of knowledge. “We’re rapidly entering a world where everything can be monitored and measured,” said Erik Brynjolfsson, an economist and director of the Massachusetts Institute of Technology’s Center for Digital Business. “But the big problem is going to be the ability of humans to use, analyze and make sense of the data.” “The key is to let computers do what they are good at, which is trawling these massive data sets for something that is mathematically odd,” said Daniel Gruhl, an I.B.M. researcher whose recent work includes mining medical data to improve treatment. “And that makes it easier for humans to do what they are good at — explain those anomalies.” 1 1. New York Times. “For Today’s Graduate, Just One Word: Statistics “, August 5, Graphics & Visualization Computing Interaction & Reasoning

VALTVA IntroAppsWrap-up 17/16 Thank you! Questions?

VALTVA IntroAppsWrap-up 18/16

VALTVA IntroAppsWrap-up 19/16 Backup Slides

VALTVA IntroAppsWrap-up 20/16 VALT Research Projects 1.Theory -- Jordan Crouser: Complexity classes of Human+Computer 2.Interactive Machine Learning -- Eli Brown: Model learning from user interactions Analytic provenance 3.Psych / Cog Sci -- Alvitta Ottley: Personality factors and Brain Sensing with fNIRS Uncertainty visualization (medical) 4.Big Data -- Leilani Battle (MIT): Interactive DB Visualization & Exploration (collaboration with MIT)

VALTVA IntroAppsWrap-up 21/16 Analysis (Jordan Crouser) 1. Human + Computer Computation: Can The Two Complement Each Other?

VALTVA IntroAppsWrap-up 22/16 Quantifying Human+Computer Collaboration

VALTVA IntroAppsWrap-up 23/16 Quantifying Human+Computer Collaboration

VALTVA IntroAppsWrap-up 24/16 Surveyed 1,200+ papers from CHI, IUI, KDD, Vis, InfoVis, VAST Found 49 relating to human + computer collaboration Using a model of human and computer affordances, examined each of the projects to identify what “works” and what could be missing Understanding Human Complexity Joint work with Jordan Couser. An affordance-based framework for human computation and human-computer collaboration. IEEE VAST 2012.

VALTVA IntroAppsWrap-up 25/16 Quantifying Human+Computer Collaboration

VALTVA IntroAppsWrap-up 26/16 Interactive Machine Learning (Eli Brown) 2. Interactive Model Learning: Can Knowledge be Represented Quantitatively?

VALTVA IntroAppsWrap-up 27/16 Iterative Interactive Analysis

VALTVA IntroAppsWrap-up 28/16 Direct Manipulation of Visualization Linear distance function: Optimization:

VALTVA IntroAppsWrap-up 29/16 Results Tells the users what dimension of data they care about, and what dimensions are not useful! Blue: original data dimension Red: randomly added dimensions X-axis: dimension number Y-axis: final weights of the distance function Using the “Wine” dataset (13 dimensions, 3 clusters) – Assume a linear (sum of squares) distance function Added 10 extra dimensions, and filled them with random values

VALTVA IntroAppsWrap-up 30/16 Individual Differences (Alvitta Ottley) 3. A User’s Cognitive Traits & States, Experiences & Biases: How To Identify The End User’s Needs?

VALTVA IntroAppsWrap-up 31/16 Experiment Procedure 4 visualizations on hierarchical visualization – From list-like view to containment view 250 participants using Amazon’s Mechanical Turk Questionnaire on “locus of control” (LOC) – Definition of LOC: the degree to which a person attributes outcomes to themselves (internal LOC) or to outside forces (external LOC) R. Chang et al., How Locus of Control Influences Compatibility with Visualization Style, IEEE VAST V1 V2 V3 V4

VALTVA IntroAppsWrap-up 32/16 Results Personality Factor: Locus of Control – (internal => faster/better with containment) – (external => faster/better with list)

VALTVA IntroAppsWrap-up 33/16 Affective Priming on Visual Judgment R. Chang et al., Influencing Visual Judgment Through Affective Priming, CHI 2013.

VALTVA IntroAppsWrap-up 34/16 Preliminary Study – Using Brain Sensing (fNIRS) Functional Near-Infrared Spectroscopy a lightweight brain sensing technique measures mental demand (working memory) R. Chang et al., Using fNIRS Brain Sensing to Evaluate Information Visualization Interfaces. CHI 2013.

VALTVA IntroAppsWrap-up 35/16 This is Your Brain on Bar graphs and Pie Charts 3-back test

VALTVA IntroAppsWrap-up 36/16 Big Data (Leilani Battle (MIT) & Liz Salowitz) 4. Interactive Exploration of Large Databases: Big Database, Small Laptop, Can a User Interact with Big Data in Real Time?

VALTVA IntroAppsWrap-up 37/16 Problem Statement Visualization on a Commodity Hardware Large Data in a Data Warehouse

VALTVA IntroAppsWrap-up 38/16 Problem Statement Constraint: Data is too big to fit into the memory or hard drive of the personal computer – Note: Ignoring various database technologies (OLAP, Column-Store, No-SQL, Array-Based, etc) Classic Computer Science Problem… What are some previous techniques? – Truncate (sample, filter) – Resolution reduction (“blurring”, image zooming) – Stream (think Netflix, Hulu) – Pre-fetch (think open world 3D video games)

VALTVA IntroAppsWrap-up 39/16 Strategies for Real Time DB Visualization

VALTVA IntroAppsWrap-up 40/16 Using SciDB