User Tasks in Visualization Environments--Eleven basic actions identify, locate, distinguish, categorize, cluster, distribution, rank, compare within relations,

Slides:



Advertisements
Similar presentations
Chapter 3 – Data Exploration and Dimension Reduction © Galit Shmueli and Peter Bruce 2008 Data Mining for Business Intelligence Shmueli, Patel & Bruce.
Advertisements

Population Population
Statistics It is the science of planning studies and experiments, obtaining sample data, and then organizing, summarizing, analyzing, interpreting data,
Math 2260 Introduction to Probability and Statistics
IAT 355 Basic Tasks with Interactive Example: EZChooser, Chapter 2 of Spence ______________________________________________________________________________________.
Measurement in Survey Research Developing Questionnaire Items with Respect to Content and Analysis.
Chapter 9 Business Intelligence Systems
Statistical Methods in Computer Science © 2006-now Gal Kaminka / Ido Dagan 1 Statistical Methods in Computer Science Descriptive Statistics Data 1: Frequency.
Recommender systems Ram Akella February 23, 2011 Lecture 6b, i290 & 280I University of California at Berkeley Silicon Valley Center/SC.
Data Mining By Archana Ketkar.
Recommender systems Ram Akella November 26 th 2008.
Ch 2 and 9.1 Relationships Between 2 Variables
Info Vis: Multi-Dimensional Data Chris North cs3724: HCI.
Data Mining Techniques
STA 2023 Chapter 1 Notes. Terminology  Data: consists of information coming from observations, counts, measurements, or responses.  Statistics: the.
CS 235: User Interface Design November 26 Class Meeting Department of Computer Science San Jose State University Fall 2014 Instructor: Ron Mak
Managing Software Projects Analysis and Evaluation of Data - Reliable, Accurate, and Valid Data - Distribution of Data - Centrality and Dispersion - Data.
Chapter Eight The Concept of Measurement and Attitude Scales
Variable  An item of data  Examples: –gender –test scores –weight  Value varies from one observation to another.
Copyright © Cengage Learning. All rights reserved. Getting Started 1.
Introduction to Statistics What is Statistics? : Statistics is the sciences of conducting studies to collect, organize, summarize, analyze, and draw conclusions.
Introduction to Probability and Statistics Consultation time: Ms. Chong.
Laboratory Assignments If time: Visualization, Perception & Cognition.
Taxonomies of Visualization Techniques CMPT 455/826 - Week 12, Day 2 w12d2 Sept-Dec
Warm Up. Unit 2: One-Variable Statistics Two Main Uses of Statistics TO DESCRIBE (Data Analysis) TO PREDICT (Statistical Inference)
Copyright © Cengage Learning. All rights reserved. Getting Started 1.
______________________________________________________________________________________ SCHOOL OF INTERACTIVE ARTS + TECHNOLOGY [SIAT] |
Statistics for Business and Economics Chapter 1 Statistics, Data, & Statistical Thinking.
Research Methods: Final Year Project Toolkit – Level 6 Laura Lake.
An Overview of Statistics Section 1.1. Ch1 Larson/Farber 2 Statistics is the science of collecting, organizing, analyzing, and interpreting data in order.
Warm Up – to be done on a separate sheet of paper Find the length of a segment with endpoints (-5, 2) and (2, -1).
Math 3680 Lecture #1 Graphical Representation of Data.
L643: Evaluation of Information Systems Week 13: March, 2008.
Data Preprocessing Compiled By: Umair Yaqub Lecturer Govt. Murray College Sialkot.
G042 - Lecture 12 Using Local Databases Mr C Johnston ICT Teacher
Ch1 Larson/Farber 1 Elementary Statistics Math III Introduction to Statistics.
Ch1 Larson/Farber 1 1 Elementary Statistics Larson Farber Introduction to Statistics As you view these slides be sure to have paper, pencil, a calculator.
Chapter 4 –Dimension Reduction Data Mining for Business Analytics Shmueli, Patel & Bruce.
Descriptive Statistics  Individuals – are the objects described by a set of data. Individuals may be people, but they may also be animals or things. 
Introduction Exploring Categorical Variables Exploring Numerical Variables Exploring Categorical/Numerical Variables Selecting Interesting Subsets of Data.
Elementary Statistics (Math 145) June 19, Statistics is the science of collecting, analyzing, interpreting, and presenting data. is the science.
Exploratory data analysis, descriptive measures and sampling or, “How to explore numbers in tables and charts”
Copyright © 2009 Pearson Education, Inc. Chapter 2 Data.
1 By maintaining a good heart at every moment, every day is a good day. If we always have good thoughts, then any time, any thing or any location is auspicious.
Math 145 June 19, Outline 1. Recap 2. Sampling Designs 3. Graphical methods.
By: Michael Mack, Ana Meneses and Zhane’ Fleming.
Statistics Vocabulary. 1. STATISTICS Definition The study of collecting, organizing, and interpreting data Example Statistics are used to determine car.
Data Mining – Introduction (contd…) Compiled By: Umair Yaqub Lecturer Govt. Murray College Sialkot.
IAT 355 Basic Tasks with Interactive Example
Cognition Models Task Models
Chapter 1 Created by Bethany Stubbe and Stephan Kogitz.
UNIT – V BUSINESS ANALYTICS
AP Statistics Variables.
Data Mining Concept Description
Descriptive Statistics
Descriptive Statistics
Elementary Statistics (Math 145)
Software metrics.
Descriptive Statistics
Descriptive Statistics
Multidimensional Scaling
Methods of Acquiring Information
Population Population
Descriptive Statistics
Population Population
Descriptive Statistics
Math 145 January 24, 2007.
Math 145 May 28, 2009.
Comp 15 - Usability & Human Factors
Chapter 4 –Dimension Reduction
Presentation transcript:

User Tasks in Visualization Environments--Eleven basic actions identify, locate, distinguish, categorize, cluster, distribution, rank, compare within relations, compare between relations, associate, correlate identify, locate, distinguish, categorize, cluster, distribution, rank, compare within relations, compare between relations, associate, correlate

Data Types and Tasks

Terminology Data case – An entity in the data set Data case – An entity in the data set Attribute – A value measured for all data cases Attribute – A value measured for all data cases Aggregation function – A function that creates a numeric representation for a set of data cases (eg, average, count, sum) Aggregation function – A function that creates a numeric representation for a set of data cases (eg, average, count, sum)

Steps in Creating Data Visualization 1. Retrieve Value General Description: Given a set of specific cases, find attributes of those cases. Examples: - What is the mileage per gallon of the Audi TT? - How long is the movie Gone with the Wind? 1. Retrieve Value General Description: Given a set of specific cases, find attributes of those cases. Examples: - What is the mileage per gallon of the Audi TT? - How long is the movie Gone with the Wind?

2. Filter General Description: Given some concrete conditions on attribute values, find data cases satisfying those conditions. Examples: - What Kelloggs cereals have high fiber? - What comedies have won awards? - Which funds underperformed the SP-500? General Description: Given some concrete conditions on attribute values, find data cases satisfying those conditions. Examples: - What Kelloggs cereals have high fiber? - What comedies have won awards? - Which funds underperformed the SP-500?

3. Compute Derived Value General Description: Given a set of data cases, compute an aggregate numeric representation of those data cases. Examples: - What is the gross income of all stores combined? - How many manufacturers of cars are there? - What is the average calorie content of Post cereals? General Description: Given a set of data cases, compute an aggregate numeric representation of those data cases. Examples: - What is the gross income of all stores combined? - How many manufacturers of cars are there? - What is the average calorie content of Post cereals?

4. Find Extremum General Description: General Description: Find data cases possessing an extreme value of an attribute over its range within the data set. Examples: - What is the car with the highest MPG? - What director/film has won the most awards? - What Robin Williams film has the most recent release date? Find data cases possessing an extreme value of an attribute over its range within the data set. Examples: - What is the car with the highest MPG? - What director/film has won the most awards? - What Robin Williams film has the most recent release date?

5. Sort General Description: Given a set of data cases, rank them according to some ordinal metric. Examples: - Order the cars by weight. - Rank the cereals by calories. General Description: Given a set of data cases, rank them according to some ordinal metric. Examples: - Order the cars by weight. - Rank the cereals by calories.

6. Determine Range General Description: Given a set of data cases and an attribute of interest, find the span of values within the set. Examples: - What is the range of film lengths? - What is the range of car horsepowers? - What actresses are in the data set?

7. Characterize Distribution General Description: Given a set of data cases and a quantitative attribute of interest, characterize the distribution of that attribute’s values over the set. Examples: - What is the distribution of carbohydrates in cereals? - What is the age distribution of shoppers?

8.Find Anomalies General Description: Identify any anomalies within a given set of data cases with respect to a given relationship or expectation, e.g. statistical outliers. Examples: - Are there any outliers in protein? - Are there exceptions to the relationship between horsepower and acceleration?

9. Cluster General Description: Given a set of data cases, find clusters of similar attribute values. Examples: - Are there groups of cereals w/ similar fat/calories/sugar? - Is there a cluster of typical film lengths? General Description: Given a set of data cases, find clusters of similar attribute values. Examples: - Are there groups of cereals w/ similar fat/calories/sugar? - Is there a cluster of typical film lengths?

10. Correlate General Description: Given a set of data cases and two attributes, determine useful relationships between the values of those attributes. Examples: - Is there a correlation between carbohydrates and fat? - Is there a correlation between country of origin and MPG? - Do different genders have a preferred payment method? - Is there a trend of increasing film length over the years? General Description: Given a set of data cases and two attributes, determine useful relationships between the values of those attributes. Examples: - Is there a correlation between carbohydrates and fat? - Is there a correlation between country of origin and MPG? - Do different genders have a preferred payment method? - Is there a trend of increasing film length over the years?

Compound tasks “Sort the cereal manufacturers by average fat content” Compute derived value; Sort “Sort the cereal manufacturers by average fat content” Compute derived value; Sort “Which actors have co-starred with Julia Roberts?” Filter; Retrieve value “Which actors have co-starred with Julia Roberts?” Filter; Retrieve value

What questions were left out? Basic math “Which cereal has more sugar, Cheerios or Special K?” “Compare the average MPG of American and Japanese cars.” Basic math “Which cereal has more sugar, Cheerios or Special K?” “Compare the average MPG of American and Japanese cars.” Uncertain criteria “Does cereal (q, Y, Zj) sound tasty?” “What are the characteristics of the most valued customers? Uncertain criteria “Does cereal (q, Y, Zj) sound tasty?” “What are the characteristics of the most valued customers? Higher-level tasks “How do mutual funds get rated?” “Are there car aspects that Toyota has concentrated on?” Higher-level tasks “How do mutual funds get rated?” “Are there car aspects that Toyota has concentrated on?” More qualitative comparison “How does the Toyota RAV4 compare to the Honda CRV?” “What other cereals are most similar to Trix?” More qualitative comparison “How does the Toyota RAV4 compare to the Honda CRV?” “What other cereals are most similar to Trix?”

Analytic gaps – “obstacles faced by visualizations in facilitating higher-level analytic tasks, such as decision making and learning.” “obstacles faced by visualizations in facilitating higher-level analytic tasks, such as decision making and learning.”