Data Collection and Analysis

Slides:



Advertisements
Similar presentations
T-tests continued.
Advertisements

Independent t -test Features: One Independent Variable Two Groups, or Levels of the Independent Variable Independent Samples (Between-Groups): the two.
Analysis of Variance: ANOVA. Group 1: control group/ no ind. Var. Group 2: low level of the ind. Var. Group 3: high level of the ind var.
Statistics for Managers Using Microsoft® Excel 5th Edition
BCOR 1020 Business Statistics
Statistics for the Social Sciences Psychology 340 Spring 2005 Within Groups ANOVA.
S519: Evaluation of Information Systems
 What is t test  Types of t test  TTEST function  T-test ToolPak 2.
Independent Sample T-test Classical design used in psychology/medicine N subjects are randomly assigned to two groups (Control * Treatment). After treatment,
Go to Table of ContentTable of Content Analysis of Variance: Randomized Blocks Farrokh Alemi Ph.D. Kashif Haqqi M.D.
Statistical Analysis. Purpose of Statistical Analysis Determines whether the results found in an experiment are meaningful. Answers the question: –Does.
Chapter 12: Analysis of Variance
SBD: Usability Evaluation
Statistical Analysis Statistical Analysis
Evaluation: Controlled Experiments Chris North cs3724: HCI.
Statistical Analysis A Quick Overview. The Scientific Method Establishing a hypothesis (idea) Collecting evidence (often in the form of numerical data)
Design and Evaluation. Overview Formal Evaluations of Visualization Techniques Design (review) Evaluation/Critique of Visualizations (what we’ve been.
S519: Evaluation of Information Systems Social Statistics Inferential Statistics Chapter 10: t test.
ANOVA (Analysis of Variance) by Aziza Munir
Basic concept Measures of central tendency Measures of central tendency Measures of dispersion & variability.
Comparing Three or More Means ANOVA (One-Way Analysis of Variance)
Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?
Analysis of Variance 1 Dr. Mohammed Alahmed Ph.D. in BioStatistics (011)
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 9-1 Chapter 9 Two-Sample Tests Statistics for Managers Using Microsoft.
Learning Objectives Copyright © 2002 South-Western/Thomson Learning Statistical Testing of Differences CHAPTER fifteen.
Chapter Seventeen. Figure 17.1 Relationship of Hypothesis Testing Related to Differences to the Previous Chapter and the Marketing Research Process Focus.
9.2 Testing the Difference Between Two Means: Using the t Test
SBD: Usability Evaluation Chris North cs3724: HCI.
Math 4030 – 9b Comparing Two Means 1 Dependent and independent samples Comparing two means.
Design and Evaluation. Design Use Field Guide to identify information relevant to planning visualization.Field Guide Formally plan visualization using.
SBD: Usability Evaluation Chris North CS 3724: HCI.
Empirical Evaluation Chris North cs5984: Information Visualization.
Analysis of Variance 11/6. Comparing Several Groups Do the group means differ? Naive approach – Independent-samples t-tests of all pairs – Each test doesn't.
Testing Differences in Means (t-tests) Dr. Richard Jackson © Mercer University 2005 All Rights Reserved.
Lecture 8 Estimation and Hypothesis Testing for Two Population Parameters.
S519: Evaluation of Information Systems Social Statistics Inferential Statistics Chapter 9: t test.
When the means of two groups are to be compared (where each group consists of subjects that are not related) then the excel two-sample t-test procedure.
Statistics Made Simple
Chapter 11 Analysis of Variance
Statistical Significance
Dependent-Samples t-Test
Chapter 10 Two Sample Tests
Statistics made simple Dr. Jennifer Capers
STATISTICS FOR SCIENCE RESEARCH
Estimation & Hypothesis Testing for Two Population Parameters
Math 4030 – 10a Tests for Population Mean(s)
Hypothesis testing using contrasts
SBD: Usability Evaluation
ANOVA 11/3.
Post Hoc Tests on One-Way ANOVA
Post Hoc Tests on One-Way ANOVA
CJ 526 Statistical Analysis in Criminal Justice
Inferential Statistics
Kin 304 Inferential Statistics
AP Biology Intro to Statistics
Statistical Analysis Determining the Significance of Data
Statistics for the Social Sciences
Last class Tutorial 1 Census Overview
Hypothesis Testing and Comparing Two Proportions
Overview and Chi-Square
One-Way Analysis of Variance
Statistics Made Simple
Psych 231: Research Methods in Psychology
Psych 231: Research Methods in Psychology
Psych 231: Research Methods in Psychology
Facts from figures Having obtained the results of an investigation, a scientist is faced with the prospect of trying to interpret them. In some cases the.
Psych 231: Research Methods in Psychology
Hypothesis Testing S.M.JOSHI COLLEGE ,HADAPSAR
(This presentation may be used for instructional purposes)
Inference for Distributions
Presentation transcript:

Data Collection and Analysis

Scientific Method Form hypothesis Collect data Analyze Accept/reject hypothesis

Empirical Experiment Typical question: Lifelines PerspectiveWall Which visualization is better in a situation? Lifelines PerspectiveWall

Question Does Vis Tool (Lifelines or PerspWall) have an effect on user performance time for task X? Null hypothesis: No effect Lifelines = PerspWall Want to disprove, provide counter-example, show an effect

Variables Independent variables (what you vary): Tool or technique (Lifelines, Perspective Wall) Task type (find, count, compare) Data size (100, 1000, 1000000) Dependent variables (what you measure): User performance time Errors Subjective satisfaction (survey) HCI metrics

Example: 2 x 3 design Task1 Task2 Task3 Life-Lines Persp. Wall Ind Var 2: Task Type Task1 Task2 Task3 Life-Lines Persp. Wall n users per cell Ind Var 1: Vis. Tool Measured user performance times (dep var)

Groups “Between subjects” variable 1 group of users for each variable treatment Group 1: 20 users, Lifelines Group 2: 20 users, PerspWall Total: 40 users, 20 per cell “With-in subjects” (repeated) variable All users perform all treatments Counter-balancing order effect Group 1: 20 users, Lifelines then PerspWall Group 2: 20 users, PerspWall then Lifelines Total: 40 users, 40 per cell

Data Measure dependent variables Spreadsheet: Lifelines task 1, 2, 3, PerspWall task 1, 2, 3

Averages Task1 Task2 Task3 Life-Lines 37.2 54.5 103.7 Persp. Wall 29.8 Ind Var 2: Task Type Task1 Task2 Task3 Life-Lines 37.2 54.5 103.7 Persp. Wall 29.8 53.2 145.4 Ind Var 1: Vis. Tool Measured user performance times (dep var)

PerspWall better than Lifelines? Problem with averages: lossy Compares only 2 numbers What about the 40 data values? Perf time (secs) Lifelines PerspWall

Another Picture Need stats that take all data into account Perf time (secs) Lifelines PerspWall

Statistics t-test ANOVA: ANalysis Of VAriance Compares 1 dep var on 2 treatments of 1 ind var ANOVA: ANalysis Of VAriance Compares 1 dep var on n treatments of m ind vars Result: “significant difference” between treatments? p = significance level (confidence) typical cut-off: p < 0.05

Statistics in Microsoft Excel Enter data into a spreadsheet Go to Tools…, Data Analysis… (may need to choose Analysis Toolpak from Addins first) Select appropriate analysis

t-tests in Excel Used to compare two groups of data Most common is “t-test: two-sample assuming equal variances” Other t-tests: Paired two-sample for means Two-sample assuming unequal variances

ANOVAs in Excel Allows for more than two groups of data to be compared Most common is “ANOVA: Single factor analysis” Other ANOVAs: ANOVA: Two-factor with replication ANOVA: Two-factor without replication

p < 0.05 Found a “statistically significant difference” Averages determine which is ‘better’ Conclusion: Vis Tool has an “effect” on user performance for task1 PerspWall better user performance than Lifelines for task1 “95% confident that PerspWall better than Lifelines” Not “PerspWall beats Lifelines 95% of time” Found a counterexample to the null hypothesis Null hypothesis: Lifelines = PerspWall Hence: Lifelines  PerspWall

p > 0.05 Hence, same? Be careful! How? Vis Tool has no effect on user performance for task1? Lifelines = PerspWall ? Be careful! We did not detect a difference, but could still be different Did not find a counter-example to null hypothesis Provides evidence for Lifelines = PerspWall, but not proof Boring! Basically found nothing How? Not enough users (other tests can verify this) Need better tasks, data, …

Reporting Results Often considered the most important section of professional papers Statistics NOT the most important part of the results section Statistics used to back up differences described in a figure or table

Reporting Means, SDs, t-tests Give means and standard deviations, then t-test … the mean number was significantly greater in condition 1 (M=9.13, SD=2.52) than in condition 2 (M=5.66, SD=3.01), t(44)=3.45, p=.01

What Are Those Numbers? … the mean number was significantly greater in condition 1 (M=9.13, SD=2.52) than in condition 2 (M=5.66, SD=3.01), t(44)=3.45, p=.01 M is the mean SD is the standard deviation t is the t stat the number in parentheses is the degrees of freedom (df) p is the probability the difference occurred by chance

Reporting ANOVAs … for the three conditions, F(2,52)=17.24, MSE=4528.75, p<.001 F(x,y) -- F value for x between groups and y within groups degrees of freedom (df) MSE -- mean square error for the between groups condition p -- probability that difference occurred by chance