F, t, and p Basic Statistics for Computer Scientists (aka knowing enough to be critical of user studies) April 4, 2002 Benjamin Lok.

Slides:



Advertisements
Similar presentations
Our goal is to assess the evidence provided by the data in favor of some claim about the population. Section 6.2Tests of Significance.
Advertisements

One sample means Testing a sample mean against a population mean.
Lecture (11,12) Parameter Estimation of PDF and Fitting a Distribution Function.
Review of the Basic Logic of NHST Significance tests are used to accept or reject the null hypothesis. This is done by studying the sampling distribution.
Conducting a User Study Human-Computer Interaction.
BHS Methods in Behavioral Sciences I April 25, 2003 Chapter 6 (Ray) The Logic of Hypothesis Testing.
 Once you know the correlation coefficient for your sample, you might want to determine whether this correlation occurred by chance.  Or does the relationship.
EPIDEMIOLOGY AND BIOSTATISTICS DEPT Esimating Population Value with Hypothesis Testing.
PSY 307 – Statistics for the Behavioral Sciences
Statistics Sample: Descriptive Statistics Population: Inferential Statistics.
DATA ANALYSIS I MKT525. Plan of analysis What decision must be made? What are research objectives? What do you have to know to reach those objectives?
Inferential Stats for Two-Group Designs. Inferential Statistics Used to infer conclusions about the population based on data collected from sample Do.
BHS Methods in Behavioral Sciences I
Understanding Statistics in Research
Lecture 9: One Way ANOVA Between Subjects
Hypothesis Testing. Introduction Always about a population parameter Attempt to prove (or disprove) some assumption Setup: alternate hypothesis: What.
Sample Size Determination In the Context of Hypothesis Testing
Chapter 9 Hypothesis Testing.
Introduction to Analysis of Variance (ANOVA)
Statistical Analysis. Purpose of Statistical Analysis Determines whether the results found in an experiment are meaningful. Answers the question: –Does.
Inferential Statistics
INFERENTIAL STATISTICS – Samples are only estimates of the population – Sample statistics will be slightly off from the true values of its population’s.
Choosing Statistical Procedures
+ Quantitative Statistics: Chi-Square ScWk 242 – Session 7 Slides.
Psy B07 Chapter 1Slide 1 ANALYSIS OF VARIANCE. Psy B07 Chapter 1Slide 2 t-test refresher  In chapter 7 we talked about analyses that could be conducted.
AM Recitation 2/10/11.
Jeopardy Hypothesis Testing T-test Basics T for Indep. Samples Z-scores Probability $100 $200$200 $300 $500 $400 $300 $400 $300 $400 $500 $400.
Statistical Analysis Statistical Analysis
Statistics Primer ORC Staff: Xin Xin (Cindy) Ryan Glaman Brett Kellerstedt 1.
RMTD 404 Lecture 8. 2 Power Recall what you learned about statistical errors in Chapter 4: Type I Error: Finding a difference when there is no true difference.
Sample size determination Nick Barrowman, PhD Senior Statistician Clinical Research Unit, CHEO Research Institute March 29, 2010.
Topics: Statistics & Experimental Design The Human Visual System Color Science Light Sources: Radiometry/Photometry Geometric Optics Tone-transfer Function.
Statistical Analysis Mean, Standard deviation, Standard deviation of the sample means, t-test.
User Study Evaluation Human-Computer Interaction.
Go to index Two Sample Inference for Means Farrokh Alemi Ph.D Kashif Haqqi M.D.
Conducting a User Study Human-Computer Interaction.
PSY 307 – Statistics for the Behavioral Sciences Chapter 16 – One-Factor Analysis of Variance (ANOVA)
Conducting a User Study Human-Computer Interaction.
Chapter 10: Analyzing Experimental Data Inferential statistics are used to determine whether the independent variable had an effect on the dependent variance.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
The binomial applied: absolute and relative risks, chi-square.
Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?
Human-Computer Interaction. Overview What is a study? Empirically testing a hypothesis Evaluate interfaces Why run a study? Determine ‘truth’ Evaluate.
Learning Objectives Copyright © 2002 South-Western/Thomson Learning Statistical Testing of Differences CHAPTER fifteen.
Experimental Design and Statistics. Scientific Method
1.1 Statistical Analysis. Learning Goals: Basic Statistics Data is best demonstrated visually in a graph form with clearly labeled axes and a concise.
Statistical Inference An introduction. Big picture Use a random sample to learn something about a larger population.
Three Broad Purposes of Quantitative Research 1. Description 2. Theory Testing 3. Theory Generation.
Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics.
Statistics in IB Biology Error bars, standard deviation, t-test and more.
1 URBDP 591 A Lecture 12: Statistical Inference Objectives Sampling Distribution Principles of Hypothesis Testing Statistical Significance.
Inferential Statistics Significance Testing Chapter 4.
© Copyright McGraw-Hill 2004
The Analysis of Variance ANOVA
BIOL 582 Lecture Set 2 Inferential Statistics, Hypotheses, and Resampling.
Chapter 7: Hypothesis Testing. Learning Objectives Describe the process of hypothesis testing Correctly state hypotheses Distinguish between one-tailed.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Statistics Made Simple
Hypothesis Testing Hypothesis testing is an inferential process
Chapter 9 Hypothesis Testing.
As in the books, not you personally!
Math 4030 – 10a Tests for Population Mean(s)
Hypothesis Testing Review
Conducting a User Study
Hypothesis Testing: Hypotheses
Hypothesis Testing.
Statistics Made Simple
Experimental Design Data Normal Distribution
Intro to Confidence Intervals Introduction to Inference
Statistical Inference for the Mean: t-test
Presentation transcript:

F, t, and p Basic Statistics for Computer Scientists (aka knowing enough to be critical of user studies) April 4, 2002 Benjamin Lok

User Studies Trying to identify phenomena or trends –Hypothesis Blood pressure increases with age and weight Smoking increases risk of cancer Real objects in VEs improve performance –How might we investigate this?

Variables and Conditions Hypothesis: Real objects in VEs improve performance Independent Variable – the variable that is being manipulated by the experimenter (VE type) Dependent Variable – the variable that is caused by the independent variable. (performance) Experimental conditions – The level of independent variable in which the situation of interest was created.

Descriptive Statistics Hypothesis: Real objects in VEs improve performance null hypothesis - assume real objects in VEs are the SAME as virtual objects in VEs Innocent until proven guilty Your job: Prove otherwise! alternate hypothesis – interacting with real objects is better than interacting with virtual objects

Raw Data What does the mean tell us? Is that enough?

Variances standard deviation – measure of dispersion (square root of the sum of squares divided by N) Small Pattern (seconds)Large Pattern (seconds) MeanS.D.MinMaxMeanS.D.MinMax Real Space (n=41) Purely Virtual (n=13) Hybrid (n=13) Vis Faith Hybrid (n=14)

Hypothesis We assumed the means are “equal” But are they? Or is the difference due to chance? Small Pattern (seconds)Large Pattern (seconds) MeanS.D.MinMaxMeanS.D.MinMax Real Space (n=41) Purely Virtual (n=13) Hybrid (n=13) Vis Faith Hybrid (n=14)

T - test T – test – statistical test used to determine whether two observed means are statistically different

T – test (rule of thumb) Good values of t > 1.96 Look at what contributes to t tm

F statistic, p values F statistic – assesses the extent to which the means of the experimental conditions differ more than would be expected by chance t is related to F statistic Look up a table, get the p value. Compare to α α value – probability of making a Type I error (rejecting null hypothesis when really true) p value – statistical likelihood of an observed pattern of data, calculated on the basis of the sampling distribution of the statistic. (% chance it was due to chance)

Let’s look at data Small PatternLarge Pattern t – test with unequal variance p – value t – test with unequal variance p - value PVE – RSE vs. VFHE – RSE ** *** PVE – RSE vs. HE – RSE ** * VFHE – RSE vs. HE – RSE

Total Sense of Presence Score Scale from 0..6 Mea n S.D Mi n Ma x Purely VE Hybrid VE Visually Faithful Hybrid VE Between Groups Total Sense of Presence t – test with unequal variance p – value PVE – VFHE PVE – HE VFHE – HE

Significance What does it mean to be significant? You have some confidence it was not due to chance. But difference between statistical significance and meaningful significance Always know: –samples (n) –p value –variance/standard deviation –means