Human-Computer Interaction. Overview What is a study? Empirically testing a hypothesis Evaluate interfaces Why run a study? Determine ‘truth’ Evaluate.

Slides:



Advertisements
Similar presentations
Our goal is to assess the evidence provided by the data in favor of some claim about the population. Section 6.2Tests of Significance.
Advertisements

Our goal is to assess the evidence provided by the data in favor of some claim about the population. Section 6.2Tests of Significance.
Decision Errors and Power
Statistical Issues in Research Planning and Evaluation
Conducting a User Study Human-Computer Interaction.
Tim Wiemken, PhD MPH CIC Assistant Professor Division of Infectious Diseases University of Louisville, Kentucky Planning Your Study Statistical Issues.
Chapter 10: Hypothesis Testing
Comparing Two Population Means The Two-Sample T-Test and T-Interval.
BHS Methods in Behavioral Sciences I April 25, 2003 Chapter 6 (Ray) The Logic of Hypothesis Testing.
EPIDEMIOLOGY AND BIOSTATISTICS DEPT Esimating Population Value with Hypothesis Testing.
PSY 307 – Statistics for the Behavioral Sciences
Evaluating Hypotheses Chapter 9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics.
Understanding Statistics in Research
Lecture 9: One Way ANOVA Between Subjects
Chapter Sampling Distributions and Hypothesis Testing.
Independent Sample T-test Often used with experimental designs N subjects are randomly assigned to two groups (Control * Treatment). After treatment, the.
PY 427 Statistics 1Fall 2006 Kin Ching Kong, Ph.D Lecture 6 Chicago School of Professional Psychology.
Ch. 9 Fundamental of Hypothesis Testing
Chapter 14 Inferential Data Analysis
Statistical Analysis. Purpose of Statistical Analysis Determines whether the results found in an experiment are meaningful. Answers the question: –Does.
AM Recitation 2/10/11.
Chapter 4 Hypothesis Testing, Power, and Control: A Review of the Basics.
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 9. Hypothesis Testing I: The Six Steps of Statistical Inference.
Jeopardy Hypothesis Testing T-test Basics T for Indep. Samples Z-scores Probability $100 $200$200 $300 $500 $400 $300 $400 $300 $400 $500 $400.
Statistical Analysis Statistical Analysis
Conducting a User Study Human-Computer Interaction.
Comparing Two Population Means
Sample size determination Nick Barrowman, PhD Senior Statistician Clinical Research Unit, CHEO Research Institute March 29, 2010.
Spring /6.831 User Interface Design and Implementation1 Lecture 15: Experiment Analysis.
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 17 Inferential Statistics.
The Argument for Using Statistics Weighing the Evidence Statistical Inference: An Overview Applying Statistical Inference: An Example Going Beyond Testing.
Individual values of X Frequency How many individuals   Distribution of a population.
Learning Objectives In this chapter you will learn about the t-test and its distribution t-test for related samples t-test for independent samples hypothesis.
January 31 and February 3,  Some formulae are presented in this lecture to provide the general mathematical background to the topic or to demonstrate.
User Study Evaluation Human-Computer Interaction.
Conducting a User Study Human-Computer Interaction.
Exam Exam starts two weeks from today. Amusing Statistics Use what you know about normal distributions to evaluate this finding: The study, published.
Conducting a User Study Human-Computer Interaction.
Maximum Likelihood Estimator of Proportion Let {s 1,s 2,…,s n } be a set of independent outcomes from a Bernoulli experiment with unknown probability.
PowerPoint presentation to accompany Research Design Explained 6th edition ; ©2007 Mark Mitchell & Janina Jolley Chapter 10 The Simple Experiment.
Introduction to Inferential Statistics Statistical analyses are initially divided into: Descriptive Statistics or Inferential Statistics. Descriptive Statistics.
Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Fundamentals of Hypothesis Testing: One-Sample Tests Statistics.
Experimental Design and Statistics. Scientific Method
Experimental Psychology PSY 433 Appendix B Statistics.
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
F, t, and p Basic Statistics for Computer Scientists (aka knowing enough to be critical of user studies) April 4, 2002 Benjamin Lok.
1.1 Statistical Analysis. Learning Goals: Basic Statistics Data is best demonstrated visually in a graph form with clearly labeled axes and a concise.
Hypothesis Testing. Why do we need it? – simply, we are looking for something – a statistical measure - that will allow us to conclude there is truly.
Scientific Method Probability and Significance Probability Q: What does ‘probability’ mean? A: The likelihood that something will happen Probability.
Chap 8-1 Fundamentals of Hypothesis Testing: One-Sample Tests.
But! Let’s first review…
Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics.
Welcome to MM570 Psychological Statistics
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 9-1 Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests Business Statistics,
Statistical Analysis. Null hypothesis: observed differences are due to chance (no causal relationship) Ex. If light intensity increases, then the rate.
1 Foundations of Research Cranach, Tree of Knowledge [of Good and Evil] (1472) Click “slide show” to start this presentation as a show. Remember: focus.
BHS Methods in Behavioral Sciences I May 9, 2003 Chapter 6 and 7 (Ray) Control: The Keystone of the Experimental Method.
BIOL 582 Lecture Set 2 Inferential Statistics, Hypotheses, and Resampling.
Chapter 7 Inference Concerning Populations (Numeric Responses)
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
Inferential Statistics Psych 231: Research Methods in Psychology.
Psychological Experimentation The Experimental Method: Discovering the Causes of Behavior Experiment: A controlled situation in which the researcher.
Understanding Results
Conducting a User Study
Hypothesis Testing.
What are their purposes? What kinds?
Type I and Type II Errors
BHS Methods in Behavioral Sciences I
Presentation transcript:

Human-Computer Interaction

Overview What is a study? Empirically testing a hypothesis Evaluate interfaces Why run a study? Determine ‘truth’ Evaluate if a statement is true

Example Overview Ex. The heavier a person weighs, the higher their blood pressure Many ways to do this: Look at data from a doctor’s office Descriptive design: What’s the pros and cons? Get a group of people to get weighed and measure their BP Analytic design: What’s the pros and cons? Ideally? Ideal solution: have everyone in the world get weighed and BP Participants are a sample of the population You should immediately question this! Restrict population

Study Components Design Hypothesis Population Task Metrics Procedure Data Analysis Conclusions Confounds/Biases

Study Design How are we going to evaluate the interface? Hypothesis What do you want to find out? Population Who? Metrics How will you measure?

Hypothesis Statement that you want to evaluate Ex. A mouse is faster than a keyboard for numeric entry Create a hypothesis Ex. Participants using a keyboard to enter a string of numbers will take less time than participants using a mouse. Identify Independent and Dependent Variables Independent Variable – the variable that is being manipulated by the experimenter (interaction method) Dependent Variable – the variable that is caused by the independent variable. (time)

Hypothesis Testing Hypothesis: People who use a mouse and keyboard will be faster to fill out a form than keyboard alone. US Court system: Innocent until proven guilty NULL Hypothesis: Assume people who use a mouse and keyboard will fill out a form in the same amount of time as keyboard alone Your job to prove differently! Alternate Hypothesis 1: People who use a mouse and keyboard will fill out a form faster than keyboard alone. Alternate Hypothesis 2: People who use a mouse and keyboard will fill out a form slower than keyboard alone.

Population The people going through your study Type - Two general approaches Have lots of people from the general public Results are generalizable Logistically difficult People will always surprise you with their variance Select a niche population Results more constrained Lower variance Logistically easier Number The more, the better How many is enough? Logistics Recruiting (n>20 is pretty good)

Two Group Design Design Study Groups of participants are called conditions How many participants? Do the groups need the same # of participants? What’s your design? What are the independent and dependent variables?

Design External validity – do your results mean anything? Results should be similar to other similar studies Use accepted questionnaires, methods Power – how much meaning do your results have? The more people the more you can say that the participants are a sample of the population Pilot your study Generalization – how much do your results apply to the true state of things

Design People who use a mouse and keyboard will be faster to fill out a form than keyboard alone. Let’s create a study design Hypothesis Population Procedure Two types: Between Subjects Within Subjects

Procedure Formally have all participants sign up for a time slot (if individual testing is needed) Informed Consent (let’s look at one) Execute study Questionnaires/Debriefing (let’s look at one)

Biases Hypothesis Guessing Participants guess what you are trying hypothesis Experimenter Bias Subconscious bias of data and evaluation to find what you want to find Systematic Bias bias resulting from a flaw integral to the system E.g. an incorrectly calibrated thermostat) List of biases

Confounds Confounding factors – factors that affect outcomes, but are not related to the study Population confounds Who you get? How you get them? How you reimburse them? How do you know groups are equivalent? Design confounds Unequal treatment of conditions Learning Time spent

Metrics What you are measuring Types of metrics Objective Time to complete task Errors Ordinal/Continuous Subjective Satisfaction Pros/Cons of each type?

Analysis Most of what we do involves: Normal Distributed Results Independent Testing Homogenous Population

Raw Data Keyboard times E.g. 3.4, 4.4, 5.2, 4.8, 10.1, 1.1, 2.2 Mean = 4.46 Variance = 7.14 (Excel’s VARP) Standard deviation = 2.67 (sqrt variance) What do the different statistical data tell us?

What does Raw Data Mean?

Roll of Chance How do we know how much is the ‘truth’ and how much is ‘chance’? How much confidence do we have in our answer?

Hypothesis We assumed the means are “equal” But are they? Or is the difference due to chance? Ex. A μ 0 = 4, μ 1 = 4.1 Ex. B μ 0 = 4, μ 1 = 6

T - test T – test – statistical test used to determine whether two observed means are statistically different

T-test Distributions

T – test (rule of thumb) Good values of t > 1.96 Look at what contributes to t

F statistic (ANOVA), p values F statistic – assesses the extent to which the means of the experimental conditions differ more than would be expected by chance t is related to F statistic Look up a table, get the p value. Compare to α α value – probability of making a Type I error (rejecting null hypothesis when really true) p value – statistical likelihood of an observed pattern of data, calculated on the basis of the sampling distribution of the statistic. (% chance it was due to chance)

Significance What does it mean to be significant? You have some confidence it was not due to chance. But difference between statistical significance and meaningful significance Significance is not a measure of the “size” of the difference Always know: samples (n) p value variance/standard deviation means

IRB Let’s look at a completed one You MUST turn one in before you complete a study Must have OKed before running study

Let’s Design a Study! Random Ideas for studies: gas tank size vs searching for parking spaces type of cell phone and video game play glasses or contacts impact social interaction? cell phone signals and driving performance virtual reality and name association Do guitar hero skills translate to music skills?