User Study Evaluation Human-Computer Interaction.

Slides:



Advertisements
Similar presentations
1 COMM 301: Empirical Research in Communication Lecture 15 – Hypothesis Testing Kwan M Lee.
Advertisements

CHAPTER 21 Inferential Statistical Analysis. Understanding probability The idea of probability is central to inferential statistics. It means the chance.
Hypothesis testing Week 10 Lecture 2.
 Once you know the correlation coefficient for your sample, you might want to determine whether this correlation occurred by chance.  Or does the relationship.
PSY 307 – Statistics for the Behavioral Sciences
Using Statistics in Research Psych 231: Research Methods in Psychology.
What z-scores represent
Inferential Stats for Two-Group Designs. Inferential Statistics Used to infer conclusions about the population based on data collected from sample Do.
PY 427 Statistics 1Fall 2006 Kin Ching Kong, Ph.D Lecture 6 Chicago School of Professional Psychology.
Educational Research by John W. Creswell. Copyright © 2002 by Pearson Education. All rights reserved. Slide 1 Chapter 8 Analyzing and Interpreting Quantitative.
Today Concepts underlying inferential statistics
Using Statistics in Research Psych 231: Research Methods in Psychology.
Major Points Formal Tests of Mean Differences Review of Concepts: Means, Standard Deviations, Standard Errors, Type I errors New Concepts: One and Two.
Hypothesis Testing Using The One-Sample t-Test
PSY 307 – Statistics for the Behavioral Sciences
Inferential Statistics
Statistical Analysis. Purpose of Statistical Analysis Determines whether the results found in an experiment are meaningful. Answers the question: –Does.
Chapter Ten Introduction to Hypothesis Testing. Copyright © Houghton Mifflin Company. All rights reserved.Chapter New Statistical Notation The.
AM Recitation 2/10/11.
Overview of Statistical Hypothesis Testing: The z-Test
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 9. Hypothesis Testing I: The Six Steps of Statistical Inference.
© 2011 Pearson Prentice Hall, Salkind. Introducing Inferential Statistics.
Jeopardy Hypothesis Testing T-test Basics T for Indep. Samples Z-scores Probability $100 $200$200 $300 $500 $400 $300 $400 $300 $400 $500 $400.
Tuesday, September 10, 2013 Introduction to hypothesis testing.
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Statistical Analysis Statistical Analysis
Statistics Primer ORC Staff: Xin Xin (Cindy) Ryan Glaman Brett Kellerstedt 1.
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 17 Inferential Statistics.
Copyright © 2008 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 22 Using Inferential Statistics to Test Hypotheses.
The Argument for Using Statistics Weighing the Evidence Statistical Inference: An Overview Applying Statistical Inference: An Example Going Beyond Testing.
Making decisions about distributions: Introduction to the Null Hypothesis 47:269: Research Methods I Dr. Leonard April 14, 2010.
Analyzing and Interpreting Quantitative Data
Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.
Statistics - methodology for collecting, analyzing, interpreting and drawing conclusions from collected data Anastasia Kadina GM presentation 6/15/2015.
Research Seminars in IT in Education (MIT6003) Quantitative Educational Research Design 2 Dr Jacky Pow.
Introduction to Inferential Statistics Statistical analyses are initially divided into: Descriptive Statistics or Inferential Statistics. Descriptive Statistics.
Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?
Jeopardy Hypothesis Testing t-test Basics t for Indep. Samples Related Samples t— Didn’t cover— Skip for now Ancient History $100 $200$200 $300 $500 $400.
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
1 Chapter 8 Introduction to Hypothesis Testing. 2 Name of the game… Hypothesis testing Statistical method that uses sample data to evaluate a hypothesis.
Learning Objectives Copyright © 2002 South-Western/Thomson Learning Statistical Testing of Differences CHAPTER fifteen.
Experimental Research Methods in Language Learning Chapter 10 Inferential Statistics.
Copyright ©2013 Pearson Education, Inc. publishing as Prentice Hall 9-1 σ σ.
Chapter 8 Parameter Estimates and Hypothesis Testing.
Three Broad Purposes of Quantitative Research 1. Description 2. Theory Testing 3. Theory Generation.
Chapter 10 The t Test for Two Independent Samples
Chapter Eight: Using Statistics to Answer Questions.
Data Analysis.
Chapter 6: Analyzing and Interpreting Quantitative Data
Introducing Communication Research 2e © 2014 SAGE Publications Chapter Seven Generalizing From Research Results: Inferential Statistics.
© Copyright McGraw-Hill 2004
Understanding Basic Statistics Fourth Edition By Brase and Brase Prepared by: Lynn Smith Gloucester County College Chapter Nine Hypothesis Testing.
Chapter 13 Understanding research results: statistical inference.
Statistics (cont.) Psych 231: Research Methods in Psychology.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Data Analysis. Qualitative vs. Quantitative Data collection methods can be roughly divided into two groups. It is essential to understand the difference.
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
Inferential Statistics Psych 231: Research Methods in Psychology.
Statistical principles: the normal distribution and methods of testing Or, “Explaining the arrangement of things”
Chapter 9 Hypothesis Testing Understanding Basic Statistics Fifth Edition By Brase and Brase Prepared by Jon Booze.
NURS 306, Nursing Research Lisa Broughton, MSN, RN, CCRN RESEARCH STATISTICS.
15 Inferential Statistics.
Logic of Hypothesis Testing
Understanding Results
Hypothesis Testing.
Psych 231: Research Methods in Psychology
Inferential Statistics
15.1 The Role of Statistics in the Research Process
Psych 231: Research Methods in Psychology
Presentation transcript:

User Study Evaluation Human-Computer Interaction

Hypothesis A statement of prediction Describes what you expect will happen in your study Alternative hypothesis (H 1 ) – your prediction, i.e. a claim of difference in the population e.g. Participants will commit more errors with interface A than with interface B Null hypothesis (H 0 ) – No difference or no effect e.g. Participants will commit the same number of errors between interface A and interface B or Participants will commit more errors in interface B than with interface A

Hypothesis – one or two tailed? Alternative hypothesis One-tailed: Participants will commit more errors with interface A than with interface B (i.e. directional) Two-tailed: There will be a significant difference in the number of errors participants commit with interface A than with interface B but I don’t know if there will be more or fewer (i.e. non- directional) Can’t prove the alternative hypothesis, can only reject the null hypothesis If your prediction was correct – reject null hypothesis Not rejecting null hypothesis ≠ accepting it

Metrics What you are measuring Some types of metrics Objective – facts of an event Time to complete task (continuous) Errors (discrete, i.e. distinct and separate, can be counted) Subjective – a person’s opinion Satisfaction

Metrics Types of metrics Objective – facts of an event Subjective – a person’s opinion *Both* are important How to measure Instrumentation – record data within your system Questionnaires / Surveys Scales Free-response Let’s discuss appropriateness of each Let’s look at a very popular survey (SUS)SUS

Analysis Most of what we do involves: Normal Distributed Results Independent Testing Homogenous Population Recall, we are testing the hypothesis by trying to prove the NULL hypothesis false

Analysis 3 main steps for analysis Data Preparation: Cleaning and organizing the data for analysis Checking the data for accuracy Transforming data (e.g. reverse coding survey data) Descriptive Statistics: Describing the data Provide simple summaries about the sample and the measures Simply describing what is, what the data shows Inferential Statistics: Testing Hypotheses and Models Try to infer from the sample data what the population thinks Make judgments of the probability that an observed difference between groups is a dependable one or one that might have happened by chance

Data preparation Checking data for accuracy Are the responses legible/readable? Are all important questions answered? Are the responses complete? Is all relevant contextual information included (e.g., data, time, place, researcher)?

Data preparation Data transformations Missing values Depending on program, need designate specific values to represent missing values, e.g. -99 Scale totals Add or average across individual items Item reversals Likert scale – sometimes rating for items need to be reversed 1 (strongly disagree) – 5 (strongly agree) “I generally feel good about myself.” “Sometimes I feel like I'm not worth much as a person.” What does a 5 mean in each case?

Descriptive statistics Simple summaries of sample and measures, i.e. data Describing what is or what the data shows Central tendency – estimate of the “center” of a distribution of values Mean – average across a set of values 15, 15, 18, 25, 33 = 106 µ = 106/5 = 21.2 Median – score found in middle of a set of values 15, 15, 18, 25, 33 Mode – most frequently occurring value 15, 15, 18, 25, 33 Describe the data with a number and a graph

Inferential statistics Try to reach conclusions that go beyond the immediate data – draw inferences e.g. want to compare the average performance of 2 groups to see if there’s a difference t-test: statistical test used to determine whether two observed means are statistically different

t-test What does it mean to say that the averages for two groups are statistically different?

t-test Variability is the noise that may make it harder to see the group difference Variance: measure of variability around the mean Standard deviation: s quare root of the variance

t – test (rule of thumb) Good values of t > 1.96 (standard deviations from the mean)

t-test Once computed, look up t-value to see whether the ratio is large enough to say that the difference between the groups is not likely to have been a chance finding. To test the significance, you need to set a risk level (called the alpha level). Accepted standard is alpha level of times out of 100 you would find a statistically significant difference between the means even if there was none (i.e., by "chance"). Degrees of freedom (df). For t-test, the df = sum of the persons in both groups minus 2. Given the alpha level, the df, and the t-value, look up t-value to determine whether the t-value is large enough to be significant. If yes, conclude that difference between means for the 2 groups is different (even given the variability) and reject null hypothesis.

α and p values α value – probability of making a Type I error (rejecting null hypothesis when really true) p value – probability that the effect found did not occur by chance. The lower the p value, the higher the statistical significance (the more rigorous the test)

Relationship between α and p values Once the alpha level has been set, a statistic (like t) is computed. Each statistic has an associated probability value called a p- value, or the likelihood of an observed statistic occurring due to chance, given the sampling distribution. Alpha sets the standard for how extreme the data must be before we can reject the null hypothesis. The p-value indicates how extreme the data are. Compare the p-value with alpha to determine whether the observed data are statistically significantly different from the null hypothesis

Kinds of t-tests Formula is slightly different for each: Single-sample: tests whether a sample mean is significantly different from a pre-existing value (e.g. norms) Paired-samples: tests the relationship between 2 linked samples, e.g. means obtained in 2 conditions by a single group of participants Independent-samples: tests the relationship between 2 independent populations Which test fits your situation?

t and alpha values

Independent samples t-test Example: social presence questionnaire “I perceived I was in the presence of a patient in the room with me.”

Correlations Correlations – relationship between two variables Pearon’s product-moment correlation coefficient – r

Correlations Pearson’s product-moment correlation coefficient – r m/tests/pearson/Default2.asp x rrelation_and_dependence