CS 594: Empirical Methods in HCC Experimental Research in HCI (Part 1)

Slides:



Advertisements
Similar presentations
Jamil A. Malik (PhD) National Institute of Psychology Quaid-e-Azam Univeristy.
Advertisements

Lecture (11,12) Parameter Estimation of PDF and Fitting a Distribution Function.
Statistical Issues in Research Planning and Evaluation
Chapter 3 Hypothesis Testing. Curriculum Object Specified the problem based the form of hypothesis Student can arrange for hypothesis step Analyze a problem.
C82MCP Diploma Statistics School of Psychology University of Nottingham 1 Overview of Lecture Independent and Dependent Variables Between and Within Designs.
Richard M. Jacobs, OSA, Ph.D.
Chapter 12 Inferential Statistics Gay, Mills, and Airasian
Inferential Statistics
Choosing Statistical Procedures
AM Recitation 2/10/11.
Difference Two Groups 1. Content Experimental Research Methods: Prospective Randomization, Manipulation Control Research designs Validity Construct Internal.
Chapter 4 Hypothesis Testing, Power, and Control: A Review of the Basics.
Chapter 8 Introduction to Hypothesis Testing
Applying Science Towards Understanding Behavior in Organizations Chapters 2 & 3.
Fundamentals of Data Analysis Lecture 4 Testing of statistical hypotheses.
T tests comparing two means t tests comparing two means.
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 17 Inferential Statistics.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Statistical Power 1. First: Effect Size The size of the distance between two means in standardized units (not inferential). A measure of the impact of.
Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.
Experimental Design: One-Way Correlated Samples Design
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Copyright  2003 by Dr. Gallimore, Wright State University Department of Biomedical, Industrial Engineering & Human Factors Engineering Human Factors Research.
Introduction to Inferential Statistics Statistical analyses are initially divided into: Descriptive Statistics or Inferential Statistics. Descriptive Statistics.
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
Review Hints for Final. Descriptive Statistics: Describing a data set.
Experimental Design and Statistics. Scientific Method
Experimental Psychology PSY 433 Appendix B Statistics.
Academic Research Academic Research Dr Kishor Bhanushali M
Question paper 1997.
Inferential Statistics. The Logic of Inferential Statistics Makes inferences about a population from a sample Makes inferences about a population from.
 Descriptive Methods ◦ Observation ◦ Survey Research  Experimental Methods ◦ Independent Groups Designs ◦ Repeated Measures Designs ◦ Complex Designs.
Experimentation in Computer Science (Part 2). Experimentation in Software Engineering --- Outline  Empirical Strategies  Measurement  Experiment Process.
T tests comparing two means t tests comparing two means.
Introducing Communication Research 2e © 2014 SAGE Publications Chapter Seven Generalizing From Research Results: Inferential Statistics.
Introduction to Validity True Experiment – searching for causality What effect does the I.V. have on the D.V. Correlation Design – searching for an association.
T tests comparing two means t tests comparing two means.
Chapter 13 Understanding research results: statistical inference.
Chapter 7: Hypothesis Testing. Learning Objectives Describe the process of hypothesis testing Correctly state hypotheses Distinguish between one-tailed.
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
NURS 306, Nursing Research Lisa Broughton, MSN, RN, CCRN RESEARCH STATISTICS.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Some Terminology experiment vs. correlational study IV vs. DV descriptive vs. inferential statistics sample vs. population statistic vs. parameter H 0.
Methods of Presenting and Interpreting Information Class 9.
Logic of Hypothesis Testing
Dependent-Samples t-Test
9 research designs likely for PSYC 2100
INF397C Introduction to Research in Information Studies Spring, Day 12
Unit 3 Hypothesis.
Understanding Results
Hypothesis Testing and Confidence Intervals (Part 1): Using the Standard Normal Lecture 8 Justin Kern October 10 and 12, 2017.
APPROACHES TO QUANTITATIVE DATA ANALYSIS
© LOUIS COHEN, LAWRENCE MANION AND KEITH MORRISON
12 Inferential Analysis.
Hypothesis Testing: Hypotheses
© 2011 The McGraw-Hill Companies, Inc.
Internal Validity – Control through
Introduction to Inferential Statistics
2 independent Groups Graziano & Raulin (1997).
12 Inferential Analysis.
Wrap-up and Course Review
CS 594: Empirical Methods in HCC Experimental Research in HCI (Part 2)
UNDERSTANDING RESEARCH RESULTS: STATISTICAL INFERENCE
Psych 231: Research Methods in Psychology
Psych 231: Research Methods in Psychology
Product moment correlation
What are their purposes? What kinds?
Inferential Statistics
Research Methods & Statistics
Rest of lecture 4 (Chapter 5: pg ) Statistical Inferences
Presentation transcript:

CS 594: Empirical Methods in HCC Experimental Research in HCI (Part 1) Dr. Debaleena Chattopadhyay Department of Computer Science debchatt@uic.edu debaleena.com hci.cs.uic.edu

Agenda Overview Sampling Significance Testing Study Design Parametric Statistics Correlation Regression T-test ANOVA Multilevel Linear Modeling

Overview Experimental research is used in HCI to answer questions of causality. To show how the manipulation of one variable of interest has a direct causal influence on another variable of interest Exp. Research in HCI builds upon the tradition of psychology, sociology, cognitive science, information science, and broadly social science. Exp. Research in HCI can be theoretically driven or engineering driven.

Advantages and Limitations of Exp. Research Internal validity Demonstrate a strong causal connection Provides a systematic process to test theoretical propositions and advance theory Disadvantages Requires well-defined, testable hypotheses, and a small set of well-controlled variables. Risk of low external and ecological validity A poorly executed experiment may have the veneer of “scientific validity” because of the methodological rigor, but ultimately provides little more than well-measured noise. External Validity. External validity refers to whether the study results can be generalized to the population that you are studying across setting and time. Ecological Validity. Ecological validity tells us whether or not the findings can be generalized to real-world settings.

Sampling

Sampling (cont….) Non-probability sampling Probability sampling Snowball sampling Quota sampling Probability sampling Random selection To err is human, to randomly err is statistically divine.

Hypothesis Formulation Precise Meaningful Testable Falsifiable A hypothesis both defi nes the variables involved and the relationship between them , and can take many forms: A causes B; A is larger, faster, or more enjoyable than B; etc.

Study Design Independent and Dependent variable Operational definition Manipulation check and check for operational confounds Reliability and Validity of the dependent variable Rules for quantifying Scope and boundaries of what is to be measured Face validity, concurrent validity, predictive validity Use standardized measures whenever you can Consider sensitivity and practicality

Significance Testing The first step in null hypothesis significance testing is to formulate the original research hypothesis as a null hypothesis and an alternative hypothesis. The null hypothesis (H0 ) is set up as a falsifiable statement that predicts no difference between experimental conditions. The alternative hypothesis (often written as HA or H1) captures departures from the null hypothesis.

Building a model out of data

Example model The standard deviation of sample means is known as the standard error of the mean (SE). sampling variation: that is, samples will vary because they contain different members of the population; A sampling distribution is simply the frequency distribution of sample means5 from the same population.

Parametric data

Skewness and Kurtosis Positively skewed Negatively skewed Platykurtic Leptokurtic Kurtosis cut-off 3

Significance Testing (cont…) Type 1 error = Pr(reject H 0 | H 0 true). (also significance level or alpha) p value describes the probability of obtaining the observed data, or more extreme data, if the null hypothesis were true. Pr(observed data| H 0 true) p < .05 means the chances of a Type I error occurring are less than 5 %.

Type 1 and Type 2 error

Type 1 and Type 2 error (cont…)

Type 1 and Type 2 error (cont…)

Study Design (cont…) Randomized Experiments Between Subject Experiments Within Subject Experiments Mixed or Factorial Designs How to balance to avoid order effect? Complete counterbalancing; >= n! participants needed Use Latin Square Designs or Balanced Latin Square Designs

Study Design (cont…)

One-tailed vs. Two-tailed Assumption: DV follows a normal distribution A statistical model that tests a directional hypothesis is called a one-tailed test, whereas one testing a non-directional hypothesis is known as a two-tailed test.

Latin Square Balancing better than standard Latin square designs are balanced Latin square designs where each condition precedes and follows each other condition equally often. This can help to minimize sequential effects

Quasi-Experimental Designs in HCI Non-equivalent group designs Pre-test/post-test design Interrupted Time-Series Design Infers the effects of an independent variable by comparing multiple measures obtained before and after an intervention takes place. major threats to internal validity

Writing up Experimental Research Answer “Why should anyone care?” Use APA conventions—mostly accepted Results should contain the study design and statistical tests Discussion should contain how the results address the HCI research question operationalized using the metrics and tested. Report effect size. Pre-register large experiments with OSF Open Science Framework https://osf.io/

Assumptions of parametric data Normally distributed data Homogeneity of variance Interval data Independence that the behaviour of one participant does not influence the behaviour of another. the variances should be the same throughout the data.

Statistical Power and Effect Sizes Effect sizes are useful because they provide an objective measure of the importance of an effect. Effect size depends on: Sample size Alpha Statistical power of the test used The power of a test is the probability that a given test will find an effect assuming that one exists in the population. If β, the probability of a Type II error, power = 1- β. G*Power

Parametric tests Correlation Regression T-test ANOVA GLM

Upcoming: Proposal due Sep 24, 11:59pm Start working on your annotated bibliography