IS 4800 Empirical Research Methods for Information Science Class Notes March 16, 2012 Instructor: Prof. Carole Hafner, 446 WVH Tel:

Outline
- Sampling and statistics (cont.)
- t test for paired samples
- t test for independent means
- Analysis of Variance
- Two-way Analysis of Variance

Relationship Between Population and Samples When a Treatment Had No Effect

Relationship Between Population and Samples When a Treatment Had An Effect

- Population: Mean? Variance? ($\mu$, $\sigma^2$)
- Sampling: sample of size N
- Mean values from all possible samples of size N, aka “distribution of means” ($\mu_M$, $\sigma_M^2$)
- $Z = (M - \mu) / \sigma_M$

Z tests and t-tests
- t is like Z: $Z = (M - \mu) / \sigma_M$ and $t = (M - \mu) / S_M$
- $\mu = 0$ for paired samples
- We use a stricter criterion (t) instead of Z because $S_M$ is based on an estimate of the population variance, while $\sigma_M$ is based on a known population variance.
- $S^2 = \frac{\sum (X - M)^2}{N - 1} = \frac{SS}{N - 1}$
- $S_M^2 = S^2 / N$

t test with paired samples
- Given info about the population of change scores and the sample size we will be using (N), we can compute the distribution of means.
- Now, given a particular sample of change scores of size N, we compute its mean and finally determine the probability that this mean occurred by chance.
- $\mu = 0$
- $S^2$ (estimate of $\sigma^2$ from the sample) $= SS/df$, with $df = N - 1$
- $S_M^2 = S^2 / N$
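The paired-samples steps above can be sketched in a few lines of Python. This is only an illustration: the `before` and `after` scores and the function name `paired_t` are made up for the example and do not come from the slides.

```python
import math

# Hypothetical before/after scores for 5 participants (illustrative only).
before = [10, 12, 9, 14, 11]
after = [12, 16, 15, 18, 15]


def paired_t(before, after):
    """Return (t, df) for a paired-samples t test against mu = 0."""
    diffs = [a - b for a, b in zip(after, before)]   # change scores
    n = len(diffs)
    mean_change = sum(diffs) / n                     # M
    ss = sum((d - mean_change) ** 2 for d in diffs)  # SS
    df = n - 1
    s2 = ss / df                                     # S^2 = SS / df
    s2_m = s2 / n                                    # S^2_M = S^2 / N
    t = (mean_change - 0) / math.sqrt(s2_m)          # mu = 0 under the null
    return t, df


t_paired, df_paired = paired_t(before, after)
print(f"t({df_paired}) = {t_paired:.3f}")
```

The resulting t would then be compared with the cutoff from a t table for the same df.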

t test for independent samples
- Given two samples
- Estimate population variances (assume same)
- Estimate variances of distributions of means
- Estimate variance of differences between means (mean = 0)
- This is now your comparison distribution

Estimating the Population Variance
- $S^2$ is an estimate of $\sigma^2$
- $S^2 = SS/(N-1)$ for one sample (take the square root for S)
- For two independent samples, the “pooled estimate”: $S^2_{Pooled} = \frac{df_1}{df_{Total}} S_1^2 + \frac{df_2}{df_{Total}} S_2^2$, where $df_{Total} = df_1 + df_2 = (N_1 - 1) + (N_2 - 1)$
- From this, calculate the variance of sample means, needed to compute the t statistic: $S_M^2 = S^2 / N$
- $S^2_{Difference} = S^2_{Pooled} / N_1 + S^2_{Pooled} / N_2$

t test for independent samples, continued
- Distribution of differences between means: this is your comparison distribution
- It is NOT normal; it is a ‘t’ distribution
- Shape changes depending on df: $df = (N_1 - 1) + (N_2 - 1)$
- Compute $t = (M_1 - M_2) / S_{Difference}$
- Determine whether t is beyond the cutoff score for the test parameters (df, sig, tails) from a lookup table.
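As a rough sketch of the independent-samples procedure, the pooled variance estimate and the t statistic can be computed as follows. The two groups of scores and the helper names (`sum_sq`, `independent_t`) are hypothetical and chosen only for illustration.

```python
import math


def sum_sq(xs):
    """Sum of squared deviations from the sample mean (SS)."""
    m = sum(xs) / len(xs)
    return sum((x - m) ** 2 for x in xs)


def independent_t(g1, g2):
    """Return (t, df_total, s2_pooled) for two independent samples."""
    n1, n2 = len(g1), len(g2)
    df1, df2 = n1 - 1, n2 - 1
    df_total = df1 + df2
    s2_1 = sum_sq(g1) / df1          # variance estimate from sample 1
    s2_2 = sum_sq(g2) / df2          # variance estimate from sample 2
    # Pooled estimate: each sample weighted by its share of the df.
    s2_pooled = (df1 / df_total) * s2_1 + (df2 / df_total) * s2_2
    # Variance of the distribution of differences between means.
    s2_diff = s2_pooled / n1 + s2_pooled / n2
    m1, m2 = sum(g1) / n1, sum(g2) / n2
    t = (m1 - m2) / math.sqrt(s2_diff)
    return t, df_total, s2_pooled


# Hypothetical scores for two groups of unequal size.
group1 = [2, 4, 6]
group2 = [5, 7, 9, 11]
t_ind, df_ind, pooled = independent_t(group1, group2)
print(f"t({df_ind}) = {t_ind:.3f}, pooled S^2 = {pooled:.2f}")
```

Weighting by df means the larger sample contributes more to the pooled estimate, which matters when N1 and N2 differ.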

ANOVA: When to use
- Categorical IV, numerical DV (same as t-test)
- HOWEVER: there are more than 2 levels of the IV, so $(M_1 - M_2) / S_M$ won’t work

ANOVA Assumptions
- Populations are normal
- Populations have equal variances
- More or less...

Basic Logic of ANOVA
- Null hypothesis: means of all groups are equal.
- Test: do the means differ more than expected given the null hypothesis?
- Terminology: Group = Condition = Cell

Accompanying Statistics
Experimental
- Between-subjects
  - Single factor, N-level (for N>2)
    - One-way Analysis of Variance (ANOVA)
  - Two factor, two-level (or more!)
    - Factorial Analysis of Variance
    - AKA N-way Analysis of Variance (for N IVs)
    - AKA N-factor ANOVA
- Within-subjects
  - Repeated-measures ANOVA (not discussed)
  - AKA within-subjects ANOVA

ANOVA: Single factor, N-level (for N>2)
- The Analysis of Variance is used when you have more than two groups in an experiment
  - The F-ratio is the statistic computed in an Analysis of Variance and is compared to critical values of F
  - The analysis of variance may be used with unequal sample size (weighted or unweighted means analysis)
  - When there are just 2 groups, ANOVA is equivalent to the t test for independent means
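The equivalence for two groups can be checked numerically: with two groups, the F-ratio equals the square of the pooled-variance t statistic. The sketch below uses made-up scores and hypothetical helper names (`pooled_t`, `one_way_f`); it assumes the standard between-group and within-group variance estimates.

```python
import math


def mean(xs):
    return sum(xs) / len(xs)


def sum_sq(xs):
    """Sum of squared deviations from the sample mean."""
    m = mean(xs)
    return sum((x - m) ** 2 for x in xs)


def pooled_t(g1, g2):
    """t statistic for two independent samples with a pooled variance."""
    df_total = (len(g1) - 1) + (len(g2) - 1)
    s2_pooled = (sum_sq(g1) + sum_sq(g2)) / df_total
    s2_diff = s2_pooled / len(g1) + s2_pooled / len(g2)
    return (mean(g1) - mean(g2)) / math.sqrt(s2_diff)


def one_way_f(groups):
    """F-ratio: between-group variance estimate over within-group estimate."""
    scores = [x for g in groups for x in g]
    grand_mean = mean(scores)
    ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
    ss_within = sum(sum_sq(g) for g in groups)
    bdf = len(groups) - 1
    wdf = len(scores) - len(groups)
    return (ss_between / bdf) / (ss_within / wdf)


# Hypothetical scores for two groups.
group1 = [2, 4, 6]
group2 = [5, 7, 9, 11]
t_value = pooled_t(group1, group2)
f_value = one_way_f([group1, group2])
print(f"t^2 = {t_value ** 2:.4f}, F = {f_value:.4f}")
```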

One-Way ANOVA: Assuming Null Hypothesis is True…
[Figure: within-group estimate of population variance and between-group estimate of population variance, for group means M1, M2, M3]

Justification for F statistic

Calculating F

Example

Using the F Statistic
- Use a table for F(BDF, WDF), and also α
- BDF = between-groups degrees of freedom = number of groups - 1
- WDF = within-groups degrees of freedom = Σ df for all groups = N - number of groups
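The formulas on the "Calculating F" slide did not survive in this transcript, so the following is a minimal sketch of the standard one-way ANOVA computation rather than a copy of the slide. It returns F together with BDF and WDF as defined above. The three groups of scores and the function name `one_way_anova` are hypothetical.

```python
def one_way_anova(groups):
    """Return (F, BDF, WDF) for a list of groups of scores."""
    scores = [x for g in groups for x in g]
    n_total = len(scores)
    grand_mean = sum(scores) / n_total
    group_means = [sum(g) / len(g) for g in groups]

    # Between-group sum of squares: spread of group means around the grand mean.
    ss_between = sum(
        len(g) * (m - grand_mean) ** 2 for g, m in zip(groups, group_means)
    )
    # Within-group sum of squares: spread of scores around their own group mean.
    ss_within = sum(
        (x - m) ** 2 for g, m in zip(groups, group_means) for x in g
    )

    bdf = len(groups) - 1        # number of groups - 1
    wdf = n_total - len(groups)  # N - number of groups
    f_ratio = (ss_between / bdf) / (ss_within / wdf)
    return f_ratio, bdf, wdf


# Hypothetical scores for three conditions.
conditions = [[1, 2, 3], [2, 3, 4], [5, 6, 7]]
f_ratio, bdf, wdf = one_way_anova(conditions)
print(f"F({bdf},{wdf}) = {f_ratio:.2f}")
```

The printed value is what would be looked up against the critical F for (BDF, WDF) at the chosen α.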

One-way ANOVA in SPSS

Data Mean

Analyze/Compare Means/One Way ANOVA…

SPSS Results… F(2,21)=9.442, p<.05

Factorial Designs
- Two or more nominal independent variables, each with two or more levels, and a numeric dependent variable.
- Factorial ANOVA teases apart the contribution of each variable separately.
- For N IVs, aka “N-way” ANOVA

Factorial Designs
- Adding a second independent variable to a single-factor design results in a FACTORIAL DESIGN
- Two components can be assessed
  - The MAIN EFFECT of each independent variable
    - The separate effect of each independent variable
    - Analogous to separate experiments involving those variables
  - The INTERACTION between independent variables
    - When the effect of one independent variable changes over levels of a second
    - Or: when the effect of one variable depends on the level of the other variable.

Example
- Wait Time Sign in Student Center vs. No Sign
- Satisfaction

Example of an Interaction: Student Center Sign, 2 Genders x 2 Sign Conditions
[Figure: interaction plot with lines for F and M; recovered axis label: No Sign]
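To make the difference between a main effect and an interaction concrete, here is a small descriptive sketch for a 2 x 2 design like the sign example. The cell means are invented for illustration (the values in the original plot are not recoverable), equal cell sizes are assumed, and the code only compares means; it does not test significance.

```python
# Hypothetical mean satisfaction per cell: (gender, sign condition) -> mean.
cell_means = {
    ("F", "No Sign"): 3.0,
    ("F", "Sign"): 5.0,
    ("M", "No Sign"): 4.0,
    ("M", "Sign"): 4.0,
}


def marginal_mean(cells, position, level):
    """Average the cell means that share one level of one factor."""
    values = [m for key, m in cells.items() if key[position] == level]
    return sum(values) / len(values)


# Main effects: difference between marginal means of each factor.
sign_effect = marginal_mean(cell_means, 1, "Sign") - marginal_mean(
    cell_means, 1, "No Sign"
)
gender_effect = marginal_mean(cell_means, 0, "M") - marginal_mean(
    cell_means, 0, "F"
)

# Interaction: does the effect of the sign depend on gender?
sign_effect_f = cell_means[("F", "Sign")] - cell_means[("F", "No Sign")]
sign_effect_m = cell_means[("M", "Sign")] - cell_means[("M", "No Sign")]
interaction = sign_effect_f - sign_effect_m

print(f"sign main effect: {sign_effect}")
print(f"gender main effect: {gender_effect}")
print(f"interaction contrast: {interaction}")
```

In these made-up numbers the sign helps one gender and not the other, so the interaction contrast is nonzero even though the gender main effect is zero.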

Two-way ANOVA in SPSS

Analyze/General Linear Model/Univariate

Results

Results

Degrees of Freedom
- df for between-group variance estimates for main effects: number of levels - 1
- df for between-group variance estimate for interaction effect: total number of cells - df for both main effects - 1
  - e.g. 2x2 => 4 - (1+1) - 1 = 1
- df for within-group variance estimate: sum of df for each cell = N - number of cells
- Report: “F(bet-group, within-group)=F, Sig.”

Publication format
- N=24, 2x3=6 cells => df TrainingDays=2, df within-group variance=24-6=18
- => F(2,18)=7.20, p<.05
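The degrees-of-freedom rules for a two-factor design can be written as a short helper. This is a sketch: the function name `two_way_df` and its parameter names are hypothetical, and the call reuses the 2x3, N=24 design from the publication-format slide.

```python
def two_way_df(levels_a, levels_b, n_total):
    """Degrees of freedom for a two-factor between-subjects design."""
    cells = levels_a * levels_b
    df_a = levels_a - 1                       # main effect of factor A
    df_b = levels_b - 1                       # main effect of factor B
    df_interaction = cells - (df_a + df_b) - 1
    df_within = n_total - cells               # sum of (n - 1) over all cells
    return {
        "A": df_a,
        "B": df_b,
        "A*B": df_interaction,
        "within": df_within,
    }


# 2 x 3 design with N = 24 (factor B has 3 levels, as TrainingDays does).
df = two_way_df(levels_a=2, levels_b=3, n_total=24)
print(df)

# Check against the 2 x 2 example: 4 - (1 + 1) - 1 = 1.
df_2x2 = two_way_df(levels_a=2, levels_b=2, n_total=20)
```

With three levels for TrainingDays this gives df = 2 between groups and 18 within groups, matching the F(2,18) reported above.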

Reporting rule
IF you have a significant interaction THEN
- If 2x2 study: do not report main effects, even if significant
- Else: must look at patterns of means in cells to determine whether to report main effects or not.

Results?
[SPSS output table; rows: TrainingDays, Trainer, TrainingDays * Trainer; column: Sig]
- n.s.
- Significant interaction between TrainingDays and Trainer, F(2,22)=.584, p<.05
- Main effect of Trainer, F(1,22)=.001, p<.05
- Significant interaction between TrainingDays and Trainer, F(2,22)=.584, p<.05. Do not report TrainingDays as significant
- Main effects for both TrainingDays, F(2,22)=7.20, p<.05, and Trainer, F(1,22)=.001, p<.05

“Factorial Design”
- Not all cells in your design need to be tested
  - But if they are, it is a “full factorial design”, and you do a “full factorial ANOVA”
[Table with labels Real-Time, Retrospective, Agent, Text; three cells carry a mark and one carries an X]

Higher-Order Factorial Designs
- More than two independent variables are included in a higher-order factorial design
  - As factors are added, the complexity of the experimental design increases
    - The number of possible main effects and interactions increases
    - The number of subjects required increases
    - The volume of materials and amount of time needed to complete the experiment increases
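To see how quickly the number of effects grows, the sketch below enumerates every main effect and interaction for a given list of factors. The factor names (including `Gender` as a third factor) and the function name `list_effects` are hypothetical; the count follows from taking every combination of two or more factors as one interaction.

```python
from itertools import combinations


def list_effects(factors):
    """Return (main_effects, interactions) for a full factorial design."""
    main_effects = [(f,) for f in factors]
    interactions = [
        combo
        for size in range(2, len(factors) + 1)
        for combo in combinations(factors, size)
    ]
    return main_effects, interactions


# Two factors: 2 main effects and 1 interaction.
main2, inter2 = list_effects(["TrainingDays", "Trainer"])

# Adding a hypothetical third factor: 3 main effects and 4 interactions.
main3, inter3 = list_effects(["TrainingDays", "Trainer", "Gender"])

for effect in main3 + inter3:
    print(" * ".join(effect))
```

For k factors this gives k main effects and 2^k - k - 1 interactions, which is one reason higher-order designs become hard to interpret.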