Statistics for the Social Sciences

Slides:



Advertisements
Similar presentations
Anthony Greene1 Simple Hypothesis Testing Detecting Statistical Differences In The Simplest Case:  and  are both known I The Logic of Hypothesis Testing:
Advertisements

Lecture 2: Null Hypothesis Significance Testing Continued Laura McAvinue School of Psychology Trinity College Dublin.
Statistics 101 Class 8. Overview Hypothesis Testing Hypothesis Testing Stating the Research Question Stating the Research Question –Null Hypothesis –Alternative.
Copyright © 2011 by Pearson Education, Inc. All rights reserved Statistics for the Behavioral and Social Sciences: A Brief Course Fifth Edition Arthur.
Ch 6: Making Sense of Statistical Significance: Decision Errors, Effect Size, and Power Pt 2: Sept. 26, 2013.
COURSE: JUST 3900 INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE Instructor: Dr. John J. Kerbs, Associate Professor Joint Ph.D. in Social Work and Sociology.
Thursday, September 12, 2013 Effect Size, Power, and Exam Review.
Statistics for the Social Sciences Psychology 340 Spring 2005 Sampling distribution.
On-line resources
T-tests Computing a t-test  the t statistic  the t distribution Measures of Effect Size  Confidence Intervals  Cohen’s d.
Using Statistics in Research Psych 231: Research Methods in Psychology.
Behavioural Science II Week 1, Semester 2, 2002
Evaluating Hypotheses Chapter 9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics.
Evaluating Hypotheses Chapter 9 Homework: 1-9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics ~
Statistics for the Social Sciences Psychology 340 Fall 2006 Review For Exam 1.
Statistics for the Social Sciences Psychology 340 Fall 2006 Hypothesis testing.
Understanding Statistics in Research
Statistics for the Social Sciences Psychology 340 Spring 2005 Hypothesis testing.
Statistics for the Social Sciences Psychology 340 Fall 2006 Hypothesis testing.
PY 427 Statistics 1Fall 2006 Kin Ching Kong, Ph.D Lecture 6 Chicago School of Professional Psychology.
Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 11: Power.
Chapter 6 Making Sense of Statistical Significance: Effect Size and Statistical Power Part 1: Feb. 19, 2008.
Using Statistics in Research Psych 231: Research Methods in Psychology.
Statistics for the Social Sciences
AM Recitation 2/10/11.
Overview of Statistical Hypothesis Testing: The z-Test
Testing Hypotheses I Lesson 9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics n Inferential Statistics.
Jeopardy Hypothesis Testing T-test Basics T for Indep. Samples Z-scores Probability $100 $200$200 $300 $500 $400 $300 $400 $300 $400 $500 $400.
Tuesday, September 10, 2013 Introduction to hypothesis testing.
Chapter 8 Introduction to Hypothesis Testing
Hypothesis Testing: One Sample Cases. Outline: – The logic of hypothesis testing – The Five-Step Model – Hypothesis testing for single sample means (z.
Chapter 8 Introduction to Hypothesis Testing
LECTURE 19 THURSDAY, 14 April STA 291 Spring
1 Chapter 8 Introduction to Hypothesis Testing. 2 Name of the game… Hypothesis testing Statistical method that uses sample data to evaluate a hypothesis.
Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics.
Aron, Aron, & Coups, Statistics for the Behavioral and Social Sciences: A Brief Course (3e), © 2005 Prentice Hall Chapter 5 Introduction to Hypothesis.
6 Making Sense of Statistical Significance.
Aron, Aron, & Coups, Statistics for the Behavioral and Social Sciences: A Brief Course (3e), © 2005 Prentice Hall Chapter 7 Making Sense of Statistical.
Chapter 8: Introduction to Hypothesis Testing. Hypothesis Testing A hypothesis test is a statistical method that uses sample data to evaluate a hypothesis.
Statistics (cont.) Psych 231: Research Methods in Psychology.
Hypothesis test flow chart
Hypothesis Testing Steps for the Rejection Region Method State H 1 and State H 0 State the Test Statistic and its sampling distribution (normal or t) Determine.
Chapter 3 Normal Curve, Probability, and Population Versus Sample Part 2 Aug. 28, 2014.
Inferential Statistics Psych 231: Research Methods in Psychology.
Reasoning in Psychology Using Statistics Psychology
Modern Languages Row A Row B Row C Row D Row E Row F Row G Row H Row J Row K Row L Row M
Chapter 9 Introduction to the t Statistic
Psych 231: Research Methods in Psychology
INF397C Introduction to Research in Information Studies Spring, Day 12
Statistics for the Social Sciences
Chapter 6 Making Sense of Statistical Significance: Decision Errors, Effect Size and Statistical Power Part 1: Sept. 24, 2013.
Central Limit Theorem, z-tests, & t-tests
Reasoning in Psychology Using Statistics
P-value Approach for Test Conclusion
Statistics for the Social Sciences
Hypothesis Testing.
Reasoning in Psychology Using Statistics
Statistics for the Social Sciences
Psych 231: Research Methods in Psychology
Psych 231: Research Methods in Psychology
Reasoning in Psychology Using Statistics
Psych 231: Research Methods in Psychology
Reasoning in Psychology Using Statistics
Inferential Statistics
Chapter 8 Making Sense of Statistical Significance: Effect Size, Decision Errors, and Statistical Power.
Psych 231: Research Methods in Psychology
Psych 231: Research Methods in Psychology
Psych 231: Research Methods in Psychology
Reasoning in Psychology Using Statistics
1 Chapter 8: Introduction to Hypothesis Testing. 2 Hypothesis Testing The general goal of a hypothesis test is to rule out chance (sampling error) as.
Presentation transcript:

Statistics for the Social Sciences Psychology 340 Spring 2005 Effect sizes & Statistical Power

Outline Effect size: Cohen’s d Error types Statistical Power Analysis

Performing your statistical test There really isn’t an effect Real world (‘truth’) There really is an effect H0 is correct H0 is wrong Reject H0 Experimenter’s conclusions Fail to Reject H0

Performing your statistical test Real world (‘truth’) H0 is correct H0 is wrong

Performing your statistical test Real world (‘truth’) H0 is correct H0 is wrong Type I error Type II error Real world (‘truth’) H0 is correct H0 is wrong Type I error Type II error Real world (‘truth’) H0 is correct H0 is wrong So there is only one distribution So there are two distributions The original (null) distribution The new (treatment) distribution The original (null) distribution

Performing your statistical test Real world (‘truth’) H0 is correct H0 is wrong Type I error Type II error Real world (‘truth’) H0 is correct H0 is wrong So there is only one distribution So there are two distributions The original (null) distribution The new (treatment) distribution The original (null) distribution

Effect Size Real world (‘truth’) H0 is correct H0 is wrong Type I error Type II error Hypothesis test tells us whether the observed difference is probably due to chance or not It does not tell us how big the difference is H0 is wrong So there are two distributions The new (treatment) distribution The original (null) distribution Effect size tells us how much the two populations don’t overlap

Effect Size Figuring effect size But this is tied to the particular units of measurement Figuring effect size The new (treatment) distribution The original (null) distribution Effect size tells us how much the two populations don’t overlap

Effect Size Standardized effect size Cohen’s d Puts into neutral units for comparison (same logic as z-scores) Cohen’s d The new (treatment) distribution The original (null) distribution Effect size tells us how much the two populations don’t overlap

Effect Size Effect size conventions small d = .2 medium d = .5 large d = .8 The new (treatment) distribution The original (null) distribution Effect size tells us how much the two populations don’t overlap

Error types There really isn’t an effect Real world (‘truth’) H0 is correct H0 is wrong I conclude that there is an effect Reject H0 Experimenter’s conclusions I can’t detect an effect Fail to Reject H0

Error types Real world (‘truth’) H0 is correct H0 is wrong Type I error (): concluding that there is a difference between groups (“an effect”) when there really isn’t. H0 is correct H0 is wrong Type I error Reject H0 Experimenter’s conclusions Type II error (): concluding that there isn’t an effect, when there really is. Fail to Reject H0 Type II error

Statistical Power The probability of making a Type II error is related to Statistical Power Statistical Power: The probability that the study will produce a statistically significant results if the research hypothesis is true (there is an effect) So how do we compute this?

Statistical Power Real world (‘truth’) a = 0.05 H0: is true (is no treatment effect) Real world (‘truth’) H0 is correct H0 is wrong Type I error Type II error The original (null) distribution a = 0.05 Reject H0 Fail to reject H0

Statistical Power Real world (‘truth’) a = 0.05 H0: is false (is a treatment effect) Real world (‘truth’) H0 is correct H0 is wrong Type I error Type II error The new (treatment) distribution The original (null) distribution a = 0.05 Fail to reject H0 Reject H0

Statistical Power Real world (‘truth’) H0: is false (is a treatment effect) Real world (‘truth’) H0 is correct H0 is wrong Type I error Type II error The new (treatment) distribution The original (null) distribution b = probability of a Type II error a = 0.05 Failing to Reject H0, even though there is a treatment effect Reject H0 Fail to reject H0

Statistical Power Real world (‘truth’) H0: is false (is a treatment effect) Real world (‘truth’) H0 is correct H0 is wrong Type I error Type II error The new (treatment) distribution The original (null) distribution b = probability of a Type II error a = 0.05 Power = 1 - b Failing to Reject H0, even though there is a treatment effect Probability of (correctly) Rejecting H0 Reject H0 Fail to reject H0

Statistical Power Steps for figuring power 1) Gather the needed information: mean and standard deviation of the Null Population and the predicted mean of Treatment Population

Statistical Power Steps for figuring power 2) Figure the raw-score cutoff point on the comparison distribution to reject the null hypothesis From the unit normal table: Z = -1.645 a = 0.05 Transform this z-score to a raw score

Statistical Power Steps for figuring power 3) Figure the Z score for this same point, but on the distribution of means for treatment Population Remember to use the properties of the treatment population! Transform this raw score to a z-score

Statistical Power Steps for figuring power 4) Use the normal curve table to figure the probability of getting a score more extreme than that Z score b = probability of a Type II error From the unit normal table: Z(0.355) = 0.3594 Power = 1 - b The probability of detecting this an effect of this size from these populations is 64%

Statistical Power Factors that affect Power: a-level Sample size Population standard deviation  Effect size 1-tail vs. 2-tailed

Statistical Power Factors that affect Power: a-level b Power = 1 - b Change from a = 0.05 to 0.01 a = 0.05 b Power = 1 - b Reject H0 Fail to reject H0

Statistical Power Factors that affect Power: a-level b Power = 1 - b Change from a = 0.05 to 0.01 a = 0.05 a = 0.01 b Power = 1 - b Reject H0 Fail to reject H0

Statistical Power Factors that affect Power: a-level b Power = 1 - b Change from a = 0.05 to 0.01 a = 0.05 a = 0.01 b Power = 1 - b Reject H0 Fail to reject H0

Statistical Power Factors that affect Power: a-level b Power = 1 - b Change from a = 0.05 to 0.01 a = 0.05 a = 0.01 b Power = 1 - b Reject H0 Fail to reject H0

Statistical Power Factors that affect Power: a-level b Power = 1 - b Change from a = 0.05 to 0.01 a = 0.05 a = 0.01 b Power = 1 - b Reject H0 Fail to reject H0

Statistical Power Factors that affect Power: a-level b Power = 1 - b So as the a level gets smaller, so does the Power of the test Change from a = 0.05 to 0.01 a = 0.05 a = 0.01 b Power = 1 - b Reject H0 Fail to reject H0

Statistical Power Factors that affect Power: Sample size b Recall that sample size is related to the spread of the distribution Change from n = 25 to 100 a = 0.05 b Power = 1 - b Reject H0 Fail to reject H0

Statistical Power Factors that affect Power: Sample size b Change from n = 25 to 100 a = 0.05 b Power = 1 - b Reject H0 Fail to reject H0

Statistical Power Factors that affect Power: Sample size b Change from n = 25 to 100 a = 0.05 b Power = 1 - b Reject H0 Fail to reject H0

Statistical Power Factors that affect Power: Sample size b Change from n = 25 to 100 a = 0.05 b Power = 1 - b Reject H0 Fail to reject H0

Statistical Power Factors that affect Power: Sample size b As the sample gets bigger, the standard error gets smaller and the Power gets larger Change from n = 25 to 100 a = 0.05 b Power = 1 - b Reject H0 Fail to reject H0

Statistical Power Factors that affect Power: Population standard deviation Change from  = 25 to 20 Recall that standard error is related to the spread of the distribution a = 0.05 b Power = 1 - b Reject H0 Fail to reject H0

Statistical Power Factors that affect Power: Population standard deviation Change from  = 25 to 20 a = 0.05 b Power = 1 - b Reject H0 Fail to reject H0

Statistical Power Factors that affect Power: Population standard deviation Change from  = 25 to 20 a = 0.05 b Power = 1 - b Reject H0 Fail to reject H0

Statistical Power Factors that affect Power: Population standard deviation Change from  = 25 to 20 a = 0.05 b Power = 1 - b Reject H0 Fail to reject H0

Statistical Power Factors that affect Power: Population standard deviation Change from  = 25 to 20 As the  gets smaller, the standard error gets smaller and the Power gets larger a = 0.05 b Power = 1 - b Reject H0 Fail to reject H0

Statistical Power Factors that affect Power: Effect size mtreatment Compare a small effect (difference) to a big effect mtreatment mno treatment a = 0.05 b Power = 1 - b Reject H0 Fail to reject H0

Statistical Power Factors that affect Power: Effect size b Compare a small effect (difference) to a big effect a = 0.05 b Power = 1 - b Reject H0 Fail to reject H0 mtreatment mno treatment

Statistical Power Factors that affect Power: Effect size b Compare a small effect (difference) to a big effect a = 0.05 b Power = 1 - b Reject H0 Fail to reject H0

Statistical Power Factors that affect Power: Effect size b Compare a small effect (difference) to a big effect a = 0.05 b Power = 1 - b Reject H0 Fail to reject H0

Statistical Power Factors that affect Power: Effect size b Compare a small effect (difference) to a big effect a = 0.05 b Power = 1 - b Reject H0 Fail to reject H0

Statistical Power Factors that affect Power: Effect size b Compare a small effect (difference) to a big effect a = 0.05 b Power = 1 - b Reject H0 Fail to reject H0

Statistical Power Factors that affect Power: Effect size b Compare a small effect (difference) to a big effect a = 0.05 b Power = 1 - b Reject H0 Fail to reject H0

Statistical Power Factors that affect Power: Effect size b Compare a small effect (difference) to a big effect a = 0.05 As the effect gets bigger, the Power gets larger b Power = 1 - b Reject H0 Fail to reject H0

Statistical Power Factors that affect Power: 1-tail vs. 2-tailed b Change from a = 0.05 two-tailed to a = 0.05 two-tailed a = 0.05 b Power = 1 - b Reject H0 Fail to reject H0

Statistical Power Factors that affect Power: 1-tail vs. 2-tailed b Change from a = 0.05 two-tailed to a = 0.05 two-tailed p = 0.025 a = 0.05 b Power = 1 - b Reject H0 Fail to reject H0

Statistical Power Factors that affect Power: 1-tail vs. 2-tailed b Change from a = 0.05 two-tailed to a = 0.05 two-tailed a = 0.05 p = 0.025 b Power = 1 - b p = 0.025 Reject H0 Fail to reject H0

Statistical Power Factors that affect Power: 1-tail vs. 2-tailed b Change from a = 0.05 two-tailed to a = 0.05 two-tailed a = 0.05 p = 0.025 b Power = 1 - b p = 0.025 Reject H0 Fail to reject H0

Statistical Power Factors that affect Power: 1-tail vs. 2-tailed b Change from a = 0.05 two-tailed to a = 0.05 two-tailed a = 0.05 p = 0.025 b Power = 1 - b p = 0.025 Reject H0 Fail to reject H0

Statistical Power Factors that affect Power: 1-tail vs. 2-tailed b Change from a = 0.05 two-tailed to a = 0.05 two-tailed Two tailed functionally cuts the -level in half, which decreases the power. a = 0.05 p = 0.025 b Power = 1 - b p = 0.025 Reject H0 Fail to reject H0

Statistical Power Factors that affect Power: a-level: So as the a level gets smaller, so does the Power of the test Sample size: As the sample gets bigger, the standard error gets smaller and the Power gets larger Population standard deviation: As the population standard deviation gets smaller, the standard error gets smaller and the Power gets larger Effect size: As the effect gets bigger, the Power gets larger 1-tail vs. 2-tailed: Two tailed functionally cuts the -level in half, which decreases the power

Why care about Power? Determining your sample size Using an estimate of effect size, and population standard deviation, you can determine how many participants need to achieve a particular level of power When a result if not statistically significant Is is because there is no effect, or not enough power When a result is significant Statistical significance versus practical significance

Ways of Increasing Power