Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 11: Power.

Slides:



Advertisements
Similar presentations
Introduction to Hypothesis Testing
Advertisements

1 Hypothesis Testing William P. Wattles, Ph.D. Psychology 302.
Lecture 2: Null Hypothesis Significance Testing Continued Laura McAvinue School of Psychology Trinity College Dublin.
Statistics 101 Class 8. Overview Hypothesis Testing Hypothesis Testing Stating the Research Question Stating the Research Question –Null Hypothesis –Alternative.
INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE
Statistical Issues in Research Planning and Evaluation
Lecture 3: Null Hypothesis Significance Testing Continued Laura McAvinue School of Psychology Trinity College Dublin.
Statistical Techniques I EXST7005 Lets go Power and Types of Errors.
Ethan Cooper (Lead Tutor)
Thursday, September 12, 2013 Effect Size, Power, and Exam Review.
Review: What influences confidence intervals?
Statistics for the Social Sciences
Behavioural Science II Week 1, Semester 2, 2002
Power and Effect Size.
Lecture 11 Psyc 300A. Null Hypothesis Testing Null hypothesis: the statistical hypothesis that there is no relationship between the variables you are.
Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 10: Hypothesis Tests for Two Means: Related & Independent Samples.
Lecture 13: Review One-Sample z-test and One-Sample t-test 2011, 11, 1.
Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 9: Hypothesis Tests for Means: One Sample.
Intro to Statistics for the Behavioral Sciences PSYC 1900
PSY 1950 Confidence and Power December, Requisite Quote “The picturing of data allows us to be sensitive not only to the multiple hypotheses that.
T-Tests Lecture: Nov. 6, 2002.
Lecture 7 PY 427 Statistics 1 Fall 2006 Kin Ching Kong, Ph.D
PY 427 Statistics 1Fall 2006 Kin Ching Kong, Ph.D Lecture 6 Chicago School of Professional Psychology.
Chapter 9 Hypothesis Testing II. Chapter Outline  Introduction  Hypothesis Testing with Sample Means (Large Samples)  Hypothesis Testing with Sample.
The t Tests Independent Samples.
Tests of Hypothesis [Motivational Example]. It is claimed that the average grade of all 12 year old children in a country in a particular aptitude test.
Chapter 9 Hypothesis Testing II. Chapter Outline  Introduction  Hypothesis Testing with Sample Means (Large Samples)  Hypothesis Testing with Sample.
Inferential Statistics
Introduction to Testing a Hypothesis Testing a treatment Descriptive statistics cannot determine if differences are due to chance. A sampling error occurs.
One Sample Z-test Convert raw scores to z-scores to test hypotheses about sample Using z-scores allows us to match z with a probability Calculate:
Statistics for the Social Sciences
Psy B07 Chapter 1Slide 1 ANALYSIS OF VARIANCE. Psy B07 Chapter 1Slide 2 t-test refresher  In chapter 7 we talked about analyses that could be conducted.
Chapter 13 – 1 Chapter 12: Testing Hypotheses Overview Research and null hypotheses One and two-tailed tests Errors Testing the difference between two.
Hypothesis Testing.
Tuesday, September 10, 2013 Introduction to hypothesis testing.
Chapter 8 Introduction to Hypothesis Testing
Comparing Means From Two Sets of Data
Psy B07 Chapter 8Slide 1 POWER. Psy B07 Chapter 8Slide 2 Chapter 4 flashback  Type I error is the probability of rejecting the null hypothesis when it.
The Hypothesis of Difference Chapter 10. Sampling Distribution of Differences Use a Sampling Distribution of Differences when we want to examine a hypothesis.
1 Today Null and alternative hypotheses 1- and 2-tailed tests Regions of rejection Sampling distributions The Central Limit Theorem Standard errors z-tests.
Hypothesis test flow chart frequency data Measurement scale number of variables 1 basic χ 2 test (19.5) Table I χ 2 test for independence (19.9) Table.
RMTD 404 Lecture 8. 2 Power Recall what you learned about statistical errors in Chapter 4: Type I Error: Finding a difference when there is no true difference.
1 Power and Sample Size in Testing One Mean. 2 Type I & Type II Error Type I Error: reject the null hypothesis when it is true. The probability of a Type.
Elementary Statistical Methods André L. Souza, Ph.D. The University of Alabama Lecture 22 Statistical Power.
Chapter 9 Hypothesis Testing II: two samples Test of significance for sample means (large samples) The difference between “statistical significance” and.
Copyright © 2012 by Nelson Education Limited. Chapter 7 Hypothesis Testing I: The One-Sample Case 7-1.
Chapter 8 Introduction to Hypothesis Testing
Lecture 7 Introduction to Hypothesis Testing. Lecture Goals After completing this lecture, you should be able to: Formulate null and alternative hypotheses.
Chapter 9 Power. Decisions A null hypothesis significance test tells us the probability of obtaining our results when the null hypothesis is true p(Results|H.
1 Psych 5500/6500 The t Test for a Single Group Mean (Part 4): Power Fall, 2008.
DIRECTIONAL HYPOTHESIS The 1-tailed test: –Instead of dividing alpha by 2, you are looking for unlikely outcomes on only 1 side of the distribution –No.
Correct decisions –The null hypothesis is true and it is accepted –The null hypothesis is false and it is rejected Incorrect decisions –Type I Error The.
Chapter 8 Parameter Estimates and Hypothesis Testing.
Chapter 9: Testing Hypotheses Overview Research and null hypotheses One and two-tailed tests Type I and II Errors Testing the difference between two means.
Review I A student researcher obtains a random sample of UMD students and finds that 55% report using an illegally obtained stimulant to study in the past.
Introduction to Testing a Hypothesis Testing a treatment Descriptive statistics cannot determine if differences are due to chance. Sampling error means.
Chapter 8: Introduction to Hypothesis Testing. Hypothesis Testing A hypothesis test is a statistical method that uses sample data to evaluate a hypothesis.
Hypothesis test flow chart
Psychology 290 Lab z-tests & t-tests March 5 - 7, 2007 –z-test –One sample t-test –SPSS – Chapter 7.
Lec. 19 – Hypothesis Testing: The Null and Types of Error.
Chapter 9 Introduction to the t Statistic
Slides to accompany Weathington, Cunningham & Pittenger (2010), Chapter 11: Between-Subjects Designs 1.
Jacek Wallusch _________________________________ Statistics for International Business Lecture 10: Introduction to Hypothesis Testing.
Statistics for the Social Sciences
Central Limit Theorem, z-tests, & t-tests
Hypothesis Testing: Hypotheses
اختبار الفرضيات اختبارالفرضيات المتعلقة بالوسط
Review: What influences confidence intervals?
Hypothesis Testing.
Statistical Power.
Presentation transcript:

Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 11: Power

Power When you find a significant result, it implies that the null hypothesis is unlikely to be correct. When you find a significant result, it implies that the null hypothesis is unlikely to be correct. It does not imply that you will find the same result every time you conduct the experiment. It does not imply that you will find the same result every time you conduct the experiment. Though, on average, you should find differences in the same direction as the one you originally found. Though, on average, you should find differences in the same direction as the one you originally found. Type II errors lead to failures to reject the null when it is incorrect. Type II errors lead to failures to reject the null when it is incorrect.

Type II Errors and Power Power is the probability of correctly rejecting a false null hypothesis. Power is the probability of correctly rejecting a false null hypothesis. Power = (1-  ) Power = (1-  ) Why do we care about power? Why do we care about power? It tells us about the sensitivity of our experiments. It tells us about the sensitivity of our experiments. Allows us to determine needed sample sizes to detect effects of different sizes. Allows us to determine needed sample sizes to detect effects of different sizes.

Type II Errors and Power If we have two distributions, the control group and the treatment group, we can define the errors and power by examining their overlap. If we have two distributions, the control group and the treatment group, we can define the errors and power by examining their overlap. The region to the right of the critical value represents alpha (assuming a one-tailed test). The region to the right of the critical value represents alpha (assuming a one-tailed test). The second distribution represents the sampling distribution of the mean when the null is false (H1). The second distribution represents the sampling distribution of the mean when the null is false (H1). If part of the sampling distribution for H1 falls to the left of the critical value in H0, this represents the probability of a type II error. If part of the sampling distribution for H1 falls to the left of the critical value in H0, this represents the probability of a type II error. Power is the probability represented by the area under H1 that exceeds the critical value. Power is the probability represented by the area under H1 that exceeds the critical value.

Factors That Affect Power Alpha Alpha As alpha increases, power decreases. As alpha increases, power decreases. The Difference Between Means The Difference Between Means Increasing differences are associated with a higher probability of detection (greater power) Increasing differences are associated with a higher probability of detection (greater power) Sample Size Sample Size Increasing sample sizes reduce the standard error and thus increase power. Increasing sample sizes reduce the standard error and thus increase power.

Effect Size Again To calculate power (approximately) we will use a measure of effect size: To calculate power (approximately) we will use a measure of effect size: Gamma is the measure of the degree to which the two means differ relative to the sd of the parent population. Gamma is the measure of the degree to which the two means differ relative to the sd of the parent population.

Effect Size Again How to calculate gamma? How to calculate gamma? If we need to determine power before conducting an experiment, we can estimate gamma based on past research. If we need to determine power before conducting an experiment, we can estimate gamma based on past research. We can also estimate it based on theory using Cohen’s conventions for small, medium, and large effects. We can also estimate it based on theory using Cohen’s conventions for small, medium, and large effects. Or, we can run a small pilot study. Or, we can run a small pilot study. Once we have gamma, we need to determine delta, which will be a different function of sample size depending on the specific test we will conduct. Once we have gamma, we need to determine delta, which will be a different function of sample size depending on the specific test we will conduct.

Power for One Sample t-Tests Here, the function is: Here, the function is: So, if a group of 49 people taking an herbal supplement has an IQ of 108, the power with which we can detect this is: So, if a group of 49 people taking an herbal supplement has an IQ of 108, the power with which we can detect this is:

Power for One Sample t-Tests Once we have delta, we turn to the power tables and find the power for a two tailed test at the appropriate alpha level. Once we have delta, we turn to the power tables and find the power for a two tailed test at the appropriate alpha level. Here, power=.96 Here, power=.96 What does this mean? What does this mean?

Determining Sample Size Sometimes, we wish to know what size sample we would need to obtain a given level of power. Sometimes, we wish to know what size sample we would need to obtain a given level of power. We usually don’t have control over mean differences or conventional alpha levels. We usually don’t have control over mean differences or conventional alpha levels. In this case, we simply set delta to the desired level and solve for n. In this case, we simply set delta to the desired level and solve for n.

N for One Sample t-Tests Here, the function is: Here, the function is: We would need 28 people in the sample to have power of.80 in the previous experiment. We would need 28 people in the sample to have power of.80 in the previous experiment.

Power for Independent Sample t-Tests Here, the function is: Here, the function is: To determine sample size: To determine sample size:

Let’s Look at An Example