Week 8 – Power! November 16, 2012. Goals t-test review Power! Questions?

Slides:



Advertisements
Similar presentations
Statistics Review – Part II Topics: – Hypothesis Testing – Paired Tests – Tests of variability 1.
Advertisements

PTP 560 Research Methods Week 9 Thomas Ruediger, PT.
Statistical Issues in Research Planning and Evaluation
Statistical Techniques I EXST7005 Lets go Power and Types of Errors.
+ Chapter 10 Section 10.4 Part 2 – Inference as Decision.
Hypothesis testing Week 10 Lecture 2.
PSY 307 – Statistics for the Behavioral Sciences
Section 7.1 Hypothesis Testing: Hypothesis: Null Hypothesis (H 0 ): Alternative Hypothesis (H 1 ): a statistical analysis used to decide which of two competing.
Monday, November 11 Statistical Power. Monday, November 12 Statistical Power It teaches you about the importance of effect size,  (gamma).
Using Statistics in Research Psych 231: Research Methods in Psychology.
Power and Effect Size.
Wednesday, November 14 Statistical Power. Wednesday, November 14 Why Statistical Power? It teaches you about the importance of effect size.  = d x f.
PSY 307 – Statistics for the Behavioral Sciences
Hypothesis Testing. G/RG/R Null Hypothesis: The means of the populations from which the samples were drawn are the same. The samples come from the same.
Statistics 101 Class 9. Overview Last class Last class Our FAVORATE 3 distributions Our FAVORATE 3 distributions The one sample Z-test The one sample.
T-Tests Lecture: Nov. 6, 2002.
Sample Size Determination In the Context of Hypothesis Testing
DTC Quantitative Research Methods Statistical Inference II: Statistical Testing Thursday 7th November 2014  
AM Recitation 2/10/11.
Hypothesis Testing:.
Testing Hypotheses I Lesson 9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics n Inferential Statistics.
Intermediate Statistical Analysis Professor K. Leppel.
Jeopardy Hypothesis Testing T-test Basics T for Indep. Samples Z-scores Probability $100 $200$200 $300 $500 $400 $300 $400 $300 $400 $500 $400.
1 STATISTICAL HYPOTHESES AND THEIR VERIFICATION Kazimieras Pukėnas.
Chapter 8 Inferences Based on a Single Sample: Tests of Hypothesis.
Lecture 14 Testing a Hypothesis about Two Independent Means.
Comparing Means From Two Sets of Data
1 Tests with two+ groups We have examined tests of means for a single group, and for a difference if we have a matched sample (as in husbands and wives)
Statistical Power The ability to find a difference when one really exists.
The Hypothesis of Difference Chapter 10. Sampling Distribution of Differences Use a Sampling Distribution of Differences when we want to examine a hypothesis.
RMTD 404 Lecture 8. 2 Power Recall what you learned about statistical errors in Chapter 4: Type I Error: Finding a difference when there is no true difference.
Comparing Means: t-tests Wednesday 22 February 2012/ Thursday 23 February 2012.
Elementary Statistical Methods André L. Souza, Ph.D. The University of Alabama Lecture 22 Statistical Power.
Chapter 9 Hypothesis Testing and Estimation for Two Population Parameters.
Chapter 9: Testing Hypotheses
Hypothesis Testing CSCE 587.
Business Research Methods William G. Zikmund
Learning Objectives In this chapter you will learn about the t-test and its distribution t-test for related samples t-test for independent samples hypothesis.
January 31 and February 3,  Some formulae are presented in this lecture to provide the general mathematical background to the topic or to demonstrate.
One-sample In the previous cases we had one sample and were comparing its mean to a hypothesized population mean However in many situations we will use.
Chapter 10: Analyzing Experimental Data Inferential statistics are used to determine whether the independent variable had an effect on the dependent variance.
Chapter 8 Introduction to Hypothesis Testing ©. Chapter 8 - Chapter Outcomes After studying the material in this chapter, you should be able to: 4 Formulate.
Jeopardy Hypothesis Testing t-test Basics t for Indep. Samples Related Samples t— Didn’t cover— Skip for now Ancient History $100 $200$200 $300 $500 $400.
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
Effects: Example Low High Variable 2.
Inferential Statistics. Coin Flip How many heads in a row would it take to convince you the coin is unfair? 1? 10?
Chapter 9: Testing Hypotheses Overview Research and null hypotheses One and two-tailed tests Type I and II Errors Testing the difference between two means.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Unit 5: Hypothesis Testing.
© Copyright McGraw-Hill 2004
Hypothesis Testing Introduction to Statistics Chapter 8 Feb 24-26, 2009 Classes #12-13.
Statistical Techniques
EDUC 200C week10 December 7, Two main ideas… Describing a sample – Individual variables (mean and spread of data) – Relationships between two variables.
Hypothesis Tests u Structure of hypothesis tests 1. choose the appropriate test »based on: data characteristics, study objectives »parametric or nonparametric.
Hypothesis Testing and Statistical Significance
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Inferential Statistics Psych 231: Research Methods in Psychology.
How big is my sample going to be?
Hypothesis Testing: Preliminaries
Sample Power.
Understanding Results
Inference Key Questions
MATH 2311 Section 8.2.
Chapter 9 Hypothesis Testing
Design Considerations: Independent Samples v. Repeated Measures
Monday, November 8 Statistical Power.
Wednesday, November 16 Statistical Power
Monday, November 30 Statistical Power.
Testing Hypotheses I Lesson 9.
Type I and Type II Errors
Rest of lecture 4 (Chapter 5: pg ) Statistical Inferences
Presentation transcript:

Week 8 – Power! November 16, 2012

Goals t-test review Power! Questions?

Types of t-tests Single sample t-test – (How likely is it that your sample came from your null population?) Two sample t-test – (How likely is it that your two samples came from the same population?) – Pooled variance Matched pairs t-test – (How likely is it that the difference between pre-and post scores is zero?)

Type I and Type II errors H 0 TrueH 0 False Reject H 0 Type I error α Correct! Power: 1-β Retain H 0 Correct! Confidence: 1-α Type II error β

Type I and Type II errors H 0 TrueH 0 False Reject H 0 Type I error α Correct! Power: 1- β Retain H 0 Correct! Confidence: 1- α Type II error β

Type I errors We determine our chance of making Type I errors when we set α —the percent of observations in the red area If we calculate a t-statistic in the red area and the null hypothesis is true, we will mistakenly reject H 0 tαtα

1- α If H 0 is true, 1- α % of the sample means we draw will not lead us to mistakenly reject the null hypothesis

What if H 0 is false? If H 0 is false, then our sample mean is drawn from a population with a true mean that is different than μ 0

Type I and Type II errors H 0 TrueH 0 False Reject H 0 Type I error α Correct! Power: 1- β Retain H 0 Correct! Confidence: 1- α Type II error β

What if H 0 is false? Comparisons are still made using a critical t-value determined relative to μ 0 tαtα

Type II errors Type II errors are when we mistakenly retain the null hypothesis when it is false This happens when we calculate a t-statistic in the yellow area The proportion of observations in the yellow area is β

Power: 1- β The likelihood we will correctly reject the null hypothesis is 1- β —the proportion of possible sample means in the region of rejection for the null hypothesis

What determines power? First, we need a specific alternative mean. The standardized difference between our null hypothesis and this mean is the effect size, d. Can also think of this as the desired minimum detectable difference. d

The relationship between d and power We have more power to detect large effects (big d) than small ones (little d). Big dSmall d

What determines power? Because it affects the shape of the sampling distributions, N also affects power—higher N means more power Lower NHigher N

What determines power? Because power is represented by the area of the sampling distribution of the true (or alternative) mean that is in the region of rejection of the null hypothesis, α also affects power Note these graphs give α and β rather than 1- β (power), which is what we’ve seen in previous graphs Higher α Lower α

What determines power? Power is determined by d, N, and α d and N are both captured, in general by For single sample t-tests You can then look up the power of a given δ associated with different levels of α (in Table D in back of book)

Problem You develop a new measure of social efficacy for adolescent girls, with 24 items on a 3-point scale. The scale seems to have  = 18, and  = 16. You are asked to evaluate a new program to promote social efficacy in adolescent girls, and want to use your scale. You sample 16, but alas find that the sample mean of 22 does not allow you to reject the null hypothesis at  =.05. You’re really really frustrated because you think that a 4-point difference is meaningful. What should your next steps be?

 = d x f (N)  = d N 1/2 d = 4/16 =.25 N = 16  = 1.0 What would it take for power =.80? N = (  / d ) 2 N = (2.8 /.25) 2 =

Power summary Power reflects our ability to correctly reject the null hypothesis when it is false Must have a specific alternative hypothesis in mind – Alternatively, we can specify a target power level and, with a particular sample size determine how big of an effect we will be able to detect We have higher power with larger samples and when testing for large effect sizes There is a tradeoff between α and power

Questions?