1 Power, Power Curves and Sample Size. 2 Planning Reliable and Efficient Tests For any job, you need a tool that offers the right amount of power for.

Slides:



Advertisements
Similar presentations
If you are viewing this slideshow within a browser window, select File/Save as… from the toolbar and save the slideshow to your computer, then open it.
Advertisements

ABC. Question 1 Human capital is defined as: The knowledge, talent, and skills that people possess. A The common knowledge, talent, and skills that all.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 21 More About Tests and Intervals.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 21 More About Tests.
Motivation Are you motivated to achieve what you really want in life? And how hard do you push yourself to get things done? Wanting to do something and.
Confidence Intervals for Proportions
PSY 307 – Statistics for the Behavioral Sciences
Finance, Financial Markets, and NPV
Power Analysis for Correlation & Multiple Regression Sample Size & multiple regression Subject-to-variable ratios Stability of correlation values Useful.
Analysis of Variance Chapter 3Design & Analysis of Experiments 7E 2009 Montgomery 1.
BCOR 1020 Business Statistics Lecture 21 – April 8, 2008.
Chapter 9 Hypothesis Testing.
Today Concepts underlying inferential statistics
BPT2423 – STATISTICAL PROCESS CONTROL.  Estimation of Population σ from Sample Data  Control Limits versus Specification Limits  The 6σ Spread versus.
Sample Size and Statistical Power Epidemiology 655 Winter 1999 Jennifer Beebe.
TEST YOUR KNOWLEDGE LESSON 4: BACK TO SCHOOL ABC Lesson 4: Back to School.
Chapter 19: Confidence Intervals for Proportions
Inferential Statistics
Statistical Analysis. Purpose of Statistical Analysis Determines whether the results found in an experiment are meaningful. Answers the question: –Does.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 21 More About Tests.
Correlation and Regression
Hypothesis testing – mean differences between populations
Answering questions about life with statistics ! The results of many investigations in biology are collected as numbers known as _____________________.
Statistical Analysis Statistical Analysis
More About Tests and Intervals Chapter 21. Zero In on the Null Null hypotheses have special requirements. To perform a hypothesis test, the null must.
Statistical Analysis A Quick Overview. The Scientific Method Establishing a hypothesis (idea) Collecting evidence (often in the form of numerical data)
Significance Tests in practice Chapter Tests about a population mean  When we don’t know the population standard deviation σ, we perform a one.
The Hypothesis of Difference Chapter 10. Sampling Distribution of Differences Use a Sampling Distribution of Differences when we want to examine a hypothesis.
Statistics and Quantitative Analysis U4320
Fundamentals of Data Analysis Lecture 9 Management of data sets and improving the precision of measurement.
The Sampling Distribution of a Difference Between Two Means!
Learning Objectives In this chapter you will learn about the t-test and its distribution t-test for related samples t-test for independent samples hypothesis.
Copyright © 2009 Pearson Education, Inc. Chapter 21 More About Tests.
Step 3 of the Data Analysis Plan Confirm what the data reveal: Inferential statistics All this information is in Chapters 11 & 12 of text.
Experimental Design If a process is in statistical control but has poor capability it will often be necessary to reduce variability. Experimental design.
10.2 Tests of Significance Use confidence intervals when the goal is to estimate the population parameter If the goal is to.
AP Statistics Chapter 23 Notes
Hypotheses tests for means
Systems Life Cycle. Know why it is necessary to evaluate a new system Understand the need to evaluate in terms of ease-of- use, appropriateness and efficiency.
1 MARKETING RESEARCH Week 5 Session A IBMS Term 2,
Lecture 16 Section 8.1 Objectives: Testing Statistical Hypotheses − Stating hypotheses statements − Type I and II errors − Conducting a hypothesis test.
Analyze Improve Define Measure Control L EAN S IX S IGMA L EAN S IX S IGMA Chi-Square Analysis Chi-Square Analysis Chi-Square Training for Attribute Data.
One-way ANOVA: - Comparing the means IPS chapter 12.2 © 2006 W.H. Freeman and Company.
Analyzing Statistical Inferences How to Not Know Null.
Chapter 21: More About Test & Intervals
Fall 2002Biostat Statistical Inference - Proportions One sample Confidence intervals Hypothesis tests Two Sample Confidence intervals Hypothesis.
ETM U 1 Analysis of Variance (ANOVA) Suppose we want to compare more than two means? For example, suppose a manufacturer of paper used for grocery.
HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Section 8.1.
Statistical Analysis. Null hypothesis: observed differences are due to chance (no causal relationship) Ex. If light intensity increases, then the rate.
Business Statistics for Managerial Decision Farideh Dehkordi-Vakil.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Hypothesis test flow chart frequency data Measurement scale number of variables 1 basic χ 2 test (19.5) Table I χ 2 test for independence (19.9) Table.
STATISTICS STATISTICS Numerical data. How Do We Make Sense of the Data? descriptively Researchers use statistics for two major purposes: (1) descriptively.
MAKING MEANING OUT OF DATA Statistics for IB-SL Biology.
Chapter 13 Understanding research results: statistical inference.
WASTE LAB: OUT OF SIGHT OUT OF MIND OBJECTIVES Recognize various categories and amount of solid waste produced. Compute percentages of waste, by category.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 21 More About Tests and Intervals.
Lecture Notes and Electronic Presentations, © 2013 Dr. Kelly Significance and Sample Size Refresher Harrison W. Kelly III, Ph.D. Lecture # 3.
Normal Distribution ••••••••••••••••••••••••••••••••••
Chapter 16: Sample Size “See what kind of love the Father has given to us, that we should be called children of God; and so we are. The reason why the.
Nominal the best case analysis using Minitab
Using Bootstrapping to Teach Statistical Concepts
Chapter 21 More About Tests.
More about Tests and Intervals
Decision Errors and Power
CHAPTER 18: Inference in Practice
Basic Training for Statistical Process Control
Basic Training for Statistical Process Control
Two Categorical Variables: The Chi-Square Test
OMGT LECTURE 10: Elements of Hypothesis Testing
Presentation transcript:

1 Power, Power Curves and Sample Size

2 Planning Reliable and Efficient Tests For any job, you need a tool that offers the right amount of power for the task at hand. You wouldn't use a telescope to examine a stamp collection, or a handheld magnifying glass to search for new galaxies, because neither would provide you with meaningful observations. To complicate matters, if detecting a galaxy really was your goal, the cost of gaining the necessary power might be more than you can afford.

3 Planning Reliable and Efficient Tests Anyone using statistical tests faces the same issues. You must consider the precision you need to meet your goals (should your test detect subtle effects or massive shifts?), and balance it against the cost of sampling your population (are you testing toothpicks or jet engines?).

4 Planning Reliable and Efficient Tests You also want the confidence in your results that's appropriate for your situation (testing seat belts demands a greater degree of certainty than testing shampoo). We measure this certainty with statistical power – the probability your test will detect an effect that truly exists.

5 Planning Reliable and Efficient Tests Minitab's Power and Sample Size tools, with Power Curves, help you balance these issues that may compete for your limited resources. Here are three examples of how a quick Power and Sample Size test can help you save time and money getting results you can trust.

6 Don't Leave Success to Chance A paper clip manufacturer wants to detect significant changes in clip length. They sample thousands of clips because it is cheap and quick to do. But this huge sample makes the test too sensitive: the broken line (next slide) shows it will sound the alarm if the average length differs by a trivial amount (0.05).

7 Don't Leave Success to Chance

8 This Power Curve shows they are wasting resources on excessive precision. A sample size of just 100 will detect meaningful differences (0.25) without “crying wolf” at every negligible blip.

9 Don't Leave Success to Chance

10 Don't Leave Success to Chance An aerospace company is designing an experiment to test a new rocket. Each rocket is very expensive, so it is critical to test no more than necessary.

11 Don't Leave Success to Chance

12 Don't Leave Success to Chance This Power Curve confirms an experiment with 6 replicates will give researchers the power they need without spending more than they must.

13 Don't Leave Success to Chance

14 Don't Leave Success to Chance “We've always done it this way.” That's why a lumber company would sample 10 beams to test whether their strength meets the target.

15 Don't Leave Success to Chance

16 Don't Leave Success to Chance According to the Power Curve, this small sample size made their test incapable of detecting important effects. They must sample 34 beams to detect meaningful differences (0.50).

17 Don't Leave Success to Chance

18 See the Big Picture A power analysis helps you weigh your resources against your demands, and quantifies a test's ability to answer your question. It can expose design problems, like the lumber company's insufficient sample size. It can also reveal design solutions you hadn't considered.

19 See the Big Picture Take for instance the packaging plant of a snack company. Customers complain that the company's pretzel bags are sealed with glue that's too strong, so researchers use One-Way ANOVA to compare their current glue with three potential replacements.

20 See the Big Picture Differences in seal strength less than 10 are undetectable to most people, so their test only needs to detect a difference of 10. A power value of 80% is acceptable, but 90% is ideal. What sample size meets their needs?

21 See the Big Picture 30 samples of each glue ensure the test detects a difference of 10 with 90% power.

22 See the Big Picture Or, they could detect the same difference with 23 samples and 80% power. If this represents considerable savings, the researchers may consider using the smaller sample.

23 See the Big Picture The Power Curve illustrates this information, but it also charts every other combination of power and difference for a given sample size.

24 See the Big Picture

25 See the Big Picture The solid line indicates researchers can attain 90% power with just 23 samples if they are willing to seek a difference of 12 instead of 10. This might just be the ideal choice.

26 See the Big Picture

27 How to Create Power Curves in Minitab Performing power analyses with Power Curves couldn't be simpler. You supply the factors you know, and Minitab calculates the one you omit.

28 How to Create Power Curves in Minitab Suppose a trainer wants to compare two training courses for forklift operators. She will use a two-sample t-test to compare the average scores that operators from each course earned on the final exam.

29 How to Create Power Curves in Minitab She knows she must detect a difference of 5 in either direction between the two courses with 80% power, and historical data suggest a standard deviation of 5. But how many participants must she sample from each course?

30 How to Create Power Curves in Minitab Choose Stat > Power and Sample Size > 2-Sample t In Differences, type -5, 5 In Power values, type 0.80 In Standard deviation, type 5 Click OK

31 How to Create Power Curves in Minitab

32 How to Create Power Curves in Minitab

33 How to Create Power Curves in Minitab According to this Power Curve, sampling 17 participants from each class enables her test to detect the difference she seeks with 80% power.

34 How to Create Power Curves in Minitab

35 Putting Power Curves to Use Without knowing the power of your test, it's hard to know if you can trust your results: your test could be too weak to answer your question, or too strong for your needs. Minitab's Power Curves (available for many common statistical procedures) help you balance your resources against your goals and design a test you can trust that costs no more than necessary.

36 Putting Power Curves to Use Power Curves graph the dynamic relationships that define power, revealing the big picture and ensuring no option escapes your consideration. And perhaps most importantly, they make power analysis an easier and more accessible part of every project. Empower your test. Trust your results.