Introduction to Power Analysis  G. Quinn & M. Keough, 2003 Do not copy or distribute without permission of authors.

Slides:



Advertisements
Similar presentations
Experimental design and analysis Multiple linear regression  Gerry Quinn & Mick Keough, 1998 Do not copy or distribute without permission of authors.
Advertisements

CHAPTER 2 Building Empirical Model. Basic Statistical Concepts Consider this situation: The tension bond strength of portland cement mortar is an important.
Copyright © 2009 Pearson Education, Inc. Chapter 29 Multiple Regression.
Sampling: Final and Initial Sample Size Determination
1 G Lect 8a G Lecture 8a Power for specific tests: Noncentral sampling distributions Illustration of Power/Precision Software A priori and.
Estimation of Sample Size
ANALYSIS OF VARIANCE  Gerry Quinn & Mick Keough, 1998 Do not copy or distribute without permission of authors. One factor.
1 Chapter 2 Simple Linear Regression Ray-Bing Chen Institute of Statistics National University of Kaohsiung.
© 2010 Pearson Prentice Hall. All rights reserved Least Squares Regression Models.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. *Chapter 29 Multiple Regression.
Part I – MULTIVARIATE ANALYSIS
Experimental design and analysis Experimental design  Gerry Quinn & Mick Keough, 1998 Do not copy or distribute without permission of authors.
SADC Course in Statistics Comparing Means from Independent Samples (Session 12)
Sample size computations Petter Mostad
Statistics: Data Analysis and Presentation Fr Clinic II.
Final Review Session.
MARE 250 Dr. Jason Turner Hypothesis Testing III.
PSY 1950 Confidence and Power December, Requisite Quote “The picturing of data allows us to be sensitive not only to the multiple hypotheses that.
Analysis of Variance Chapter 3Design & Analysis of Experiments 7E 2009 Montgomery 1.
Biol 500: basic statistics
13-1 Designing Engineering Experiments Every experiment involves a sequence of activities: Conjecture – the original hypothesis that motivates the.
Chapter 11: Inference for Distributions
Comparing Several Means: One-way ANOVA Lesson 14.
Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 11: Power.
Today Concepts underlying inferential statistics
Using Statistics in Research Psych 231: Research Methods in Psychology.
Simple Linear Regression and Correlation
Lorelei Howard and Nick Wright MfD 2008
Effect Sizes, Power Analysis and Statistical Decisions Effect sizes -- what and why?? review of statistical decisions and statistical decision errors statistical.
Chapter 14 Inferential Data Analysis
13 Design and Analysis of Single-Factor Experiments:
Analysis of Variance. ANOVA Probably the most popular analysis in psychology Why? Ease of implementation Allows for analysis of several groups at once.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 12-1 Chapter 12 Simple Linear Regression Statistics for Managers Using.
Copyright, Gerry Quinn & Mick Keough, 1998 Please do not copy or distribute this file without the authors’ permission Experimental design and analysis.
5-1 Introduction 5-2 Inference on the Means of Two Populations, Variances Known Assumptions.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Inference on the Least-Squares Regression Model and Multiple Regression 14.
More About Significance Tests
Choosing and using statistics to test ecological hypotheses
Hypothesis Testing Quantitative Methods in HPELS 440:210.
© 1998, Geoff Kuenning General 2 k Factorial Designs Used to explain the effects of k factors, each with two alternatives or levels 2 2 factorial designs.
MARE 250 Dr. Jason Turner Hypothesis Testing III.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide
ANOVA (Analysis of Variance) by Aziza Munir
Chapter 10: Analyzing Experimental Data Inferential statistics are used to determine whether the independent variable had an effect on the dependent variance.
Multiple Regression and Model Building Chapter 15 Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
PCB 3043L - General Ecology Data Analysis. OUTLINE Organizing an ecological study Basic sampling terminology Statistical analysis of data –Why use statistics?
Univariate Linear Regression Problem Model: Y=  0 +  1 X+  Test: H 0 : β 1 =0. Alternative: H 1 : β 1 >0. The distribution of Y is normal under both.
Review Hints for Final. Descriptive Statistics: Describing a data set.
One-way ANOVA: - Comparing the means IPS chapter 12.2 © 2006 W.H. Freeman and Company.
DOX 6E Montgomery1 Design of Engineering Experiments Part 2 – Basic Statistical Concepts Simple comparative experiments –The hypothesis testing framework.
The Analysis of Variance. One-Way ANOVA  We use ANOVA when we want to look at statistical relationships (difference in means for example) between more.
PCB 3043L - General Ecology Data Analysis.
IE241: Introduction to Design of Experiments. Last term we talked about testing the difference between two independent means. For means from a normal.
Handout Six: Sample Size, Effect Size, Power, and Assumptions of ANOVA EPSE 592 Experimental Designs and Analysis in Educational Research Instructor: Dr.
Testing Differences in Means (t-tests) Dr. Richard Jackson © Mercer University 2005 All Rights Reserved.
Model adequacy checking in the ANOVA Checking assumptions is important –Normality –Constant variance –Independence –Have we fit the right model? Later.
SUMMARY EQT 271 MADAM SITI AISYAH ZAKARIA SEMESTER /2015.
11.1 Heteroskedasticity: Nature and Detection Aims and Learning Objectives By the end of this session students should be able to: Explain the nature.
How Many Subjects Will I Need? Jane C. Johnson Office of Research Support A.T. Still University of Health Sciences Kirksville, MO USA.
Statistics for Education Research Lecture 4 Tests on Two Means: Types and Paired-Sample T-tests Instructor: Dr. Tung-hsien He
Inferential Statistics Psych 231: Research Methods in Psychology.
F73DA2 INTRODUCTORY DATA ANALYSIS ANALYSIS OF VARIANCE.
Chapter 9 Introduction to the t Statistic
CHAPTER 15: THE NUTS AND BOLTS OF USING STATISTICS.
Stats Methods at IC Lecture 3: Regression.
Statistical Quality Control, 7th Edition by Douglas C. Montgomery.
PCB 3043L - General Ecology Data Analysis.
Hypothesis testing using contrasts
Understanding Results
کارگاه حجم نمونه با نرم افزار G*Power
Presentation transcript:

Introduction to Power Analysis  G. Quinn & M. Keough, 2003 Do not copy or distribute without permission of authors.

Power of test Probability of detecting an effect if it exists Probability of rejecting incorrect H O 1 –  where  is the Type II error

HAHA HoHo Region where H o retainedRegion where H o rejected Type II errorType I error

Statistical power depends on Effect size (ES) –size of difference between treatments –large effects easier to detect Background variation: –variation between experimental units (  2 estimated by s 2 ) –greater background variability, less likely to detect effects

Sample size (n) for each treatment group: –increasing sample size makes effects easier to detect Significance level (  ): –Type I error rate –as  decreases,  increases, power decreases

Power analysis If  is fixed (usually at 0.05), then

Exact formula depends on statistical test (i.e. different for t, F etc.)

yy P(y) z 22 tF

Good returns Diminshing returns

A posteriori power

If conclusion is non-significant: report power of experiment to detect relevant effect size. Solve power equation for specific ES: Post-hoc (a posteriori) power analysis

Karban (1993) Ecology 74:9-19 Plant growth and reproduction in response to reduced herbivores. Two treatments: –normal herbivore damage –reduced herbivore damage –n = 31 plants in each treatment For plant growth: F 1,60 = 0.51, P = 0.48 ns

Karban (1993) Ecology 74:9-19 Power to detect effects: Small effect (ES = 0.1)0.11 Medium effect (ES = 0.25)0.50 Large effect (ES = 0.40)0.88 Effect size (ES) =  MS Groups /  MS Residual ie. SD Groups / SD Reps - see Cohen (1992)

A priori power analysis sample size determination To determine appropriate sample size a priori, we need to know: what power we want background variation (from pilot study or previous literature) what ES we wish to be able to detect if it occurs

Solve power equation for n

Required sample size

Example of a priori power analysis Effects of fish predation on mudflat crabs Two treatments: –cage vs cage control Pilot study: –number of crabs in 3 plots –variance was 19 (so s 2 = 19) –mean was 20

Aims: –to detect 50% increase in crab numbers due to caging, ie. an increase from 20 to about 30; so ES = 50% (or 10 crabs per plot) –to be 80% sure of picking up such an effect if it occurred; so power = 0.80 How many replicate plots required for each treatment? –what is required n?

Basic power (t-test) ES = 10 s = 4.36

Detecting a more subtle effect Halved

If data are more variable Variance doubled

Minimum Detectable Effect Size If an ES can’t be determined Specify target power, solve for ES

ES vs Sample Size (a priori)

Effect size How big: –what size of effect is biologically important? –how big an effect do we want to detect if it occurs?

An effect size is… Type of testEffect Size t-test (2-sample)Difference between means Simple linear regression Slope (or change in slope) ANOVA (1-way)Differences between means  2 (2 x 2 table) Difference in proportions

Effect size Where from? –biological knowledge –previous work/literature –compliance requirements (e.g. water quality)

Specification of effect size Easy for 2 groups: –difference between 2 means 0 P(y) Central t Noncentral t

Specification of effect size Harder for more than 2 groups: –Consider 4 groups: 50% difference from smallest to largest 1 = 2 = 3 < 4? 1 < 2 < 3 < 4? 1 = 2 < 3 = 4? –Shape of alternative distribution depends on the particular pattern

P(y) Central F

One-way ANOVA Range = 10, s = No. of replicates Power 3 vs 1 2 vs 2 Linear

Estimate of variance Other work on same system Published work on similar systems Pilot studies Must be estimate of same kind of variance –e.g. paired vs two-sample t test –Variance of difference vs variance of each sample

Power analysis requires Clear understanding of the kind of statistical model to be used (inc. the formal tests) Careful thought about important effects; hardest step, especially for interactions An estimate of variance Significance level to be used Desired level of confidence Understanding of non-centrality parameters for complex designs

Cautions Variance estimates may be uncertain –Allow for extra samples in case of larger than expected variation Realized power Cohen’s Effect Sizes Raw & standardized ES Terminology

Options for study planning: n Level of significance Desired power Target effect size Estimate of variation Calculated sample size –or Power vs n “Safety” factor

Options for planning: ES Significance level Desired level of confidence Estimate of variation Suggested sample size (or range) Effect that should be detected –ES vs n

Power calculations Charts & tables available in many books Software will do these calculations: –Gpower –PiFace & Java applets –Review at: Some statistical packages –But check what they do!

Power vs ES (post hoc)