Statistical Power: Sample & Effect Size Dr James Betts Developing Study Skills and Research Methods (HL20107)

Slides:



Advertisements
Similar presentations
If you are viewing this slideshow within a browser window, select File/Save as… from the toolbar and save the slideshow to your computer, then open it.
Advertisements

Research Skills Workshop Designing a Project
Statistical vs Clinical Significance
Validity and Reliability
1 Contact details Colin Gray Room S16 (occasionally) address: Telephone: (27) 2233 Dont hesitate to get in touch.
Introduction to Study Skills & Research Methods (HL10040)
Locating Variance: Post-Hoc Tests
Factorial Designs & Managing Violated Statistical Assumptions
Chapter 11 Comparing Two Means. Homework 19 Read: pages , , LDI: 11.1, 11.2, EX: 11.40, 11.41, 11.46,
Equivalence Testing Dig it!.
Statistical tests for replicated experiments Normal probability plots are a less formal diagnostic tool for detecting effects Normal probability plots.
INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE
(This presentation may be used for instructional purposes)
Power and sample size.
Chapter 15 ANOVA.
CHAPTER TWELVE ANALYSING DATA I: QUANTITATIVE DATA ANALYSIS.
Research Methodology Statistics Maha Omair Teaching Assistant Department of Statistics, College of science King Saud University.
Comparing One Sample to its Population
1 COMM 301: Empirical Research in Communication Kwan M Lee Lect4_1.
Copyright © 2011 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 12 Measures of Association.
Effect Size and Power.
Statistical Issues in Research Planning and Evaluation
Effect Size and Power. ► Two things mentioned previously:  P-values are heavily influenced by sample size (n)  Statistics Commandment #1: P-values are.
Effect Size and Meta-Analysis
1 G Lect 8a G Lecture 8a Power for specific tests: Noncentral sampling distributions Illustration of Power/Precision Software A priori and.
Lecture 3: Null Hypothesis Significance Testing Continued Laura McAvinue School of Psychology Trinity College Dublin.
Power Analysis in Grant Writing Jill Harkavy-Friedman, Ph.D.
RIMI Workshop: Power Analysis Ronald D. Yockey
Smith/Davis (c) 2005 Prentice Hall Chapter Thirteen Inferential Tests of Significance II: Analyzing and Interpreting Experiments with More than Two Groups.
Estimation of Sample Size
C81MPR Practical Methods (Lab 2) Collecting Data Dr. Danielle Ropar & Dr. Jonathan Stirk.
Summary 1 l The Analytical Problem l Data Handling.
Cal State Northridge  320 Ainsworth Sampling Distributions and Hypothesis Testing.
Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 10: Hypothesis Tests for Two Means: Related & Independent Samples.
An Overview of Today’s Class
Meta-analysis & psychotherapy outcome research
BCOR 1020 Business Statistics Lecture 21 – April 8, 2008.
Statistics 800: Quantitative Business Analysis for Decision Making Measures of Locations and Variability.
Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 11: Power.
Reliability of Selection Measures. Reliability Defined The degree of dependability, consistency, or stability of scores on measures used in selection.
Chapter 14 Inferential Data Analysis
Relationships Among Variables
Chemometrics Method comparison
9.0 A taste of the Importance of Effect Size The Basics of Effect Size Extraction and Statistical Applications for Meta- Analysis Robert M. Bernard Philip.
Chapter Eleven Inferential Tests of Significance I: t tests – Analyzing Experiments with Two Groups PowerPoint Presentation created by Dr. Susan R. Burns.
Comparing Means From Two Sets of Data
Marketing Research Aaker, Kumar, Day Seventh Edition Instructor’s Presentation Slides.
RMTD 404 Lecture 8. 2 Power Recall what you learned about statistical errors in Chapter 4: Type I Error: Finding a difference when there is no true difference.
Biostatistics: Measures of Central Tendency and Variance in Medical Laboratory Settings Module 5 1.
Instrumentation (cont.) February 28 Note: Measurement Plan Due Next Week.
Funded through the ESRC’s Researcher Development Initiative Prof. Herb MarshMs. Alison O’MaraDr. Lars-Erik Malmberg Department of Education, University.
Statistical analysis Outline that error bars are a graphical representation of the variability of data. The knowledge that any individual measurement.
Biostatistics in Practice Peter D. Christenson Biostatistician Session 6: Case Study.
PS215: Methods in Psychology II W eek 8. 2 Next Friday (Week 9) Evaluating research, class test First ten minutes of lecture ( ) Please come a.
Chapter 10 The t Test for Two Independent Samples
Testing Hypotheses II Lesson 10. A Directional Hypothesis (1-tailed) n Does reading to young children increase IQ scores?  = 100,  = 15, n = 25 l sample.
Chapter 8 Relationships Among Variables. Outline What correlational research investigates Understanding the nature of correlation What the coefficient.
Chapter 11 Meta-Analysis. Meta-analysis  Quantitative means of reanalyzing the results from a large number of research studies in an attempt to synthesize.
Chapter 7: Hypothesis Testing. Learning Objectives Describe the process of hypothesis testing Correctly state hypotheses Distinguish between one-tailed.
Simulation-based inference beyond the introductory course Beth Chance Department of Statistics Cal Poly – San Luis Obispo
Introduction to Power and Effect Size  More to life than statistical significance  Reporting effect size  Assessing power.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 15: Sample size and Power Marshall University Genomics.
Standard Level Diploma
H676 Week 3 – Effect Sizes Additional week for coding?
Reliability & Validity
Lecture 4: Meta-analysis
Inverse Transformation Scale Experimental Power Graphing
Calculating Sample Size: Cohen’s Tables and G. Power
Presentation transcript:

Statistical Power: Sample & Effect Size Dr James Betts Developing Study Skills and Research Methods (HL20107)

Lecture Outline: Statistical Power Effect Size Smallest Worthwhile Effect Coefficient of Variation Sample Size Estimation.

Statistical Power Our consideration of post-hoc tests last week focused heavily on controlling type I error rate However, it is also of importance that the rate of type II errors is controlled Statistical power reflects the sensitivity of our test (i.e. the power to detect a genuine effect 80% of the time) So what dictates the power of a test?

Statistical Power Significance tests determine a P value based upon 3 factors: – – – The power of a test is therefore dictated by the balance between the number of subjects recruited and the Effect Size (ES).

ES AKA Cohens d (d=D/SD) SubjectPre-ScorePost-ScoreDifference Tom12164 Dick14173 Harry10122 James12153 Mean12153 SD …BUT see Dunlap et al. (1996) Psychological Methods 1 (2) p

ES Interpretation/Application Effect size shows us the magnitude of our effect relative to SD Based upon the magnitude of correlation between trials, Jacob Cohen suggests thresholds of >0.2 (small), >0.5 (moderate) & >0.8 (large) –(n.b. others favour >0.2, >0.6 & >1.2) So effect size provides a useful tool for examining differences irrespective of sample size Another major application of ES is therefore to determine the required sample size for our study.

GB = s USA = s Smallest Worthwhile Effect D = 0.01 s

Smallest Worthwhile Effect It would appear that even a small amount of primary variance from an ergogenic aid would guarantee victory to either competitor… …however, the error variance is such that a re-run could produce entirely different results…

…for an effect to guarantee first place, it would need to exceed the opponents time by more than his error variance. Re-Run

Coefficient of Variation (CV) The coefficient of variation expresses within subject variation as a % of their average performance: –e.g. USA test-retest form last 10 training sessions s s s s s s s s s s Mean = SD = CV =

Smallest Worthwhile Effect So when conducting applied research into performance enhancement, the smallest worthwhile effect can be based on the actual % improvement that produces a worthwhile increase in your chance of winning the event –example for 100 m sprint an improvement of 0.3 of CV converts 2 nd1 st once every ten races Hopkins et al. (1999) MSSE 31 (3) p However, we dont always have such ecologically valid data to support our laboratory investigations.

Smallest Worthwhile Effect Ideally, it is recommended that a pilot study is conducted so that the typical effect size can be established and A Priori sample size calculations can be conducted Alternatively, the rationale for a planned study is often supported by previously published literature, in which case this data can be used as a guide to the magnitude of effects which can be expected.

Sample Size Estimation Overall, published data can be used for A Priori power analysis as a general guide for how many subjects to recruit Then post-hoc power analysis can be conducted to calculate the actual statistical power given the sample size attained –e.g. Using similar supplements to those under investigation in the present study, van Loon et al. (2000) reported the inclusion of protein to accelerate muscle glycogen resynthesis by 18.8 mmol glucosyl units kg dry mass -1 h -1, with a pooled standard deviation of 6.6 mmol glucosyl units kg dry mass -1 h -1. Based upon these data it was estimated that a sample size of 6 has a 99% power to detect such differences.

The purpose of sample size formulae is not to give an exact number…but rather to subject the study design to scrutiny, including an assessment of the validity and reliability of data collection, and to give an estimate to distinguish whether tens, hundreds, or thousands of participants are required Williamson et al. (2000) JRSS 163: p. 10

Summary The power of a statistical test is influenced by the size of the effect and sample size Effect size provides a useful tool for examining data when sample size is small The smallest worthwhile effect can also be applied to determine how many subjects would be required for statistical significance Remember that our choice of data for this analysis was very subjective in places.