Statistical Power

First: Effect Size. Effect size is the distance between two means in standardized units (not inferential). It is a measure of the impact of an intervention based on the distributions of the samples in your study. We use two measures of effect size:
– Cohen’s d
– Eta squared (η²)

Effect Size: Cohen’s d. Cohen’s d equals the difference between the means divided by the pooled standard deviation. It describes the distance between the means in units of pooled standard deviation (remember z scores?). It is a standardized measure of the impact of a statistically significant intervention (independent variable).
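
The calculation can be sketched in a few lines of Python. This is only an illustration: the function name `cohens_d` and the two small data sets are made up for the example, and the pooled standard deviation is computed from the sample variances (n - 1 in the denominator).

```python
from math import sqrt
from statistics import mean, variance


def cohens_d(group1, group2):
    """Difference between the means in units of pooled standard deviation."""
    n1, n2 = len(group1), len(group2)
    # variance() is the sample variance (n - 1 in the denominator).
    pooled_var = ((n1 - 1) * variance(group1) + (n2 - 1) * variance(group2)) / (n1 + n2 - 2)
    return (mean(group2) - mean(group1)) / sqrt(pooled_var)


# Hypothetical scores, invented for this example.
control = [10, 12, 14, 16, 18]
treatment = [12, 14, 16, 18, 20]

d = cohens_d(control, treatment)
print(f"Cohen's d = {d:.2f}")
```

Here the means differ by 2 points and both groups have a variance of 10, so d is 2 divided by the square root of 10, roughly 0.63.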

Effect Size: Eta Squared (η²). Eta squared is the ratio of between-group variance (impact of the intervention) to total variance, where total variance is the between-group variance plus the within-group variance. As the between-group variance becomes a larger portion of the total variance (more intervention impact), eta squared gets closer to 1. Eta squared is a standardized measure of the explanatory power of the independent variable.

$$\eta^2 = \frac{\text{sum of squares between}}{\text{sum of squares total}}$$
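
As a rough sketch of the same ratio in Python (the function name `eta_squared` and the three small groups are invented for the example):

```python
from statistics import mean


def eta_squared(groups):
    """Sum of squares between groups divided by total sum of squares."""
    all_scores = [x for g in groups for x in g]
    grand_mean = mean(all_scores)
    # Between: each group mean's squared distance from the grand mean, weighted by group size.
    ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
    # Total: every score's squared distance from the grand mean.
    ss_total = sum((x - grand_mean) ** 2 for x in all_scores)
    return ss_between / ss_total


# Hypothetical scores for three groups.
groups = [[1, 2, 3], [2, 3, 4], [6, 7, 8]]

eta2 = eta_squared(groups)
print(f"eta squared = {eta2:.3f}")
```

For these numbers the sum of squares between is 42 and the sum of squares total is 48, so eta squared is 0.875: most of the variation is between the groups.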

Effect Size. Table columns: Label, d, r, r² or η². Table rows: Extremely large effect, Very large effect, Large effect, Medium effect, Small effect. Although all the measures of effect size represent the impact of the independent variable in somewhat different ways, they are equivalent. For an expanded version see the Effect Size table on the website.

Effect Size. In the real world, how much difference did the independent variable make? Effect size is not based on inference. It is based on observed measures.

Now: Inferential Mistakes. [Figure labels: Sample Mean; Sample Distribution; Sampling Distribution Mean for a Given Group Size; Sampling Distribution for a Given Group Size]

Avoiding Type II Errors. Sampling Distribution of the Mean: a theoretical distribution based on randomly selected groups of a given size. Means in the .05 area would appear randomly less than 5% of the time.

Avoiding Type II Errors. Sometimes a mean score is not identified as significant because it is likely to appear randomly more than 5% of the time. Sometimes that mean score represents a real-world change in what is being measured, but the difference isn’t large enough to be significant. That is a Type II error.

Avoiding Type II Errors. A larger group size narrows the sampling distribution, which moves the point at which the .05 alpha level appears closer to the mean, so something that wasn’t significant becomes so. Using larger groups reduces Type II errors.
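
To make the shift concrete, here is a minimal sketch. It assumes a one-tailed test at alpha = .05, a known population standard deviation, and made-up numbers (population mean 100, standard deviation 15); the function name `significance_cutoff` is hypothetical.

```python
from math import sqrt
from statistics import NormalDist


def significance_cutoff(pop_mean, pop_sd, n, alpha=0.05):
    """Smallest group mean that falls in the upper alpha tail of the sampling distribution."""
    # The standard error shrinks as the group size grows.
    standard_error = pop_sd / sqrt(n)
    # z score that leaves alpha in the upper tail (about 1.645 for .05).
    z = NormalDist().inv_cdf(1 - alpha)
    return pop_mean + z * standard_error


cutoff_small = significance_cutoff(100, 15, n=10)
cutoff_large = significance_cutoff(100, 15, n=40)
print(f"n = 10: cutoff = {cutoff_small:.1f}")
print(f"n = 40: cutoff = {cutoff_large:.1f}")
```

With 10 per group the cutoff is about 107.8; with 40 per group it drops to about 103.9, so a group mean of 105 would be significant only with the larger group.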

Avoiding Type II Errors. Great. Using larger group sizes helps reduce Type II errors. But the cost is that developing large samples is difficult. We need to figure out how big the sample needs to be to reasonably reduce Type II errors while still keeping the group as small as possible.

Power. Power is defined as the probability of finding significance if it exists (avoiding Type II errors). Eighty percent (.80) is accepted as a reasonable target power: if non-random change occurs, it has an 80% probability of being observed.
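
In symbols, using the conventional notation (β for the probability of a Type II error and H₀ for the null hypothesis, neither of which appears on the slide), the definition reads:

```latex
\text{Power} = 1 - \beta = P(\text{reject } H_0 \mid H_0 \text{ is false})
```

A target power of .80 therefore corresponds to accepting a Type II error probability of β = .20.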

Power. There are two ways to use power calculations. First, they can be used to figure out appropriate sample sizes for a study. Second, they can be used to evaluate the use of a specific sample size after a study has been completed.

Power and Effect Size. If a study shows larger effect sizes, smaller sample sizes can still be expected to show significance. Conversely, smaller effect sizes require larger sample sizes. Fortunately, all of this can be read off of a table.
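
A table of this kind can be approximated with a short script. The sketch below is not the table from the website: it assumes two equal-sized independent groups, a two-tailed test at alpha = .05, a target power of .80, and a normal approximation (published tables based on the t distribution give slightly larger numbers). The function name and the three effect sizes are chosen only for illustration.

```python
from math import ceil
from statistics import NormalDist


def required_n_per_group(d, power=0.80, alpha=0.05):
    """Approximate group size needed to detect effect size d (normal approximation)."""
    z_alpha = NormalDist().inv_cdf(1 - alpha / 2)  # two-tailed critical value
    z_power = NormalDist().inv_cdf(power)          # z score matching the target power
    return ceil(2 * ((z_alpha + z_power) / d) ** 2)


effect_sizes = [0.2, 0.5, 0.8]
table = {d: required_n_per_group(d) for d in effect_sizes}
for d, n in table.items():
    print(f"d = {d}: about {n} per group")
```

The required group size falls quickly as the effect size grows, which is the pattern the slide describes.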

Power and Effect Size. [Tables: Choosing Sample Sizes When Designing a Study; Using Power to Explain Results]

While we are here … Remember we talked about inferential errors where something appears significant but really isn’t? Those are Type I errors.

Avoiding Type I Errors. When only chance is operating, 5 out of every 100 means will still appear in the .05 area. That means we would identify something as not random (significant) 5 times out of 100 and be wrong. That is a Type I error.

Avoiding Type I Errors. Solution: move the alpha level to .01. That means we would identify something as not random (significant) 1 time out of 100 and be wrong. There is less chance of a Type I error.
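
A small simulation can illustrate the two alpha levels. This is a sketch under stated assumptions: every group is drawn from a population where nothing is going on (mean 0, standard deviation 1, both known), the test is two-tailed, and the function name, group size, number of trials, and random seed are arbitrary choices for the example.

```python
import random
from math import sqrt
from statistics import NormalDist


def false_positive_rate(alpha, n=25, trials=4000, seed=1):
    """Share of purely random groups that get flagged as significant."""
    rng = random.Random(seed)
    z_crit = NormalDist().inv_cdf(1 - alpha / 2)  # two-tailed cutoff
    hits = 0
    for _ in range(trials):
        # No real effect: every score comes from the same population.
        sample = [rng.gauss(0, 1) for _ in range(n)]
        z = (sum(sample) / n) * sqrt(n)  # mean divided by its standard error (1 / sqrt(n))
        if abs(z) > z_crit:
            hits += 1  # flagged as significant even though only chance was operating
    return hits / trials


rate_05 = false_positive_rate(0.05)
rate_01 = false_positive_rate(0.01)
print(f"alpha = .05: {rate_05:.3f} of random groups flagged")
print(f"alpha = .01: {rate_01:.3f} of random groups flagged")
```

The flagged share should land near .05 and .01 respectively, since every flag in this setup is a Type I error.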

Avoiding Type I Errors. But this is social science, and there is no good reason to make it this difficult to demonstrate significance. An alpha level of .05 is reasonable.

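Avoiding Type I Errors. With larger group sizes, a mean difference of a given size has a smaller probability of appearing by chance. The Type I error rate itself stays at the chosen alpha level (.05); what larger groups reduce is the size of the chance differences that can reach significance.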

Sample Sizes. We know that using larger sample sizes is statistically powerful, but life isn’t that simple. Whenever possible, use Power Analysis to help you be more confident you will find something if it is there. When things don’t appear to be significant, Power Analysis at least gives you something else to talk about to suggest what might be done to improve the quality of your data.

Examples
1. Most of the studies in your lit review are showing medium effects, around 0.4 Cohen’s d. You want to be 90% sure you find non-random effects if they are there. Approximately how big does your sample need to be?
2. In your study you showed mean differences of 0.4 Cohen’s d, but the groups were not significantly different. Your sample size was 30. What was the probability of finding significant differences if they were there?
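
One way to check answers to both questions is a normal approximation, sketched below. Several assumptions are made here that the slide does not state: two equal-sized independent groups, a two-tailed test at alpha = .05, and, for the second question, that the sample size of 30 means 30 per group. The function names are hypothetical, and a power table based on the t distribution will give slightly different numbers.

```python
from math import ceil, sqrt
from statistics import NormalDist

Z = NormalDist()  # standard normal distribution


def required_n_per_group(d, power, alpha=0.05):
    """Approximate group size needed to reach the target power."""
    z_alpha = Z.inv_cdf(1 - alpha / 2)
    z_power = Z.inv_cdf(power)
    return ceil(2 * ((z_alpha + z_power) / d) ** 2)


def approximate_power(d, n_per_group, alpha=0.05):
    """Approximate probability of reaching significance for effect size d."""
    z_alpha = Z.inv_cdf(1 - alpha / 2)
    shift = d * sqrt(n_per_group / 2)  # expected z score of the group difference
    return 1 - Z.cdf(z_alpha - shift)


# Example 1: d = 0.4, target power .90.
n_needed = required_n_per_group(0.4, power=0.90)
print(f"Example 1: about {n_needed} per group")

# Example 2: d = 0.4, assumed 30 per group.
observed_power = approximate_power(0.4, n_per_group=30)
print(f"Example 2: power of about {observed_power:.2f}")
```

Under these assumptions the first example needs about 132 per group, and the second study had only about a 34% chance of finding the difference, which would explain the non-significant result.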

Analysis

Inferential Statistics Assumptions
– The dependent variable is an interval measure of one characteristic of a group.
– Tests are based on knowing or assuming the distribution of a population.
– Statistics demonstrate whether comparison samples are from the same population.

Testing Group Differences
Independent | Dependent | Test
1 (1 group) | 1 measure 2 times (intervention in between) | Paired t-test
1 (2 groups) | 1 measure (usually after the intervention) | Independent samples t-test
1 (1 group) | 1 measure 3 or more times (usually two after the intervention) | Repeated measures ANOVA
1 (2 or more groups) | 1 measure (usually after the intervention) | Single factor ANOVA
Non-Parametric
1 group | 2 non-interval measures | Chi-square
EZA
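
The difference between the first two rows can be seen by computing both t statistics on the same numbers. The sketch below uses invented scores and hypothetical function names, and it stops at the t statistic (looking up the p value needs a t table or a statistics package).

```python
from math import sqrt
from statistics import mean, variance


def paired_t(before, after):
    """t statistic for one group measured twice (paired t-test)."""
    diffs = [a - b for a, b in zip(after, before)]
    standard_error = sqrt(variance(diffs) / len(diffs))
    return mean(diffs) / standard_error


def independent_t(group1, group2):
    """t statistic for two separate groups (equal-variance independent samples t-test)."""
    n1, n2 = len(group1), len(group2)
    pooled_var = ((n1 - 1) * variance(group1) + (n2 - 1) * variance(group2)) / (n1 + n2 - 2)
    standard_error = sqrt(pooled_var * (1 / n1 + 1 / n2))
    return (mean(group2) - mean(group1)) / standard_error


# Hypothetical scores: the same five people before and after an intervention.
before = [10, 12, 14, 16, 18]
after = [12, 15, 15, 19, 21]

t_paired = paired_t(before, after)
# Treating the same numbers as two unrelated groups, for comparison only.
t_independent = independent_t(before, after)
print(f"paired t = {t_paired:.2f}, independent t = {t_independent:.2f}")
```

For these numbers the paired t is 6.0 while the independent t is only about 1.12, because the paired test removes the person-to-person variation. That is why the design (one group measured twice versus two groups) determines which test to use.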

Post Hoc Tests
Analysis | Group sizes | Test
ANOVA | Similar group sizes | Tukey’s
ANOVA | Dissimilar group sizes | Scheffe’s
EZA

Practical Significance (not inferential)
Basis | Test
Pooled standard deviation | Cohen’s d
Ratio of variances | Eta squared
EZA

Testing Group Differences (Things We Haven’t Done)
Independent | Dependent | Test
2 (2 or more groups) | 1 measure | Factorial (2-way ANOVA). Shows interaction between groups.
2 (2 or more groups) | 1 measure 2 or more times | ANCOVA (Analysis of Covariance). Allows for control of instance of dependent measure.
2 or more variables | 2 or more measures | MANOVA (Multivariate Analysis of Variance)
2 or more variables | 2 or more measures 2 or more times | MANCOVA (Multivariate Analysis of Covariance). Allows for control of instance of dependent measure.
Non-Parametric
2 (2 or more groups) | Non-interval | Kruskal-Wallis (ranked sum)
EZA?

Correlational Statistics Assumptions
– The relationship among the measures of two characteristics is linear.
– Compared measures come from individuals in the same population.
– Correlations are not causal.

How Do Variables Relate?
Comparison | Test
Association of 2 or more interval measures | Pearson’s r
Association of 2 measures at least one of which is not interval (ranked comparisons) | Spearman’s ρ (rho)
Measure of internal consistency | Cronbach’s alpha
Prediction based on association of 2 measures | Linear Regression
Things We Haven’t Done
Association of 3 or more measures at least one of which is not interval (ranked comparisons) | Kendall’s τ (tau)
Prediction based on 3 or more associations | Multiple Regression
Finding relationships among items in a set of items (data reduction) | Factor Analysis
EZA EZA? EZA
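
The first two rows can be compared with a short sketch. The data and function names are invented for the example, and the ranking helper assumes there are no tied values (real data with ties would need average ranks).

```python
from math import sqrt
from statistics import mean


def pearson_r(x, y):
    """Pearson's r: strength of the linear association between two measures."""
    mx, my = mean(x), mean(y)
    covariation = sum((a - mx) * (b - my) for a, b in zip(x, y))
    return covariation / sqrt(sum((a - mx) ** 2 for a in x) * sum((b - my) ** 2 for b in y))


def ranks(values):
    """Rank from 1 (smallest) to n (largest); assumes no tied values."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0] * len(values)
    for rank, i in enumerate(order, start=1):
        result[i] = rank
    return result


def spearman_rho(x, y):
    """Spearman's rho: Pearson's r computed on the ranks."""
    return pearson_r(ranks(x), ranks(y))


# Hypothetical data: the second measure rises with the first, but not in a straight line.
hours = [1, 2, 3, 4, 5]
scores = [1, 4, 9, 16, 25]

r = pearson_r(hours, scores)
rho = spearman_rho(hours, scores)
print(f"Pearson's r = {r:.3f}, Spearman's rho = {rho:.3f}")
```

Spearman’s rho is exactly 1 here because the rank order matches perfectly, while Pearson’s r is slightly lower (about 0.98) because the relationship is not perfectly linear.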