In SAS March 11, 2013 - Mariya Cheryomina.  power (π) + β = 1  β = (1- π) = probability of accepting false Ho (ie. reject true Ha) - probability of.

Slides:



Advertisements
Similar presentations
Research Methodology Statistics Maha Omair Teaching Assistant Department of Statistics, College of science King Saud University.
Advertisements

Inferential Statistics
Our goal is to assess the evidence provided by the data in favor of some claim about the population. Section 6.2Tests of Significance.
Inferential Statistics & Hypothesis Testing
Statistical Significance What is Statistical Significance? What is Statistical Significance? How Do We Know Whether a Result is Statistically Significant?
HYPOTHESIS TESTING Four Steps Statistical Significance Outcomes Sampling Distributions.
Sample size computations Petter Mostad
Statistical Significance What is Statistical Significance? How Do We Know Whether a Result is Statistically Significant? How Do We Know Whether a Result.
Elementary hypothesis testing Purpose of hypothesis testing Type of hypotheses Type of errors Critical regions Significant levels Hypothesis vs intervals.
MARE 250 Dr. Jason Turner Hypothesis Testing III.
Understanding Research Results. Effect Size Effect Size – strength of relationship & magnitude of effect Effect size r = √ (t2/(t2+df))
Analysis of Variance Chapter 3Design & Analysis of Experiments 7E 2009 Montgomery 1.
Independent Sample T-test Often used with experimental designs N subjects are randomly assigned to two groups (Control * Treatment). After treatment, the.
Chapter 12 Inferring from the Data. Inferring from Data Estimation and Significance testing.
Independent Sample T-test Classical design used in psychology/medicine N subjects are randomly assigned to two groups (Control * Treatment). After treatment,
Descriptive Statistics
Inferential Statistics
Introduction to Hypothesis Testing
POLS 7000X STATISTICS IN POLITICAL SCIENCE CLASS 7 BROOKLYN COLLEGE-CUNY SHANG E. HA Leon-Guerrero and Frankfort-Nachmias, Essentials of Statistics for.
AM Recitation 2/10/11.
Overview of Statistical Hypothesis Testing: The z-Test
Introduction to Hypothesis Testing for μ Research Problem: Infant Touch Intervention Designed to increase child growth/weight Weight at age 2: Known population:
1/2555 สมศักดิ์ ศิวดำรงพงศ์
Inference in practice BPS chapter 16 © 2006 W.H. Freeman and Company.
Psy B07 Chapter 8Slide 1 POWER. Psy B07 Chapter 8Slide 2 Chapter 4 flashback  Type I error is the probability of rejecting the null hypothesis when it.
Go to Index Analysis of Means Farrokh Alemi, Ph.D. Kashif Haqqi M.D.
Fundamentals of Data Analysis Lecture 4 Testing of statistical hypotheses.
Statistical Power The ability to find a difference when one really exists.
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 17 Inferential Statistics.
Chapter 9: Testing Hypotheses
10.2 Tests of Significance Use confidence intervals when the goal is to estimate the population parameter If the goal is to.
Lecture 16 Dustin Lueker.  Charlie claims that the average commute of his coworkers is 15 miles. Stu believes it is greater than that so he decides to.
CONFIDENCE INTERVAL It is the interval or range of values which most likely encompasses the true population value. It is the extent that a particular.
PCB 3043L - General Ecology Data Analysis. OUTLINE Organizing an ecological study Basic sampling terminology Statistical analysis of data –Why use statistics?
Test for Significant Differences T- Tests. T- Test T-test – is a statistical test that compares two data sets, and determines if there is a significant.
STA Lecture 251 STA 291 Lecture 25 Testing the hypothesis about Population Mean Inference about a Population Mean, or compare two population means.
10.1: Confidence Intervals Falls under the topic of “Inference.” Inference means we are attempting to answer the question, “How good is our answer?” Mathematically:
The diastolic blood pressure for American women aged has approx
General Linear Model 2 Intro to ANOVA.
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
Copyright ©2013 Pearson Education, Inc. publishing as Prentice Hall 9-1 σ σ.
Sample Size Determination Text, Section 3-7, pg. 101 FAQ in designed experiments (what’s the number of replicates to run?) Answer depends on lots of things;
Lecture 17 Dustin Lueker.  A way of statistically testing a hypothesis by comparing the data to values predicted by the hypothesis ◦ Data that fall far.
Hypothesis Testing. Why do we need it? – simply, we are looking for something – a statistical measure - that will allow us to conclude there is truly.
Hypothesis Testing An understanding of the method of hypothesis testing is essential for understanding how both the natural and social sciences advance.
STAT 3120 Statistical Methods I Statistical Power.
PCB 3043L - General Ecology Data Analysis.
Introducing Communication Research 2e © 2014 SAGE Publications Chapter Seven Generalizing From Research Results: Inferential Statistics.
Statistical Inference Drawing conclusions (“to infer”) about a population based upon data from a sample. Drawing conclusions (“to infer”) about a population.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
AP Statistics Chapter 21 Notes
Sampling Distribution (a.k.a. “Distribution of Sample Outcomes”) – Based on the laws of probability – “OUTCOMES” = proportions, means, test statistics.
1 Testing Statistical Hypothesis The One Sample t-Test Heibatollah Baghi, and Mastee Badii.
Hypothesis test flow chart
Chapter 13 Understanding research results: statistical inference.
Hypothesis Tests u Structure of hypothesis tests 1. choose the appropriate test »based on: data characteristics, study objectives »parametric or nonparametric.
C HAPTER 2  Hypothesis Testing -Test for one means - Test for two means -Test for one and two proportions.
CHAPTER 7: TESTING HYPOTHESES Leon-Guerrero and Frankfort-Nachmias, Essentials of Statistics for a Diverse Society.
Fundamentals of Data Analysis Lecture 4 Testing of statistical hypotheses pt.1.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
More on Inference.
Hypothesis Testing: One Sample Cases
PCB 3043L - General Ecology Data Analysis.
Hypothesis Testing: Hypotheses
Statistical Process Control
More on Inference.
Introduction to ANOVA.
Statistical Inference for Managers
Section 11.1: Significance Tests: Basics
STA 291 Spring 2008 Lecture 17 Dustin Lueker.
Presentation transcript:

In SAS March 11, Mariya Cheryomina

 power (π) + β = 1  β = (1- π) = probability of accepting false Ho (ie. reject true Ha) - probability of Type II error – false positive  Power (π) = (1- β) = probability of detecting a difference when a difference does exist - probability of accepting true Ha (ie. reject false Ho) – how sensitive your test is to the existing difference between the compared samples

 Generally, the minimal sufficient (acceptable) value of power is 0.80  π ≥ 0.80

 1) Before gathering data  To determine the minimal sample size needed to have desired power in statistical testing (to detect a particular effect size)  2) After gathering data  To determine the magnitude of power that your statistical test will have given the sample parameters (n and s) and the magnitude of the effect that you want to detect

 Sample size (n)  Standard deviation (s)  Alpha level (α )  Size of effect/difference that you want to detect  Type of statistical test performed

 One-sample t-test  One-way Anova

proc power; onesamplemeans (type of statistical test you want the power to be calculated for) mean = ____ (difference you are interested in detecting) ntotal = ____ (sample size) stddev = ____ power = ____ ; run; *One of the four variables must be left blank – this is what you want SAS to calculate

You are asked to determine whether the use of a new soil type leads to a significantly different average height of young pine trees on plantation (compared to the historical/hypotherical mean height of 110cm recorded with the old soil type). You want to conduct a one-sample t test with a 2-sided α = You select150 trees (n). You decide that the minimal difference in height worth addressing is 8cm (effect size) α = 0.05 s = 40 n = 150 “mean” = 8 (effect size) What will the power of your statistical test be? Ho:  = 110cm Ha:  ≤ 102cm OR  ≥ 118cm

SAS text: proc power; onesamplemeans mean = 8 ntotal = 150 stddev = 40 power =.; run; *Unless you indicate otherwise, SAS will automatically assume that α = 0.05

There is a there is a probability of 0.68 that the t test will produce a significant result indicating a difference in mean tree height of at least 8cm

 Sample size  Standard deviation

You are interested in finding out how the changes in samples size (n), standard deviation (s) and minimal effect size of interest (“mean”) will effect the power of your one-sample t- test: proc power; onesamplemeans mean = 5 10 ntotal = 150 stddev = power =.; plot x=n min=100 max=200; run;

Effect size Increasing sample size does not infinitely increase power

 Example 2: You want to compare the average diversity of Canada’s native bee species in four types of habitat: 1) Urban 2) Agricultural : Monoculture plantations (with pesticide use) 3) Agricultural : Organic farms (no pesticide use) 4) National parks Ho: μ 1 = μ 2 = μ 3 = μ 4 Ha: one of μ is different

 What is the minimal number of sites of each habitat type that needs to be surveyed (ie. minimal sample size of each group) to detect whether a significant difference in bee species diversity exists between any of the four habitat types? (using a one-way Anova test with α = 0.05)  The desired power of 0.9.

*Analysis of power usually involves a number of simplifying assumptions Assumptions: Average number of bee species surveyed per site in Canada is ~35 species with a standard deviation of ~ 10 species Based on available research, you predict the following average bee species diversity for each habitat type: 1) Urban - 25 species 2) Monoculture plantations - 30 species 3) Organic farms– 40 species 4) National parks – 45 species *Assume that all groups have the same stdev (s) All numbers used in example were invented for the purpose of the exercise

proc power ; onewayanova groupmeans = 25| 30 | 40 | 45 stddev = 10 alpha = 0.05 npergroup =. power =.9; run;

SAS finds the group sample size that gives a power (actual power) closest to the power you desire (ie. to the nominal power)

 You will need to survey at least 7 locations with each of the four habitat types (ie. 7 urban sites, etc.) to detect the desired (significant) difference between the mean diversities of bee species found at each of the habitat types

Now you are trying to detect a smaller difference between sample means (at a significant level) Observe the new minimal sample size for each group

 Sample size (↑) X approaches  when n approaches N  Standard deviation (↓) Difference between samples is less likely to cur simply due to random sampling effects  (α ) (↑) Higher α leads to lower β which results in higher power (ie. the more willing you are to reject Ho, the less likely you are to accept false Ho, which leads to a higher probability of detecting a truly existing significant difference)  Minimal effect size (↑) (difference that you want to detect) A large difference between samples is less likely to occur due to random variability between samples than a small difference is

 When you are expecting a large effect size, but are not fully confident that the true effect is as large, use a larger sample size (ie. one that the analysis of power suggests for detecting a smaller effect )

 Analysis of power for Anova (in SAS):  Analysis of power for one-sample and two-sample t-tests (in SAS):