UWHC Scholarly Forum April 17, 2013 Ismor Fischer, Ph.D. UW Dept of Statistics, UW Dept of Biostatistics and Medical Informatics

Slides:



Advertisements
Similar presentations
Is it statistically significant?
Advertisements

© 2013 Pearson Education, Inc. Active Learning Lecture Slides For use with Classroom Response Systems Introductory Statistics: Exploring the World through.
Chapter 10: Hypothesis Testing
Hypothesis Testing Steps of a Statistical Significance Test. 1. Assumptions Type of data, form of population, method of sampling, sample size.
Hypothesis Testing After 2 hours of frustration trying to fill out an IRS form, you are skeptical about the IRS claim that the form takes 15 minutes on.
Final Jeopardy $100 $200 $300 $400 $500 $100 $200 $300 $400 $500 $100 $200 $300 $400 $500 $100 $200 $300 $400 $500 $100 $200 $300 $400 $500 LosingConfidenceLosingConfidenceTesting.
Business Statistics for Managerial Decision
Topic 2: Statistical Concepts and Market Returns
Stat 112 – Notes 3 Homework 1 is due at the beginning of class next Thursday.
Click on image for full.pdf article Links in article to access datasets.
BCOR 1020 Business Statistics Lecture 21 – April 8, 2008.
Chapter 11: Inference for Distributions
Chapter 9 Hypothesis Testing.
Inference about Population Parameters: Hypothesis Testing
7.1 Lecture 10/29.
Power and Sample Size IF IF the null hypothesis H 0 : μ = μ 0 is true, then we should expect a random sample mean to lie in its “acceptance region” with.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 9 Hypothesis Testing.
Overview Definition Hypothesis
Section 9.1 Introduction to Statistical Tests 9.1 / 1 Hypothesis testing is used to make decisions concerning the value of a parameter.
Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University ECON 4550 Econometrics Memorial University of Newfoundland.
1 Level of Significance α is a predetermined value by convention usually 0.05 α = 0.05 corresponds to the 95% confidence level We are accepting the risk.
The paired sample experiment The paired t test. Frequently one is interested in comparing the effects of two treatments (drugs, etc…) on a response variable.
June 18, 2008Stat Lecture 11 - Confidence Intervals 1 Introduction to Inference Sampling Distributions, Confidence Intervals and Hypothesis Testing.
LECTURE 21 THURS, 23 April STA 291 Spring
Lecture 20 Dustin Lueker.  The p-value for testing H 1 : µ≠100 is p=.001. This indicates that… 1.There is strong evidence that μ=100 2.There is strong.
6.1 - One Sample One Sample  Mean μ, Variance σ 2, Proportion π Two Samples Two Samples  Means, Variances, Proportions μ 1 vs. μ 2.
Maximum Likelihood Estimator of Proportion Let {s 1,s 2,…,s n } be a set of independent outcomes from a Bernoulli experiment with unknown probability.
10.2 Tests of Significance Use confidence intervals when the goal is to estimate the population parameter If the goal is to.
Hypothesis Testing Hypothesis Testing Topic 11. Hypothesis Testing Another way of looking at statistical inference in which we want to ask a question.
The Practice of Statistics Third Edition Chapter 10: Estimating with Confidence Copyright © 2008 by W. H. Freeman & Company Daniel S. Yates.
Statistics - methodology for collecting, analyzing, interpreting and drawing conclusions from collected data Anastasia Kadina GM presentation 6/15/2015.
Essential Statistics Chapter 141 Thinking about Inference.
Introduction to Inferece BPS chapter 14 © 2010 W.H. Freeman and Company.
1 Chapter 9 Hypothesis Testing. 2 Chapter Outline  Developing Null and Alternative Hypothesis  Type I and Type II Errors  Population Mean: Known 
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 8 Hypothesis Testing.
Introduction to the Practice of Statistics Fifth Edition Chapter 6: Introduction to Inference Copyright © 2005 by W. H. Freeman and Company David S. Moore.
BPS - 3rd Ed. Chapter 161 Inference about a Population Mean.
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
Lecture 17 Dustin Lueker.  A way of statistically testing a hypothesis by comparing the data to values predicted by the hypothesis ◦ Data that fall far.
BPS - 3rd Ed. Chapter 141 Tests of significance: the basics.
Slide Slide 1 Section 8-4 Testing a Claim About a Mean:  Known.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
: An alternative representation of level of significance. - normal distribution applies. - α level of significance (e.g. 5% in two tails) determines the.
Logic and Vocabulary of Hypothesis Tests Chapter 13.
Hypothesis Testing Errors. Hypothesis Testing Suppose we believe the average systolic blood pressure of healthy adults is normally distributed with mean.
BPS - 5th Ed. Chapter 151 Thinking about Inference.
Business Statistics for Managerial Decision Farideh Dehkordi-Vakil.
6.1 - One Sample One Sample  Mean μ, Variance σ 2, Proportion π Two Samples Two Samples  Means, Variances, Proportions μ 1 vs. μ 2.
AP Statistics Chapter 11 Notes. Significance Test & Hypothesis Significance test: a formal procedure for comparing observed data with a hypothesis whose.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Understanding Basic Statistics Fourth Edition By Brase and Brase Prepared by: Lynn Smith Gloucester County College Chapter Nine Hypothesis Testing.
Introduction to Basic Statistical Methods Part 1: “Statistics in a Nutshell” UWHC Scholarly Forum March 19, 2014 Ismor Fischer, Ph.D. UW Dept of Statistics.
Testing a Single Mean Module 16. Tests of Significance Confidence intervals are used to estimate a population parameter. Tests of Significance or Hypothesis.
1 Testing Statistical Hypothesis The One Sample t-Test Heibatollah Baghi, and Mastee Badii.
Introduction to Basic Statistical Methods Part 1: Statistics in a Nutshell UWHC Scholarly Forum May 21, 2014 Ismor Fischer, Ph.D. UW Dept of Statistics.
Lecture 19 Dustin Lueker.  The p-value for testing H 1 : µ≠100 is p=.001. This indicates that… 1.There is strong evidence that μ=100 2.There is strong.
Hypothesis Testing. Suppose we believe the average systolic blood pressure of healthy adults is normally distributed with mean μ = 120 and variance σ.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
1 Underlying population distribution is continuous. No other assumptions. Data need not be quantitative, but may be categorical or rank data. Very quick.
16/23/2016Inference about µ1 Chapter 17 Inference about a Population Mean.
STAT 312 Chapter 7 - Statistical Intervals Based on a Single Sample
Introduction to Statistics for Engineers
STAT 312 Chapter 7 - Statistical Intervals Based on a Single Sample
CHAPTER 6 Statistical Inference & Hypothesis Testing
STAT 312 Chapter 7 - Statistical Intervals Based on a Single Sample
Week 10 Chapter 16. Confidence Intervals for Proportions
Chapter 9 Hypothesis Testing.
Sections 6-4 & 7-5 Estimation and Inferences Variation
Chapter Nine Part 1 (Sections 9.1 & 9.2) Hypothesis Testing
Statistical Inference for the Mean: t-test
Presentation transcript:

UWHC Scholarly Forum April 17, 2013 Ismor Fischer, Ph.D. UW Dept of Statistics, UW Dept of Biostatistics and Medical Informatics

UWHC Scholarly Forum April 17, 2013 Ismor Fischer, Ph.D. UW Dept of Statistics, UW Dept of Biostatistics and Medical Informatics All slides posted at

Click on image for full.pdf article Links in article to access datasets

POPULATION Study Question: Has “Mean (i.e., average) Age at First Birth” of women in the U.S. changed since 2010 (25.4 yrs old)? Present Day: Assume “Mean Age at First Birth” follows a normal distribution (i.e., “bell curve”) in the population. “Statistical Inference”

~ The Normal Distribution ~  symmetric about its mean  unimodal (i.e., one peak), with left and right “tails”  models many (but not all) naturally-occurring systems  useful mathematical properties… “population mean” “population standard deviation” 

~ The Normal Distribution ~ “population mean” “population standard deviation”  symmetric about its mean  unimodal (i.e., one peak), with left and right “tails”  models many (but not all) naturally-occurring systems   useful mathematical properties…

~ The Normal Distribution ~ “population standard deviation”  symmetric about its mean  unimodal (i.e., one peak), with left and right “tails”  models many (but not all) naturally-occurring systems Approximately 95% of the population values are contained between  – 2 σ and  + 2 σ. 95% is called the confidence level. 5% is called the significance level. 95% 2.5% ≈ 2 σ “population mean”   useful mathematical properties…

POPULATION “Null Hypothesis” via… “Hypothesis Testing” Study Question: Has “Mean (i.e., average) Age at First Birth” of women in the U.S. changed since 2010 (25.4 yrs old)? H 0 : pop mean age  = 25.4 (i.e., no change since 2010) “Statistical Inference” Present Day: Assume “Mean Age at First Birth” follows a normal distribution (i.e., “bell curve”) in the population.  cannot be found with 100% certainty, but can be estimated with high confidence (e.g., 95%).

Is the difference STATISTICALLY SIGNIFICANT, at the 5% level? POPULATION “Null Hypothesis” via… “Hypothesis Testing” Study Question: Has “Mean (i.e., average) Age at First Birth” of women in the U.S. changed since 2010 (25.4 yrs old)? FORM ULA x1x1 x4x4 x3x3 x2x2 x5x5 x 400 … etc… H 0 : pop mean age  = 25.4 (i.e., no change since 2010) sample mean age Do the data tend to support or refute the null hypothesis? “Statistical Inference” T-test Present Day: Assume “Mean Age at First Birth” follows a normal distribution (i.e., “bell curve”) in the population.

Samples, size n ~ The Normal Distribution ~ … etc… CENTRAL LIMIT THEOREM

Approximately 95% of the sample mean values are contained between and 95% 2.5% ≈ 2 σ ~ The Normal Distribution ~ Approximately 95% of the population values are contained between  – 2 σ and  + 2 σ. Approximately 95% of the intervals from to contain , and approx 5% do not.

Approximately 95% of the intervals from to contain , and approx 5% do not. 95% margin of error

PROBLEM! σ is unknown the vast majority of the time!PROBLEM! Present Day: Assume “Mean Age at First Birth” follows a normal distribution (i.e., “bell curve”) in the population. “Null Hypothesis” via… “Hypothesis Testing” H 0 : pop mean age  = 25.4 (i.e., no change since 2010) sample mean = 25.6 “Statistical Inference” POPULATION Study Question: Has “Mean (i.e., average) Age at First Birth” of women in the U.S. changed since 2010 (25.4 yrs old)? FORM ULA SAMPLE n = 400 ages x3x3 x2x2 x5x5 x 400 … etc… x1x1 x4x4 95% margin of error Approximately 95% of the intervals from to contain , and approx 5% do not.

Present Day: Assume “Mean Age at First Birth” follows a normal distribution (i.e., “bell curve”) in the population. POPULATION “Null Hypothesis” via… “Hypothesis Testing” Study Question: Has “Mean (i.e., average) Age at First Birth” of women in the U.S. changed since 2010 (25.4 yrs old)? FORM ULA SAMPLE n = 400 ages H 0 : pop mean age  = 25.4 (i.e., no change since 2010) sample mean = 25.6 “Statistical Inference” x3x3 x2x2 x5x5 x 400 … etc… x1x1 x4x4 sample standard deviation sample variance 95% margin of error = modified average of the squared deviations from the mean

Present Day: Assume “Mean Age at First Birth” follows a normal distribution (i.e., “bell curve”) in the population. POPULATION “Null Hypothesis” via… “Hypothesis Testing” Study Question: Has “Mean (i.e., average) Age at First Birth” of women in the U.S. changed since 2010 (25.4 yrs old)? FORM ULA SAMPLE n = 400 ages sample mean = 25.6 “Statistical Inference” x3x3 x2x2 x5x5 x 400 … etc… x1x1 x4x4 sample variance = 1.6 sample standard deviation = % margin of error H 0 : pop mean age  = 25.4 (i.e., no change since 2010)

Approximately 95% of the intervals from to contain , and approx 5% do not.

BASED ON OUR SAMPLE DATA, the true value of μ today is between and years, with 95% “confidence” (…akin to “probability”). = % margin of error = 0.16

BASED ON OUR SAMPLE DATA, the true value of μ today is between and years, with 95% “confidence” (…akin to “probability”). 95% CONFIDENCE INTERVAL FOR µ “P-VALUE” of our sample Very informally, the p-value of a sample is the probability (hence a number between 0 and 1) that it “agrees” with the null hypothesis. Hence a very small p-value indicates strong evidence against the null hypothesis. The smaller the p-value, the stronger the evidence, and the more “statistically significant” the finding. Two main ways to conduct a formal hypothesis test:

Very informally, the p-value of a sample is the probability (hence a number between 0 and 1) that it “agrees” with the null hypothesis. Hence a very small p-value indicates strong evidence against the null hypothesis. The smaller the p-value, the stronger the evidence, and the more “statistically significant” the finding BASED ON OUR SAMPLE DATA, the true value of μ today is between and years, with 95% “confidence” (…akin to “probability”). 95% CONFIDENCE INTERVAL FOR µ Two main ways to conduct a formal hypothesis test: “P-VALUE” of our sample IF H 0 is true, then we would expect a random sample mean that is at least 0.2 years away from  = 25.4 (as ours was), to occur with probability 1.24%. FORMAL CONCLUSIONS:  The 95% confidence interval corresponding to our sample mean does not contain the “null value” of the population mean, μ = 25.4 years.  The p-value of our sample,.0124, is less than the predetermined α =.05 significance level. Based on our sample data, we may (moderately) reject the null hypothesis H 0 : μ = 25.4 in favor of the two-sided alternative hypothesis H A : μ ≠ 25.4, at the α =.05 significance level. INTERPRETATION: According to the results of this study, there exists a statistically significant difference between the mean ages at first birth in 2010 (25.4 years old) and today, at the 5% significance level. Moreover, the evidence from the sample data would suggest that the population mean age today is significantly older than in 2010, rather than significantly younger. FORMAL CONCLUSIONS:  The 95% confidence interval corresponding to our sample mean does not contain the “null value” of the population mean, μ = 25.4 years.  The p-value of our sample,.0124, is less than the predetermined α =.05 significance level. Based on our sample data, we may (moderately) reject the null hypothesis H 0 : μ = 25.4 in favor of the two-sided alternative hypothesis H A : μ ≠ 25.4, at the α =.05 significance level. INTERPRETATION: According to the results of this study, there exists a statistically significant difference between the mean ages at first birth in 2010 (25.4 years old) and today, at the 5% significance level. Moreover, the evidence from the sample data would suggest that the population mean age today is significantly older than in 2010, rather than significantly younger.

POPULATION “Null Hypothesis” via… “Hypothesis Testing” Study Question: Has “Mean (i.e., average) Age at First Birth” of women in the U.S. changed since 2010 (25.4 yrs old)? FORM ULA x1x1 x4x4 x3x3 x2x2 x5x5 x 400 … etc… H 0 : pop mean age  = 25.4 (i.e., no change since 2010) sample mean age Do the data tend to support or refute the null hypothesis? Is the difference STATISTICALLY SIGNIFICANT, at the 5% level? “Statistical Inference” Present Day: Assume “Mean Age at First Birth” follows a normal distribution (i.e., “bell curve”) in the population. T-test Two loose ends

POPULATION “Null Hypothesis” via… “Hypothesis Testing” Study Question: Has “Mean (i.e., average) Age at First Birth” of women in the U.S. changed since 2010 (25.4 yrs old)? H 0 : pop mean age  = 25.4 (i.e., no change since 2010) “Statistical Inference” Present Day: Assume “Mean Age at First Birth” follows a normal distribution (i.e., “bell curve”) in the population. T-test The reasonableness of the normality assumption is empirically verifiable, and in fact formally testable from the sample data. If violated (e.g., skewed) or inconclusive (e.g., small sample size), then “distribution-free” nonparametric tests can be used instead of the T-test. Examples: Sign Test, Wilcoxon Signed Rank Test (= Mann-Whitney Test) Two loose ends Check?

POPULATION “Null Hypothesis” via… “Hypothesis Testing” Study Question: Has “Mean (i.e., average) Age at First Birth” of women in the U.S. changed since 2010 (25.4 yrs old)? x1x1 x4x4 x3x3 x2x2 x5x5 x 400 … etc… H 0 : pop mean age  = 25.4 (i.e., no change since 2010) “Statistical Inference” Present Day: Assume “Mean Age at First Birth” follows a normal distribution (i.e., “bell curve”) in the population. T-test Two loose ends HOWEVER Sample size n partially depends on the power of the test, i.e., the desired probability of correctly rejecting a false null hypothesis. HOWEVER……

Approximately 95% of the intervals from to contain , and approx 5% do not. Samples, size n ~ The Normal Distribution ~ Approximately 95% of the population values are contained between  – 2 σ and  + 2 σ. … etc… “population standard deviation” 95% 2.5% ≈ 2 σ Approximately 95% of the sample mean values are contained between and “population mean”

Approximately 95% of the intervals from to contain , and approx 5% do not. Samples, size n ~ The Normal Distribution ~ Approximately 95% of the population values are contained between  – 2 s and  + 2 s. … etc… “population standard deviation” 95% 2.5% ≈ 2 σ Approximately 95% of the sample mean values are contained between and “population mean” …IF n is large,  30 traditionally. But if n is small… … this “T-score" increases (from ≈ 2 to a max of for a 95% confidence level) as n decreases  larger margin of error  less power to reject.

If n is large, T-score ≈ 2. If n is small, T-score > 2.