Breaking Statistical Rules: How bad is it really? Presented by Sio F. Kong Joint work with: Janet Locke, Samson Amede Advisor: Dr. C. K. Chauhan.


Background
– We make inferences about populations based on information from random samples.
– The process is called hypothesis testing.
– It is used in many areas, such as biology, psychology, and business.

Examples
– Mean heart rate: white newborns vs. African American newborns.
– Mean daily intake of saturated fat: a vegetarian population vs. 15 grams.
– Mean SAT score: a particular county vs. the national average.

Notation
– Population means: μ₁, μ₂ (unknown most of the time)
– Sample means: x̄₁, x̄₂
– Population standard deviations: σ₁, σ₂ (unknown most of the time)
– Sample standard deviations: S₁, S₂
– Pooled standard deviation: Sₚ
– Sample sizes: n₁, n₂

Two-Sample Hypothesis Testing
Example:
– Null hypothesis H₀: μ₁ − μ₂ = 0 (the two means are equal)
– Research hypothesis H₁: μ₁ − μ₂ ≠ 0 (the two means are not equal)
If x̄₁ − x̄₂ is significantly far from 0, reject the null hypothesis; that is, conclude that the two means are NOT equal. The corresponding test statistic has a t-distribution.
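The formula for the test statistic did not survive in the transcript. Given the pooled standard deviation in the notation and the equal-variance condition, it is presumably the standard pooled two-sample t statistic, which can be written as follows (an assumption based on the surrounding slides, not a formula recovered from the deck):

```latex
t = \frac{\bar{x}_1 - \bar{x}_2}{S_p \sqrt{\dfrac{1}{n_1} + \dfrac{1}{n_2}}},
\qquad
S_p^2 = \frac{(n_1 - 1)\,S_1^2 + (n_2 - 1)\,S_2^2}{n_1 + n_2 - 2}
```

Under the conditions the deck lists, this statistic follows a t-distribution with n₁ + n₂ − 2 degrees of freedom when H₀ is true.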

Important
This test statistic has a t-distribution only under certain conditions:
– Samples are drawn randomly.
– If the samples are small, the populations need to be normally distributed.
– The two populations have equal variances (σ₁ = σ₂).

Objective
To investigate how violating the equal-variance assumption affects the testing procedure. Our textbook suggests that the effect of the violation is minimal when the sample sizes are equal.

Measuring a GOOD test
Two types of errors:
– Type 1 error: rejecting a true null hypothesis
– Type 2 error: failing to reject a false null hypothesis
– α = the probability of a Type 1 error; it is selected in advance, usually 5%.
– Power = 1 − Pr(Type 2 error); it can be calculated under various alternatives.
A test is good if the power is high under various alternatives while α stays at the selected level.

In this research…
– 1000 tests are generated by simulation in each situation.
– Simulation studies are done to estimate:
  – α: the probability of rejecting a true null hypothesis
  – Power: the probability of rejecting a false null hypothesis
– Both are estimated under various alternatives when the equal-variance assumption is violated.
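A simulation of this kind can be sketched in a few lines of Python. This is a minimal illustration, not the authors' code: the function names, the seeds, and the use of normal populations are assumptions, and the critical value 2.101 is the two-sided 5% point of the t-distribution with 18 degrees of freedom, so it applies only when n₁ + n₂ = 20.

```python
import math
import random
import statistics

# Two-sided 5% critical value of t with 18 degrees of freedom (n1 + n2 = 20).
T_CRIT_DF18 = 2.101


def pooled_t(x, y):
    """Pooled two-sample t statistic (assumes equal population variances)."""
    n1, n2 = len(x), len(y)
    sp2 = ((n1 - 1) * statistics.variance(x)
           + (n2 - 1) * statistics.variance(y)) / (n1 + n2 - 2)
    return (statistics.mean(x) - statistics.mean(y)) / math.sqrt(sp2 * (1 / n1 + 1 / n2))


def rejection_rate(mu1, mu2, sigma1, sigma2, n1=10, n2=10, reps=1000, seed=0):
    """Fraction of simulated tests that reject H0: mu1 - mu2 = 0.

    With mu1 == mu2 this estimates alpha; with mu1 != mu2 it estimates power.
    """
    rng = random.Random(seed)  # hypothetical seed, for reproducibility only
    rejections = 0
    for _ in range(reps):
        x = [rng.gauss(mu1, sigma1) for _ in range(n1)]
        y = [rng.gauss(mu2, sigma2) for _ in range(n2)]
        if abs(pooled_t(x, y)) > T_CRIT_DF18:
            rejections += 1
    return rejections / reps


if __name__ == "__main__":
    for sigma2 in (3, 4, 5, 10):
        alpha = rejection_rate(10, 10, 2, sigma2)          # H0 true
        power = rejection_rate(10, 14, 2, sigma2, seed=1)  # H0 false
        print(f"sigma1=2, sigma2={sigma2}: alpha={alpha:.3f}, power={power:.3f}")
```

Because each estimate rests on 1000 random tests, repeated runs with different seeds will vary by roughly a percentage point around the true rejection rate.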

Effect when σ₁ ≠ σ₂ (n₁ = n₂ = 10)
α is estimated with μ₁ = 10, μ₂ = 10; power is estimated with μ₁ = 10, μ₂ = 14.

σ₁, σ₂     α       Power
2, 3       4.4%    89.8%
2, 4       5.4%    75.0%
2, 5       6.0%    60.1%
2, 10      8.0%    24.2%

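Reject H₀ if |t| exceeds the two-sided critical value of the t-distribution with n₁ + n₂ − 2 degrees of freedom at level α.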
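A single test decision can be sketched as follows, assuming the usual two-sided rule of rejecting when |t| exceeds a critical value. The data values and the function name are hypothetical, and the critical value 2.101 (t with 18 degrees of freedom at the 5% level) is supplied by hand because the Python standard library has no t-distribution.

```python
import math
import statistics


def pooled_t_test(x, y, t_crit):
    """Return (t, reject) for a two-sided pooled two-sample t-test."""
    n1, n2 = len(x), len(y)
    sp2 = ((n1 - 1) * statistics.variance(x)
           + (n2 - 1) * statistics.variance(y)) / (n1 + n2 - 2)
    t = (statistics.mean(x) - statistics.mean(y)) / math.sqrt(sp2 * (1 / n1 + 1 / n2))
    return t, abs(t) > t_crit


# Hypothetical samples, 10 observations each, so df = 18.
group1 = [9.1, 10.4, 11.2, 8.7, 10.9, 9.8, 12.0, 10.1, 9.5, 10.3]
group2 = [13.2, 14.8, 12.9, 15.1, 13.7, 14.4, 12.5, 15.6, 13.9, 14.0]

t, reject = pooled_t_test(group1, group2, t_crit=2.101)
print(f"t = {t:.2f}, reject H0: {reject}")
```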

Result

Condition not violated: σ₁ = σ₂ (in this example, σ₁ = σ₂ = 2)

n₁, n₂     α       Power
10, 10     5.2%    98.5%
12, 8      5.2%    98.9%
13, 7      5.0%    98.6%
14, 6      5.0%    97.7%

Conclusion: When σ₁ = σ₂, it does not matter whether n₁ = n₂, since equal sample sizes are not a requirement.

Condition violated: σ₁ ≠ σ₂ (in this example, σ₁ = 2 and σ₂ = 5)

n₁, n₂           α            Power
10, 10           6.5%         60.1%
12, 8            9.6%         65.2%
13, [missing]    [missing]    66.6%
14, [missing]    [missing]    64.2%

Conclusion: When σ₁ ≠ σ₂ and n₁ ≠ n₂, the effect on α is even larger.

Conclusion
– As the difference between σ₁ and σ₂ gets larger, α goes up and power goes down.
Other interesting observations:
– If the smaller sample has the larger standard deviation, α goes up.
– If the larger sample has the larger standard deviation, α goes down.
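Part of the reason for the two observations about unequal sample sizes can be seen by comparing the true variance of x̄₁ − x̄₂ with the value the pooled formula estimates on average. The sketch below is an illustration added here, not part of the original study. The function names are hypothetical, and the (8, 12) configuration is an illustrative case that does not appear in the deck's tables.

```python
def true_variance(sigma1, sigma2, n1, n2):
    """Actual variance of the difference in sample means."""
    return sigma1 ** 2 / n1 + sigma2 ** 2 / n2


def expected_pooled_variance(sigma1, sigma2, n1, n2):
    """Average value of the pooled estimate Sp^2 * (1/n1 + 1/n2)."""
    sp2 = ((n1 - 1) * sigma1 ** 2 + (n2 - 1) * sigma2 ** 2) / (n1 + n2 - 2)
    return sp2 * (1 / n1 + 1 / n2)


# sigma1 = 2 belongs to the sample of size n1, sigma2 = 5 to the sample of size n2.
for n1, n2 in [(10, 10), (12, 8), (8, 12)]:
    true_var = true_variance(2, 5, n1, n2)
    pooled_var = expected_pooled_variance(2, 5, n1, n2)
    print(f"n1={n1}, n2={n2}: true={true_var:.3f}, "
          f"pooled={pooled_var:.3f}, ratio={pooled_var / true_var:.3f}")
```

When the smaller sample has the larger standard deviation (12, 8), the pooled formula underestimates the variance on average, so |t| tends to be too large and H₀ is rejected too often. When the larger sample has the larger standard deviation (8, 12), the pooled formula overestimates it and the test rejects less often than the nominal rate.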

Note
These conclusions are based only on what this simulation study has shown. With different parameters and different alternatives, the results may differ.

Thank You!