Problems with Variance ©2005 Dr. B. C. Paul. Determining What To Do We have looked at techniques that depend on normally distributed data with variance.

Slides:



Advertisements
Similar presentations
ANALYSIS OF VARIANCE (ONE WAY)
Advertisements

Statistical Techniques I
Chapter 10 Estimation and Hypothesis Testing II: Independent and Paired Sample T-Test.
Introduction to Regression ©2005 Dr. B. C. Paul. Things Favoring ANOVA Analysis ANOVA tells you whether a factor is controlling a result It requires that.
Independent t -test Features: One Independent Variable Two Groups, or Levels of the Independent Variable Independent Samples (Between-Groups): the two.
© 2010 Pearson Prentice Hall. All rights reserved Least Squares Regression Models.
Matching level of measurement to statistical procedures
Tuesday, October 22 Interval estimation. Independent samples t-test for the difference between two means. Matched samples t-test.
Correlations and T-tests
Sample Size Determination In the Context of Hypothesis Testing
Independent Sample T-test Classical design used in psychology/medicine N subjects are randomly assigned to two groups (Control * Treatment). After treatment,
5-3 Inference on the Means of Two Populations, Variances Unknown
The t Tests Independent Samples.
SW388R6 Data Analysis and Computers I Slide 1 One-sample T-test of a Population Mean Confidence Intervals for a Population Mean.
Statistical Analysis. Purpose of Statistical Analysis Determines whether the results found in an experiment are meaningful. Answers the question: –Does.
Analysis of Variance. ANOVA Probably the most popular analysis in psychology Why? Ease of implementation Allows for analysis of several groups at once.
Psy B07 Chapter 1Slide 1 ANALYSIS OF VARIANCE. Psy B07 Chapter 1Slide 2 t-test refresher  In chapter 7 we talked about analyses that could be conducted.
HAWKES LEARNING SYSTEMS math courseware specialists Copyright © 2010 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Chapter 14 Analysis.
Unequal Variance and ANOVA ©2005 Dr. B. C. Paul. ANOVA Assumptions ANOVA assumes the populations sampled in each class are normally distributed Also assumes.
Statistical Analysis Statistical Analysis
Significance Tests …and their significance. Significance Tests Remember how a sampling distribution of means is created? Take a sample of size 500 from.
SW388R7 Data Analysis & Computers II Slide 1 Assumption of Homoscedasticity Homoscedasticity (aka homogeneity or uniformity of variance) Transformations.
T-test Mechanics. Z-score If we know the population mean and standard deviation, for any value of X we can compute a z-score Z-score tells us how far.
QNT 531 Advanced Problems in Statistics and Research Methods
Statistical Techniques I EXST7005 Conceptual Intro to ANOVA.
Proportions for the Binomial Distribution ©2005 Dr. B. C. Paul.
One Way ANOVA ©2005 Dr. B. C. Paul modified 2009 Note – The concepts presented in these slides are considered common knowledge to those familiar with statistics.
Two Way ANOVA ©2005 Dr. B. C. Paul. ANOVA Application ANOVA allows us to review data and determine whether a particular effect is changing our results.
Inferential Statistics 2 Maarten Buis January 11, 2006.
© Copyright McGraw-Hill CHAPTER 12 Analysis of Variance (ANOVA)
Between-Groups ANOVA Chapter 12. >When to use an F distribution Working with more than two samples >ANOVA Used with two or more nominal independent variables.
1 Chapter 13 Analysis of Variance. 2 Chapter Outline  An introduction to experimental design and analysis of variance  Analysis of Variance and the.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Regression Chapter 16. Regression >Builds on Correlation >The difference is a question of prediction versus relation Regression predicts, correlation.
Confidence intervals and hypothesis testing Petter Mostad
6/4/2016Slide 1 The one sample t-test compares two values for the population mean of a single variable. The two-sample t-test of population means (aka.
Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?
ANOVA: Analysis of Variance.
Chapter 14 – 1 Chapter 14: Analysis of Variance Understanding Analysis of Variance The Structure of Hypothesis Testing with ANOVA Decomposition of SST.
Copyright © 2010 Pearson Education, Inc. Slide
SW318 Social Work Statistics Slide 1 One-way Analysis of Variance  1. Satisfy level of measurement requirements  Dependent variable is interval (ordinal)
1 ANALYSIS OF VARIANCE (ANOVA) Heibatollah Baghi, and Mastee Badii.
Non-Parametric Statistics ©2005 Dr. B. C. Paul. The Normal Problem Techniques we have used so far relied mostly on underlying distribution to be normal.
Copyright © Cengage Learning. All rights reserved. 12 Analysis of Variance.
Copyright © 2010 Pearson Education, Inc. Warm Up- Good Morning! If all the values of a data set are the same, all of the following must equal zero except.
Quadratic Regression ©2005 Dr. B. C. Paul. Fitting Second Order Effects Can also use least square error formulation to fit an equation of the form Math.
Inferences Concerning Variances
Analysis of Variance STAT E-150 Statistical Methods.
© The McGraw-Hill Companies, Inc., Chapter 12 Analysis of Variance (ANOVA)
Copyright © Cengage Learning. All rights reserved. 12 Analysis of Variance.
Dr. C. Ertuna1 Hypothesis Testing 2 Samples (Chapter - 04/D)
HYPOTHESIS TESTING FOR DIFFERENCES BETWEEN MEANS AND BETWEEN PROPORTIONS.
Aron, Aron, & Coups, Statistics for the Behavioral and Social Sciences: A Brief Course (3e), © 2005 Prentice Hall Chapter 10 Introduction to the Analysis.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Class Seven Turn In: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 For Class Eight: Chapter 20: 18, 20, 24 Chapter 22: 34, 36 Read Chapters 23 &
Statistical hypothesis Statistical hypothesis is a method for testing a claim or hypothesis about a parameter in a papulation The statement H 0 is called.
The 2 nd to last topic this year!!.  ANOVA Testing is similar to a “two sample t- test except” that it compares more than two samples to one another.
Tests of hypothesis Contents: Tests of significance for small samples
Make Sure You Have Your Dependent Variable and Factor Selected
Chapter 4. Inference about Process Quality
Math 4030 – 10b Inferences Concerning Variances: Hypothesis Testing
Math 4030 – 10a Tests for Population Mean(s)
Independent samples t-test for the difference between two means.
Independent samples t-test for the difference between two means.
Hypothesis Tests for a Standard Deviation
Psych 231: Research Methods in Psychology
Chapter 10 Introduction to the Analysis of Variance
Chapter 26 Comparing Counts.
Chapter 10 – Part II Analysis of Variance
Statistical Inference for the Mean: t-test
Presentation transcript:

Problems with Variance ©2005 Dr. B. C. Paul

Determining What To Do We have looked at techniques that depend on normally distributed data with variance limited to some set of values. We have considered how to tell if a distribution is not normally distributed but not yet what to do  Besides hope for the best if the condition is border-line We have done many tests that assume that all samples have the same variance  T tests, ANOVA, to a lesser extent regression  We don’t yet know how to detect if there is a problem or what to do about it.

Variance Problems with T tests T tests are often used to decide if samples taken from two populations have a different average value. Example- Quincy Quality has a factory that makes boat widgets. Widgets that do not meet the quality control tests at the end have to be discarded and add cost to the operation. Quincy wants to know if his night and day shifts have the same rate of widget rejection so he looks at records of number of widgets rejected versus number produced on each assembly line.

Quincy Enters his Sample Data Quincy selects 20 days at random For his day and night shifts and Looks at the number of rejects per Thousand units manufactured. He enters the data into his trusty SPSS program as shown.

Quincy Does a T test to see if the means of the two shifts are equal Quincy goes to Analyze and clicks to get The pull down menu. He highlights compare means to get the Pop out side menu He highlights and clicks independent Samples T test.

He Sets Rejects as His Test Variable

He Needs to Use a Variable to Break His Data Up by Group Since he is concerned about shift he Enters shift as his grouping variable But the program still wants to know How he will use this variable to break Up his groups.

He Tells the Program Shift 1 and Shift 2 After clicking define groups the Define group menu pops up Quincy enters that group 1 has a Shift value of 1 (Day Shift) and Group 2 has a value of 2 (Night Shift) He will then click continue.

Quincy Checks His Options Quincy clicks options to bring up the Options menu. He has an option to determine what Kind of confidence interval he wants. The default is 95% which sounds good For now. Quincy Clicks continue And then OK.

Oh Hear Come the Results! The Day Shift averages 26.1 Rejects per thousand. The Night Shift averages 33. The samples exhibit a difference of 6.95 Rejects per thousand.

Interpreting The T statistic for this difference is Our confidence that one shift is in fact actually better than The other is 98.7%

Quincy’s Assumptions To Draw the conclusion Quincy must believe his rate of rejects is normally distributed and That the day to day (night to night?) differences in the number of rejects is the same for both samples  But is the variance really the same?

Looking Closer The F test for homogeneity of variances – ie the variances are The same for both shifts – is 25.3 The significance of that statistic is essentially 100% - ie there is no Chance in H_ _ _ that the day to day variance of the shifts is the Same.

How did we do that Thank goodness we didn’t have to (the computer did it for us) A sample variance estimated from samples of a normal distribution follows a Qui Squared distribution. To test variances we divide one variance by the other.  Two Qui Squared distributions – one dividing the other has an F distribution. If the F value is too strange we reject the null hypothesis that our variances are equal.

Ok We Reject it – Now What? The validity of our T test depended on homogeneity variance which we just lost  However Quincy suspects that his night shift might not be doing to well and wants to go kick some – well you get it The solution was put together by Brehens and Fisher.  When variance is not equal you get a slightly distorted T distribution  Rather than come up with a separate distribution table the distorted distribution will pretty well match a T distribution if you fudge the degrees of freedom For us the best thing is the computer does this degree of freedom fudging for us and just splits out the result.

So Does Quincy Get to Kick _ _ _ _ Without assuming homogeneity our T statistic stays the Same but the probability of getting the result by chance rises To 1.6%. 98.4% - Quincy is sure his night shift is messing up.