Copyright (c) Bani Mallick1 STAT 651 Lecture # 11.

Slides:



Advertisements
Similar presentations
BPS - 5th Ed. Chapter 241 One-Way Analysis of Variance: Comparing Several Means.
Advertisements

CHAPTER 2 Building Empirical Model. Basic Statistical Concepts Consider this situation: The tension bond strength of portland cement mortar is an important.
Lecture 6 Outline – Thur. Jan. 29
Chapter 10 Section 2 Hypothesis Tests for a Population Mean
Confidence Interval and Hypothesis Testing for:
Comparing Two Population Means The Two-Sample T-Test and T-Interval.
Copyright (c) Bani Mallick1 Stat 651 Lecture 5. Copyright (c) Bani Mallick2 Topics in Lecture #5 Confidence intervals for a population mean  when the.
Copyright (c) Bani K. Mallick1 STAT 651 Lecture #17.
Copyright (c) Bani Mallick1 Lecture 2 Stat 651. Copyright (c) Bani Mallick2 Topics in Lecture #2 Population and sample parameters More on populations.
Copyright (c) Bani K. Mallick1 STAT 651 Lecture #18.
Copyright (c) Bani K. Mallick1 STAT 651 Lecture 9.
Copyright (c) Bani Mallick1 STAT 651 Lecture 7. Copyright (c) Bani Mallick2 Topics in Lecture #7 Sample size for fixed power Never, ever, accept a null.
SADC Course in Statistics Comparing Means from Independent Samples (Session 12)
Copyright (c) Bani Mallick1 STAT 651 Lecture 10. Copyright (c) Bani Mallick2 Topics in Lecture #10 Comparing two population means using rank tests Comparing.
Copyright (c) Bani K. Mallick1 STAT 651 Lecture #16.
1 Hypothesis Testing In this section I want to review a few things and then introduce hypothesis testing.
Copyright (c) Bani Mallick1 Lecture 4 Stat 651. Copyright (c) Bani Mallick2 Topics in Lecture #4 Probability The bell-shaped (normal) curve Normal probability.
Lecture 9: One Way ANOVA Between Subjects
Copyright (c) Bani K. Mallick1 STAT 651 Lecture #20.
Copyright (c) Bani K. Mallick1 STAT 651 Lecture #15.
Copyright (c) Bani K. Mallick1 STAT 651 Lecture #13.
Chapter 2 Simple Comparative Experiments
Chapter 11: Inference for Distributions
Inferences About Process Quality
Copyright (c) Bani K. Mallick1 STAT 651 Lecture # 12.
Independent Sample T-test Classical design used in psychology/medicine N subjects are randomly assigned to two groups (Control * Treatment). After treatment,
Copyright (c) Bani K. mallick1 STAT 651 Lecture #14.
5-3 Inference on the Means of Two Populations, Variances Unknown
Copyright (c)Bani K. Mallick1 STAT 651 Lecture #21.
Power and Sample Size IF IF the null hypothesis H 0 : μ = μ 0 is true, then we should expect a random sample mean to lie in its “acceptance region” with.
Chapter 12: Analysis of Variance
Chapter 24: Comparing Means.
Hypothesis testing – mean differences between populations
Confidence Intervals and Hypothesis Testing
Section #4 October 30 th Old: Review the Midterm & old concepts 1.New: Case II t-Tests (Chapter 11)
Lesson Comparing Two Means.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 11 Section 2 – Slide 1 of 25 Chapter 11 Section 2 Inference about Two Means: Independent.
More About Significance Tests
1 Design of Engineering Experiments Part 2 – Basic Statistical Concepts Simple comparative experiments –The hypothesis testing framework –The two-sample.
Comparing Two Population Means
Comparing two sample means Dr David Field. Comparing two samples Researchers often begin with a hypothesis that two sample means will be different from.
Business Statistics for Managerial Decision Comparing two Population Means.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
1 10 Statistical Inference for Two Samples 10-1 Inference on the Difference in Means of Two Normal Distributions, Variances Known Hypothesis tests.
+ Chapter 12: More About Regression Section 12.1 Inference for Linear Regression.
1 Chapter 10: Introduction to Inference. 2 Inference Inference is the statistical process by which we use information collected from a sample to infer.
Statistics - methodology for collecting, analyzing, interpreting and drawing conclusions from collected data Anastasia Kadina GM presentation 6/15/2015.
Confidence intervals and hypothesis testing Petter Mostad
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 13 Multiple Regression Section 13.3 Using Multiple Regression to Make Inferences.
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
Introduction to the Practice of Statistics Fifth Edition Chapter 6: Introduction to Inference Copyright © 2005 by W. H. Freeman and Company David S. Moore.
Lesson Comparing Two Means. Knowledge Objectives Describe the three conditions necessary for doing inference involving two population means. Clarify.
Fall 2002Biostat Statistical Inference - Confidence Intervals General (1 -  ) Confidence Intervals: a random interval that will include a fixed.
Review - Confidence Interval Most variables used in social science research (e.g., age, officer cynicism) are normally distributed, meaning that their.
Chapter 10 The t Test for Two Independent Samples
Copyright (c) Bani K. Mallick1 STAT 651 Lecture 6.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 11 Section 1 – Slide 1 of 26 Chapter 11 Section 1 Inference about Two Means: Dependent Samples.
Hypothesis test flow chart frequency data Measurement scale number of variables 1 basic χ 2 test (19.5) Table I χ 2 test for independence (19.9) Table.
Handout Seven: Independent-Samples t Test Instructor: Dr. Amery Wu
 Assumptions are an essential part of statistics and the process of building and testing models.  There are many different assumptions across the range.
Copyright (c) Bani Mallick1 STAT 651 Lecture 8. Copyright (c) Bani Mallick2 Topics in Lecture #8 Sign test for paired comparisons Wilcoxon signed rank.
Synthesis and Review 2/20/12 Hypothesis Tests: the big picture Randomization distributions Connecting intervals and tests Review of major topics Open Q+A.
1-1 Copyright © 2015, 2010, 2007 Pearson Education, Inc. Chapter 23, Slide 1 Chapter 24 Comparing Means.
Two-Sample-Means-1 Two Independent Populations (Chapter 6) Develop a confidence interval for the difference in means between two independent normal populations.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Chapter 10: The t Test For Two Independent Samples.
Two-Sample Hypothesis Testing
CHAPTER 29: Multiple Regression*
Two Independent Samples
Presentation transcript:

Copyright (c) Bani Mallick1 STAT 651 Lecture # 11

Copyright (c) Bani Mallick2 Topics in Lecture #11 Comparing two population means Review to date

Copyright (c) Bani Mallick3 Book Sections Covered in Lecture #11 All previous chapters

Copyright (c) Bani Mallick4 Lecture 10 Review: Robust Inference via Rank Tests Because sample means and sd’s are sensitive to outliers, so too are comparisons of populations based on them Rank tests form a robust alternative, that can be used to check the results of t-statistic inferences You are looking for major discrepancies, and then trying to explain them

Copyright (c) Bani Mallick5 Lecture 10 Review: Robust Inference via Rank Tests Typically called the Wilcoxon rank sum test The algorithm is to assign ranks to each observation in the pooled data set Then apply a t-test to these ranks Robust because ranks can never get wild

Copyright (c) Bani Mallick6 Lecture 10 Review: Robust Inference via Rank Tests The rank tests give the same answer no matter whether you take the raw data, their logarithms or their square roots. If you have data (raw or transformed) that pass q-q plots tests, then Wilcoxon and t-test should have much the same p-values In this case, you can use the latter to get CI’s

Copyright (c) Bani Mallick7 Lecture 10 Review: Inference for Equality of Variances SPSS uses what is called Levene’s test From the SPSS Help file (slightly edited) Levene Test For each case, it computes the absolute difference between the value of that case and its cell mean and performs a t-test on those absolute differences.

Copyright (c) Bani Mallick8 Inference When the Variances Are Not Equal Generally, you should try to find a scale (log, square root) for which the variances are approximately equal. If you cannot, then a small change is needed in the confidence intervals and test statistics.

Copyright (c) Bani Mallick9 The Confidence Interval With Unequal Variances The CI for is given by

Copyright (c) Bani Mallick10 The t-test Statistic With Unequal Variances The t-statistic is defined by

Copyright (c) Bani Mallick11 The Aortic Stenosis Data: A Review Let’s remember that the aortic stenosis data is comparing healthy with sick kids Aortic Valve Area (AVA) and Body Surface Area (BSA) are noninvasive measures The idea is to see if we can use these measures to understand whether a kid is stenotic, without having to do an invasive test Or at least to be able to narrow down those who need an invasive test

Copyright (c) Bani Mallick12 The Aortic Stenosis Data: A Review We will study the variable log(1 + AVA/BSA) I’ll try to review much of what we have done to this point by using these data There was a kids with an enormous AVA: I’ll delete this kid from all analyses First comes the relative frequency histogram

Copyright (c) Bani Mallick13 The Aortic Stenosis Data: A Review First comes the relative frequency histogram The height of each bar is the % of the sample which lies in the interval for that bar

Copyright (c) Bani Mallick14 The Aortic Stenosis Data: A Review: what % have X > 1? Log(1 + (AVA to BSA Ratio)) 0% 5% 10% 15% 20% Percent Healthy Stenoti Log(1 + (AVA to BSA Ratio))

Copyright (c) Bani Mallick15 The Aortic Stenosis Data: A Review Next comes the boxplot to Compare medians Compare variability Screen for massive outliers

Copyright (c) Bani Mallick16 The Aortic Stenosis Data: A Review: read off median, IQR, outliers

Copyright (c) Bani Mallick17 The Aortic Stenosis Data: A Review Next comes the q-q plot to check for approximate bell-shape Why does it matter that the data are approximately bell-shaped? First: statistical inferences such as confidence intervals are most accurate and most powerful Second: Probability calculations are most accurate

Copyright (c) Bani Mallick18 The Aortic Stenosis Data: A Review For the healthy kids: what do you think?

Copyright (c) Bani Mallick19 The Aortic Stenosis Data: A Review For the stenotic kids: what do you think?

Copyright (c) Bani Mallick20 The Aortic Stenosis Data: A Review Next we do some simple summary statistics Healthy: n = 70, mean = 0.84, median = 0.83, sd = 0.22, iqr = 0.31, se (of mean) = Stenotic: n = 55, mean = 0.47, median = 0.46, sd = 0.17, iqr = 0.22, se (of mean) = What does the IQR mean?

Copyright (c) Bani Mallick21 The Aortic Stenosis Data: A Review Healthy: n = 70, mean = 0.84, median = 0.83, sd = 0.22, iqr = 0.31, se (of mean) = Degrees of freedom = 70 – 1 = 69 95% CI for population mean (from SPSS) = 0.79 to 0.89 Interpret this!

Copyright (c) Bani Mallick22 The Aortic Stenosis Data: A Review Healthy: n = 70, mean = 0.84, median = 0.83, sd = 0.22, iqr = 0.31, se (of mean) = Estimate the probability that a healthy child has a log(1+ASA/BSA) less that 0.46 Pr(X < 0.46): z-score = (0.46 – 0.84)/0.22 = Table 1: Pr(X < 0.46) ~ Interpret this!

Copyright (c) Bani Mallick23 The Aortic Stenosis Data: A Review Note: for computing confidence intervals for the population mean I use the standard error In computing probabilities about the population I use the standard deviation Which one is affected by the sample size?

Copyright (c) Bani Mallick24 The Aortic Stenosis Data: A Review Next I will compare the population means We have concluded that the data are fairly (although not exactly) normally distributed We have also concluded that the variabilities are not too awfully different, although the stenotic kids appear to be less variable. So, we will try out the t-test type inference

Copyright (c) Bani Mallick25 The Aortic Stenosis Data: A Review SPSS computes a confidence interval for the difference in the population means We conclude the population means are different if the confidence interval does not include zero

Copyright (c) Bani Mallick26 The Aortic Stenosis Data: A Review SPSS

Copyright (c) Bani Mallick27 The Aortic Stenosis Data: A Review SPSS

Copyright (c) Bani Mallick28 The Aortic Stenosis Data: A Review The 95% confidence interval is from 0.30 to 0.45 Interpret this!

Copyright (c) Bani Mallick29 The Aortic Stenosis Data: A Review The p-value is Interpret this! What would have happened if I had done a 99% CI?

Copyright (c) Bani Mallick30 The Aortic Stenosis Data: A Review The p-value is Interpret this! What would have happened if I had done a 99% CI? It would not have covered zero, and I would have rejected the null hypothesis that the two population means are the same.

Copyright (c) Bani Mallick31 The Aortic Stenosis Data: A Review Outliers can really mess up an analysis We use the Wilcoxon rank sum test for this There were no massive outliers, the data were nearly normal, so we expect to get just about the same p-value It is 0.000, just as for the CI analysis

Copyright (c) Bani Mallick32 The Aortic Stenosis Data: A Review Next we will use Levene’s test to compare variability SPSS has its version of Levene’s test, which is a t-test on the absolute differences with the sample mean: the p-value for this is 0.07 Weak evidence of differences in variability

Copyright (c) Bani Mallick33 ANalysis Of VAriance We now turn to making inferences when there are 3 or more populations This is classically called ANOVA It is somewhat more formula dense than what we have been used to. Tests for normality are also somewhat more complex

Copyright (c) Bani Mallick34 ANOVA The Analysis of Variance is often known as ANOVA We are going to consider its simplest form, namely comparing 3 or more populations. If there are two populations, we have covered this in the first part of the course Confidence intervals for the difference in two population means Wilcoxon rank sum test.

Copyright (c) Bani Mallick35 ANOVA Suppose we form three populations on the basis of body mass index (BMI): BMI 28 This forms 3 populations We want to know whether the three populations have the same mean caloric intake, or if their food composition differs.

Copyright (c) Bani Mallick36 ANOVA Concho water snake data for males have four separate subpopulations, depending on the year class. Are there differences in snout-to-vent lengths among these populations?

Copyright (c) Bani Mallick37 ANOVA As in all analyses, we will combine graphical analyses, summary statistics and formal hypothesis tests and confidence intervals to for a picture of what the data are telling us

Copyright (c) Bani Mallick38 ANOVA There is considerable controversy as to how to compare 3 or more populations. One possibility is to compare them 2 at a time In the body mass example, there are 3 such comparisons Low BMI versus Middle BMI Low BMI versis High BMI Middle BMI versus High BMI

Copyright (c) Bani Mallick39 ANOVA In the Concho water snake example there are 6 such comparisons. 1 year olds versus 2 year olds 1 year olds versus 3 year olds 1 year olds versus 4 year olds 2 year olds versus 3 year olds 2 year olds versus 4 year olds 3 year olds versus 4 year olds

Copyright (c) Bani Mallick40 ANOVA In general, if there are t populations, there are t(t-1)/2 two-sample comparisons. Special Note: t is a symbol used in the book to denote the number of populations

Copyright (c) Bani Mallick41 ANOVA In general, if there are t populations, there are t(t-1)/2 two-sample comparisons. The controversy revolves around the concept of Type I error Specifically, if we do 6 different 95% confidence intervals, what is the probability that one or more of them do not include the true mean? Certainly, it is higher than 5%!

Copyright (c) Bani Mallick42 ANOVA If you do lots of 95% confidence intervals, you’d expect by chance that about 5% will be wrong Thus, if you do 20 confidence intervals, you expect 1 = 20 x 5% will not include the true population parameter This is a fact of life

Copyright (c) Bani Mallick43 ANOVA If you do lots of 95% confidence intervals, you’d expect by chance that about will be wrong One school of thought is to stick to a few major hypotheses (2-4 say), do 95% CI on them, and label anything else as exploratory, with possibly inflated Type I errors I am of this school

Copyright (c) Bani Mallick44 ANOVA If you do lots of 95% confidence intervals, you’d expect by chance that about 5% will be wrong Another school thinks that you should control the chance of making any errors at all to be 5% I am NOT of this school. Why not: am I just crazy?

Copyright (c) Bani Mallick45 ANOVA Another school thinks that you should control the chance of making any errors at all to be 5% I am NOT of this school. I worry about Type II error (power). To be 95% confident that every single one of my conclusions is correct, I will not have much power to detect meaningful changes or differences.