Comparison of Two Conditions for Continuous Measurements Lecture 6.

Slides:



Advertisements
Similar presentations
Section 9.3 Inferences About Two Means (Independent)
Advertisements

Introduction Comparing Two Means
The t-test, the paired t-test, and introduction to non-parametric tests July 8, 2004.
Copyright ©2011 Brooks/Cole, Cengage Learning Testing Hypotheses about Means Chapter 13.
Copyright ©2011 Brooks/Cole, Cengage Learning Testing Hypotheses about Means Chapter 13.
Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. More About Significance Tests Chapter 13.
Comparing Two Population Means The Two-Sample T-Test and T-Interval.
PSY 307 – Statistics for the Behavioral Sciences
SADC Course in Statistics Comparing Means from Independent Samples (Session 12)
Lecture 5 Outline – Tues., Jan. 27 Miscellanea from Lecture 4 Case Study Chapter 2.2 –Probability model for random sampling (see also chapter 1.4.1)
BCOR 1020 Business Statistics
Inferences About Process Quality
ESTIMATION AND HYPOTHESIS TESTING: TWO POPULATIONS
5-3 Inference on the Means of Two Populations, Variances Unknown
Hypothesis Testing Using The One-Sample t-Test
Hypothesis Testing: Two Sample Test for Means and Proportions
Review for Exam 2 Some important themes from Chapters 6-9 Chap. 6. Significance Tests Chap. 7: Comparing Two Groups Chap. 8: Contingency Tables (Categorical.
Statistical Analysis. Purpose of Statistical Analysis Determines whether the results found in an experiment are meaningful. Answers the question: –Does.
Chapter 9 Comparing Means
AP Statistics Section 13.1 A. Which of two popular drugs, Lipitor or Pravachol, helps lower bad cholesterol more? 4000 people with heart disease were.
Psy B07 Chapter 1Slide 1 ANALYSIS OF VARIANCE. Psy B07 Chapter 1Slide 2 t-test refresher  In chapter 7 we talked about analyses that could be conducted.
Week 9 Chapter 9 - Hypothesis Testing II: The Two-Sample Case.
Hypothesis testing – mean differences between populations
Ch 10 Comparing Two Proportions Target Goal: I can determine the significance of a two sample proportion. 10.1b h.w: pg 623: 15, 17, 21, 23.
Statistical Analysis Statistical Analysis
Copyright © 2010 Pearson Education, Inc. Chapter 22 Comparing Two Proportions.
Lesson Comparing Two Means.
Education 793 Class Notes T-tests 29 October 2003.
More About Significance Tests
Dependent Samples: Hypothesis Test For Hypothesis tests for dependent samples, we 1.list the pairs of data in 2 columns (or rows), 2.take the difference.
Comparing Two Population Means
1 CSI5388: Functional Elements of Statistics for Machine Learning Part I.
Comparing Two Proportions
Chapter 10 Comparing Two Means Target Goal: I can use two-sample t procedures to compare two means. 10.2a h.w: pg. 626: 29 – 32, pg. 652: 35, 37, 57.
Week 111 Power of the t-test - Example In a metropolitan area, the concentration of cadmium (Cd) in leaf lettuce was measured in 7 representative gardens.
CHAPTER 18: Inference about a Population Mean
Chapter 22: Comparing Two Proportions
The Practice of Statistics Third Edition Chapter 13: Comparing Two Population Parameters Copyright © 2008 by W. H. Freeman & Company Daniel S. Yates.
AP Statistics Section 13.1 A. Which of two popular drugs, Lipitor or Pravachol, helps lower bad cholesterol more? 4000 people with heart disease were.
7. Comparing Two Groups Goal: Use CI and/or significance test to compare means (quantitative variable) proportions (categorical variable) Group 1 Group.
MGT-491 QUANTITATIVE ANALYSIS AND RESEARCH FOR MANAGEMENT OSMAN BIN SAIF Session 26.
Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?
Nonparametric Statistics
Confidence Intervals Lecture 3. Confidence Intervals for the Population Mean (or percentage) For studies with large samples, “approximately 95% of the.
Copyright © 2010 Pearson Education, Inc. Chapter 22 Comparing Two Proportions.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 22 Comparing Two Proportions.
Lesson Comparing Two Means. Knowledge Objectives Describe the three conditions necessary for doing inference involving two population means. Clarify.
Ch11: Comparing 2 Samples 11.1: INTRO: This chapter deals with analyzing continuous measurements. Later, some experimental design ideas will be introduced.
Week111 The t distribution Suppose that a SRS of size n is drawn from a N(μ, σ) population. Then the one sample t statistic has a t distribution with n.
Testing Differences between Means, continued Statistics for Political Science Levin and Fox Chapter Seven.
Hypothesis Testing One-sample means and proportions Lecture 4.
to accompany Introduction to Business Statistics
Chapter 10 The t Test for Two Independent Samples
Comparison of 2 Population Means Goal: To compare 2 populations/treatments wrt a numeric outcome Sampling Design: Independent Samples (Parallel Groups)
More Contingency Tables & Paired Categorical Data Lecture 8.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 10 Comparing Two Groups Section 10.1 Categorical Response: Comparing Two Proportions.
1-1 Copyright © 2015, 2010, 2007 Pearson Education, Inc. Chapter 24, Slide 1 Chapter 25 Paired Samples and Blocks.
+ Unit 6: Comparing Two Populations or Groups Section 10.2 Comparing Two Means.
ENGR 610 Applied Statistics Fall Week 7 Marshall University CITE Jack Smith.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 11 Inference for Distributions of Categorical.
Chapter 7 Inference Concerning Populations (Numeric Responses)
Lecture 22 Dustin Lueker.  Similar to testing one proportion  Hypotheses are set up like two sample mean test ◦ H 0 :p 1 -p 2 =0  Same as H 0 : p 1.
Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n.
Statistics 22 Comparing Two Proportions. Comparisons between two percentages are much more common than questions about isolated percentages. And they.
Objectives (PSLS Chapter 18) Comparing two means (σ unknown)  Two-sample situations  t-distribution for two independent samples  Two-sample t test 
Lecture Slides Elementary Statistics Twelfth Edition
Hypothesis Testing Review
Chapter 8: Inference for Proportions
Elementary Statistics
Lesson Comparing Two Means.
Presentation transcript:

Comparison of Two Conditions for Continuous Measurements Lecture 6

Description of Problem Suppose that you are interested in the effect of a binary variable on a continuous variable Diesel exhaust vs. clean air → White Blood Cell count Special instruction vs. a control condition → test scores that measure students’ understanding of abstract ideas about risk

Two Variables of Interest The Predictor is the variable that is being used to “predict” the value of another variable. Also commonly referred to as the independent variable or the explanatory variable. For now, we assume this variable is binary or dichotomous, ie., a variable with two levels. The Response is the outcome variable in whose value we are interested. Also commonly referred to as the dependent variable. For now, we assume this variable is continuous. WARNING! The use of this terminology does not imply actual causation!

Two Study Designs Independent Groups Subjects at each level of the predictor are sampled independently of one another The levels of the predictor may be automatic (such as gender) or subjects may be randomly assigned to a “group” (Type of instruction) Paired Samples

Individual subjects are measured at both levels of the predictor. OR Subjects are paired such that one member of the pair is at one level of the predictor and the other member of the pair is at the other. The levels of the predictor may be automatic (such as before/after an exam or mother/daughter) or may be randomly assigned to each unit of the pair (special instruction/control) Pairs are often put together based on demographic information, such as age, gender, ethnicity, etc. or are natural pairings such as in the case of family linkages.

Hypotheses in Words H 0 : Predictor has no effect or H 0 : versus H A : Predictor has an effect (may specify which level is associated with a larger mean) or H A :

Hypotheses in Statistical Notation H 0 : Predictor has no effect/Group means are equal translates to H 0 : μ a = μ b or equivalently H 0 : μ a - μ b = 0 VERSUS H A : Predictor has an effect/Group means are unequal translates to H A : μ a ≠ μ b OR μ a > μ b OR μ a < μ b or equivalently H A : μ a - μ b ≠ 0 OR μ a - μ b > 0 OR μ a - μ b < 0

Example – Two Independent Groups “Risperidone improves disruptive behavior in children with low IQ. … Dr. Michael Aman from Ohio State University, Columbus, and colleagues randomly assigned 118 children … with the atypical antipsychotic at mean dose of 0.98 mg/kg daily for 6 weeks or placebo. Intention-to-treat analysis found that children receiving risperidone had a significant 47.3% reduction on the Conduct Problem subscale of the NCBRF compared with children receiving placebo (20.9%).”

Indep.Groups – Risperidone Example Let’s start the analysis by comparing the information presented at baseline to demonstrate that the two groups, into which children were randomized, did not have significantly different conduct problem scores at baseline. (See Table 1, Am J Psychiatry 2002;159:1337) To study the differences between groups, we are interested in the quantity μ a - μ b, where μ a is the mean of conduct problem scores for the placebo group and μ b is the mean for the risperidone group. The sample means are both approximately normally distributed and are independent of one another. Normality of the sampling distribution of the sample means follows by the CLT (n a =63, n b =55) Independence of the sample means follows from the fact that the groups contain subjects that are independent of one another.

Example – Paired Sample Description Each member of 11 heterosexual double income couples (in which both members were between the ages of 30 and 40 years) were questioned about the average amount of time that they spent reading the newspaper or watching the news on TV. Do women and men spend different amounts of time catching up on the news via the newspaper and TV?

Paired News Example To study the differences between groups (genders), we are interested in the quantity μ a - μ b, where μ a is the mean time for females and μ b is the mean for males. Are the sample means both approximately normally distributed? Only if the original population distributions are assumed normal. That is the times of all females in double income couples must approximately follow a normal distribution. The same is true for the time of all males. The sample means are approximately normally distributed, but are NOT independent of one another. The times spent catching up on the news by a man and a woman in the same couple would be expected be associated with each other.

Components needed for inference When the sampling distribution of the estimate is approximately normal, Confidence Interval is of the form Estimate ± (Critical value x Std.Error) Test Statistic is of the form ( Estimate - Null value) / Std.Error Hence, for each study design (independent groups or paired), we need to 1). 2). 3)..

Paired Sample #1: Estimate of μ a - μ b The Estimate of μ a - μ b is (Sample mean a – Sample mean b) For paired samples, this is equivalent to the sample mean of the pair differences, i.e., D = Y a – Y b Y a = measurement from member a of each pair Y b = measurement from member b of each pair If there are n pairs in the study, there are total of n observed D’s.

Paired Sample #2 cont.: Standard Error Since we can calculate the estimate of the difference of population means as the average of the differences (the D’s), we can calculate the standard error of the estimate as the standard error of an average. Hence, the standard error is given by s D /sqrt(n pairs ) where

Paired Differences Example – Statistics Group averages are and The difference of these averages is The average of the differences is also Standard Deviation of the 11 differences is The standard error of the average of the differences is 0.202/sqrt(11) = (The standard deviations for each of the groups are and ) Wife Husband Difference

Paired Differences Example – #3 What’s the Sampling Distribution? Estimate and SE are calculated the same as in the one-sample case. Therefore, to determine the sampling distribution, we need to consider the sample size or the population distribution. Here is a histogram of the differences.

#3 Sampling Distribution for Paired Samples If the sample size is large, then the sampling distribution of the averaged differences is approximately normal distribution (CLT). If the sample size is small and the distribution of differences approximately normal, then the sampling distribution is the t-distribution with n pairs -1 degrees of freedom. Confidence Intervals and p-values for Hypothesis tests for paired samples (when the outcome is continuous) are calculated the same way we have been doing these with one-group samples.

Independent Groups Sample #1: Estimate of μ a - μ b Just take the difference of the two averages! Average of Conduct problem scores at Baseline: Placebo Group: 34.5 Risperidone Group: 32.9 Difference: = 1.6

#2: Standard Error for Two Independent Groups FACT: The variance of the difference (or sum) of two independent normal random variables is the sum of the two variances.

#2, cont.: Std.Error, 2 Groups Recall: The Standard Error is the estimate of the standard deviation of the sampling distribution of the sample mean. The true standard error of the difference of the two sample means is How this is estimated in practice depends on whether σ a and σ b are assumed to be equal.

#2 cont.: Special Cases of Std.Error, 2 Groups Easiest case: σ a ≠ σ b The standard deviation is estimated by replacing σ a and σ b by their respective standard deviations in the equation give on the previous slide. Harder case: Assume σ a = σ b = σ The common value of the standard deviation, σ, is estimated by and replaces both σ a and σ b on the previous slide.

Two Group Example – Standard Deviations Are standard deviations for the two groups equal? The paper gives these values to be 6.9 and 7.6 for the placebo and Risperidone children respectively. Just eyeing them I can’t tell if the differences are due to random chance or not. We will try both tests (one for equal variances and one for unequal variances)

Cont. (unequal variances) If we assume unequal variances, then SE = sqrt(variance a /n a + variance b /n b ) = sqrt(6.9 2 / /55) = 1.344

Cont. (equal variances) If we assume equal variances, then first calculate the pooled estimate of the std. dev. And, SE=sqrt(SD pooled 2 /n a + SD pooled 2 /n b ) =sqrt( / /55) =1.335

The estimate of the difference of the means divided by its standard error Follows a normal distribution if the sample size is large enough Follows a t-distribution with n a +n b -2 degrees of freedom if with equal variances, the samples come from normal distributions Follows a mixture of t-distributions if the variances are unequal and the samples come from normal distributions #3: Sampling Distribution of the Estimate – 2 Independent Groups

Background: Sampling Distributions of the Estimates FACT: “Linear combinations” of normally distributed variables are also normally distributed. CONSEQUENCE: If the original measurements follow a normal distribution, then any estimate composed of these measurements that is a sum, a difference, an average, a difference of averages or an average of differences will also follow a normal distribution. Examples: Average for Single Group (if sample size is small and original measurements come from a normal distribution or, by CLT, if sample size is large) Independent Groups: Difference of averages Paired Samples: Average of differences

Choosing a Critical Values for CI’s and HT’s Paired Sample Large Sample: Gaussian Table Small Sample: t-distribution with (n pairs -1) d.f. Two Independent Groups Large Sample: Gaussian Table Small Sample: Population variances unequal – related to t- distribution Pop’n variances equal – t-dist’n with (n a +n b -2) d.f.

Paired Differences Example 95% Confidence Interval Assumptions for CI Independent, random sample of pairs, Differences have approximately normal pop’n distribution Confidence Interval (using t-distribution 10 d.f.) Avg Diff +/ SE / (0.061) (0.010, 0.282) Interpretation We are 95% confident that the difference in the time spent reading or watching the news between women and men is between and hours (with women spending more time). (between 0.6 and 16.9 minutes)

Paired Differences Example Hypothesis Test Study Hypothesis: Men and women spend different amounts of time reading/watching the news. 1. Assumptions: Independent, random sample of pairs, Differences have approximately normal pop’n distribution 2. Hypotheses: H 0 : μ a – μ b = 0 vs. H A : μ a – μ b ≠ 0 3. Test Statistic: t = (Avg.Diff-0)/SE = 0.146/0.061 = Interpretation:

Paired Differences HT, cont. 4. Critical Value: t-table, 2-sided 10 d.f.  P-value: From computer  Conclusion: With a p-value of 0.038, there is some evidence to suggest that, on average, men and women do spend different amounts of time either watching or reading the daily news.

Computer Output - Paired Paired t-test Data for wives and husbands are in two separate columns, with matched observations in the same row.  Analyze  Compare Means  Paired Samples T-test

Computer Output - Independent Independent Samples t-test Data for wives & husbands are in the same column; a second column indicates whether each observation is for the wife or husband.  Analyze  Compare Means  Independent Samples T-test

Paired Differences Example Hypothesis Test – Take 2 Study Hypothesis: Women spend more time than men. 1. Assumptions: Independent, random sample of pairs, Differences have approximately normal pop’n distribution 2. Hypotheses: H 0 : μ a – μ b = 0 vs. H A : μ a – μ b > 0 3. Test Statistic: t = (Avg.Diff-0)/SE = 0.146/0.061 = Interpretation:

Paired Differences HT- Take 2, cont. 4. P-value: ½ of computer’s two-sided p- value  0.038/2 = Conclusion: There is a moderate amount of evidence to suggest that women do spend more time on average than men reading or watching the daily news.

Two Group Example, CI Assumptions for Confidence Interval Large Sample Each group is an independent, random sample and the groups are chosen independently of one another Confidence Interval Equal SD: 1.6 +/ (1.335) = (-1.017, 4.217) Unequal SD: 1.6 +/ ( 1.344) = (-1.034, 4.234) Interpretation (Equal) We are 95% confident that at baseline the two groups of children (to be treated with placebo or risperidone) have Conduct Problem scores that are on average between –1.017 and points different. At the 0.05 level the two populations are not significantly different.

Two Group Example, HT Study Hypothesis: The two groups are different. 1. Assumptions: Large Sample Each group is an independent, random sample and the groups are chosen independently of one another 2. Hypotheses H 0 : μ a – μ b = 0 vs. H A : μ a – μ b ≠ 0 3. Test Statistic Equal SD: z = 1.6/1.335 = Unequal SD: z = 1.6/1.344 = 1.190

Two Group Example, HT, cont. 4. P-value Equal SD: t, with =116 d.f. approximately z  p-value= Unequal SD: Due to large sample size, the test statistic will follow an approximately normal (due to CLT)  p-value slightly larger than Conclusion At any reasonable significance level (say 0.05), we fail to reject the null hypothesis. Thus we would conclude that there is not a significant difference between the two groups with respect to their conduct problem score. There does not seem to be any bias with respect to their conduct problem score in the way that children were randomized to the two treatments.

Choosing a Study Design When I’m planning a study design, should I plan for paired (matched) samples or a study with two independent groups? Considerations: 1.Standard error for difference of means is generally smaller for paired samples resulting in a.Shorter confidence intervals b.More powerful tests 2.Cost 3.Feasibility Why is Std error smaller for paired samples?

Non-parametric statistics for small, non-normal samples Paired Data The same as for univariate data, except perform the test using the differences rather than the raw data. Two Independent Groups Mann-Whitney Rank Sum Test (Ch. 24) Procedure is similar to the Rank sum test, except that instead of dividing observations according to whether they are positive or negative, we divide observations according to group membership. Assumptions include (1) independent, random samples, (2) independently selected groups, and (3) the shape and spread of the two distributions are the same

Comparison of Two Proportions with Large Samples Lecture 6b

Confidence Interval ASSUMPTIONS Large sample size Independent, randomly selected sample Two groups are independent Estimate π 1 -π 2 95% CI: Estimate +/ SE o Estimate = p 1 -p 2 o SE 2 =p 1 (1-p 1 )/n 1 + p 2 (1-p 2 )/n 2

Cardiovascular Disease Risk Factors Example The Journal of Women’s Health (Vol.7, pp ) reports on the prevalence of risk factors for cardiovascular disease among women. Two independent samples of women were examined in 1992 and In 1992, it was found that 27.7 percent of the 36,836 women sampled had high cholesterol. In 1995, 29.2% of the 44,745 women sampled had high cholesterol. Is there a significant change in the percent of women with high cholesterol?

CVD Risk Factors: CI Estimate= =0.015 SE 2 =.292*.708/44, *.723/36,836 = SE = /- 1.96* = (0.0089, ) I am 95% confident that the risk of high cholesterol is between 0.89 and 2.12 percentage points higher in 1995 than it was in (This is not a relative change!) It looks like this change is statistically significant! Is it practically significant?

Homework Textbook Reading Chapters 7, 23, 24, 25 Homework Problems Available at