Sociology 601 Class 10: October 1, 2009

7.3: Small sample comparisons for two independent groups.
o Difference between two small sample means
o Difference between two small sample proportions
7.4: Comparisons for two dependent groups.
o Test for dependent means
o Advantages/disadvantages of dependent groups
o McNemar's test for dependent proportions

7.4 Dependent samples
Dependent samples occur when each observation in the first sample has something in common with one observation in the second sample.
o Also called matched pairs data.
o Also called paired difference data.
o Also called a randomized block design.
o Repeated measurement data fall into this category.
In a data set, dependent samples often appear as two variables for each respondent.

Dependent samples: an example
Suppose that a researcher in geography had surveyed a small sample of undergraduates in May 2001 and collected answers to a series of questions on world geography.
In May 2002, that researcher decided to retest the students to see whether knowledge of world geography increased after the events of 9/11/2001.

Data set for geography example
Scores for May 2001 sample:
o (5,2,8,3,6,4,7) (n = 7, Y bar 1 = 5, s.d. = 2.16)
Scores for May 2002 sample:
o (6,4,9,_,10,5,8) (n = 6, Y bar 2 = 7, s.d. = 2.37)
Each score in 2001 matches the corresponding score in 2002. The fourth student, who scored a 3 in 2001, refused to participate in May 2002.
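The summary figures on this slide can be checked with a few lines of Python. This is only a sketch using the standard library; the variable names are made up for illustration, and the missing fourth score for 2002 is simply left out of the second list.

```python
from statistics import mean, stdev

# Scores as listed on the slide; the fourth student has no 2002 score.
scores_2001 = [5, 2, 8, 3, 6, 4, 7]
scores_2002 = [6, 4, 9, 10, 5, 8]

# (n, sample mean, sample standard deviation with n - 1 in the denominator)
summary_2001 = (len(scores_2001), mean(scores_2001), round(stdev(scores_2001), 2))
summary_2002 = (len(scores_2002), mean(scores_2002), round(stdev(scores_2002), 2))

print("2001:", summary_2001)
print("2002:", summary_2002)
```

Note that these are the summaries for each year taken separately; the paired analysis that follows uses only the six students with scores in both years.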

A comparison for dependent samples
Enter the scores as separate variables, matched by respondent:
clear
input id scor2001 scor2002
1 5 6
2 2 4
3 8 9
4 3 .
5 6 10
6 4 5
7 7 8
end

A comparison for dependent samples
The t-test for dependent samples would look like this:
. ttest scor2001=scor2002
Paired t test
Variable | Obs Mean Std. Err. Std. Dev. [95% Conf. Interval]
scor2001 |
scor2002 |
diff |
Ho: mean(scor2001 - scor2002) = mean(diff) = 0
Ha: mean(diff) < 0    Ha: mean(diff) != 0    Ha: mean(diff) > 0
t =    t =    t =
P < t =    P > |t| =    P > t =

Significance test for paired differences
We give a random sample of seven UM students a set of world geography questions in May 2001, then again in May 2002. We obtain matched scores for six of the students.
D bar = 1.67, s.d. = 1.211, n = 6
Decide whether test scores were different in 2002 than in 2001.

Significance test for paired differences
Assumptions:
o We are working with a random sample of UM students.
o The difference is measured as an interval-scale variable.
o Difference scores are normally distributed in the population, or the number of pairs is 30 or more.
Hypothesis:
o There is no average difference between a student's score in 2002 and the student's score in 2001.
o H_0: μ_D = 0

Significance test with dependent samples
Test statistic: The test statistic for μ_D (for n matched pairs) is:
t = D bar / (s_D / sqrt(n)) = 1.67 / (1.211 / √6) = 3.371
d.f. = n – 1 = 6 – 1 = 5
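As a cross-check on the arithmetic, the paired t statistic can be recomputed from the six matched pairs. The sketch below assumes the pairs are the slide's 2001 and 2002 scores with the fourth student dropped; the names `pairs`, `diffs`, and `t_stat` are illustrative only.

```python
from math import sqrt
from statistics import mean, stdev

# (2001 score, 2002 score) for the six students tested in both years.
pairs = [(5, 6), (2, 4), (8, 9), (6, 10), (4, 5), (7, 8)]

# Difference score for each student: 2002 minus 2001.
diffs = [after - before for before, after in pairs]

n = len(diffs)
d_bar = mean(diffs)            # mean difference, D bar
s_d = stdev(diffs)             # sample s.d. of the differences
std_err = s_d / sqrt(n)        # standard error of D bar
t_stat = d_bar / std_err
df = n - 1

print(f"D bar = {d_bar:.2f}, s_D = {s_d:.3f}, t = {t_stat:.3f}, d.f. = {df}")
```

Because the test is run on the difference scores alone, it has the same form as a one-sample t test of the hypothesis that the mean difference is 0.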

Significance test with dependent samples
t = 3.371, d.f. = 5
P-value:
move down the columns to df = 5
move across to the t-scores that bracket 3.371:
o for column 4, t = 3.365
o for column 5, t = 4.032
move up to read the one-sided p-values
o .01 > p > .005 one-sided
double the p-value to translate to a 2-sided p-value
o .02 > p > .01 two-sided, so p < .02
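The table lookup only brackets the p-value. For a rough numerical check without a statistics package, the tail area of the t distribution can be integrated directly. The sketch below is an assumption-laden illustration: it uses Simpson's rule on the standard t density, and the helper names `t_pdf` and `upper_tail` are hypothetical, not part of any library.

```python
from math import gamma, pi, sqrt

def t_pdf(x, df):
    """Density of Student's t distribution with df degrees of freedom."""
    coef = gamma((df + 1) / 2) / (sqrt(df * pi) * gamma(df / 2))
    return coef * (1 + x * x / df) ** (-(df + 1) / 2)

def upper_tail(t, df, steps=2000):
    """One-sided P(T > t) for t >= 0, via Simpson's rule on [0, t].

    The density is symmetric about 0, so the area above t is
    0.5 minus the area between 0 and t. `steps` must be even.
    """
    h = t / steps
    total = t_pdf(0.0, df) + t_pdf(t, df)
    for i in range(1, steps):
        weight = 4 if i % 2 else 2
        total += weight * t_pdf(i * h, df)
    return 0.5 - total * h / 3

p_one_sided = upper_tail(3.371, 5)
p_two_sided = 2 * p_one_sided

print(f"one-sided p = {p_one_sided:.4f}, two-sided p = {p_two_sided:.4f}")
```

The result should fall inside the brackets read from the table: between .005 and .01 one-sided, and between .01 and .02 two-sided.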

Significance test with dependent samples
p < .02 two-sided.
Conclusion:
It is very unlikely that the difference in scores could have occurred by chance alone, so we reject the null hypothesis and conclude that the geography scores were different (increased) from 2001 to 2002.
It is difficult to judge the substantive importance of the difference without knowing more about the test. The average difference was somewhat smaller than the typical variation among students.

Confidence interval for dependent samples
The confidence interval for μ_D is:
D bar ± t × (s_D / sqrt(n)), with d.f. = n – 1
(Essentially, statistical inference for the difference between dependent samples is the same as statistical inference for a single sample mean.)
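Applied to the geography example, a 95% confidence interval for the mean difference might be computed as follows. This is a sketch under two assumptions: the interval has the usual one-sample form (mean difference plus or minus a t-score times the standard error), and the critical value 2.571 is the standard t-table entry for a one-tail area of .025 with 5 degrees of freedom.

```python
from math import sqrt
from statistics import mean, stdev

# 2002 minus 2001 difference scores for the six matched students.
diffs = [1, 2, 1, 4, 1, 1]

n = len(diffs)
d_bar = mean(diffs)
std_err = stdev(diffs) / sqrt(n)

# Assumed critical value: t table, one-tail area .025, d.f. = 5.
t_crit = 2.571

margin = t_crit * std_err
lower, upper = d_bar - margin, d_bar + margin

print(f"95% CI for the mean difference: ({lower:.2f}, {upper:.2f})")
```

The interval excludes 0, which agrees with rejecting the null hypothesis at the .05 level in the two-sided test.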

Advantages and disadvantages of a matched sample test
o Known sources of potential bias are controlled. For example, comparing students who had taken the course with students who had not would probably be biased, since the course probably selects students who are already interested in (and so know more about) geography. (+)
o The standard deviation of the test statistic is usually smaller, making the power of your test correspondingly greater. (+)
o Matched tests can be relatively expensive to do, because you have to find the same subjects, and you might lose some to attrition. (-)
o If you reject the null hypothesis, you may have difficulty arguing that the difference is due to global events instead of a test-retest "practice effect". (-)

Dependent versus independent samples
When is it appropriate to use a dependent-samples design?
o repeated measures for the same individual/area/class? YES
o studies with matched pairs of family members? YES
o studies with samples matched for levels of another variable? YES (but multivariate statistics are better)
o studies matched by values of the outcome under study? NO, that would be cheating.
o studies matched at random, say by caseid for separate samples? NO, not acceptable by convention. (On average this would provide no help, but if you do it as a fishing expedition, you might randomly soak up a bit of the unexplained error.)

Using "error" to understand the difference between independent and dependent samples
Think of an observed score as a sum of the population mean and "error" for that case.
Y_i = μ + e_i
e_i includes any factors that cause the score to differ from the mean:
o individual differences
o differences unique to the observation
o measurement error
Across the population, error has an average of 0 and a typical value of ± σ.

Error in a difference between scores
The difference between two scores can be described as follows:
Y_2i - Y_1i = (μ_2 + e_2i) – (μ_1 + e_1i) = (μ_2 - μ_1) + (e_2i - e_1i)
where (μ_2 - μ_1) is the population difference we're interested in and (e_2i - e_1i) is the unwanted variation.
If e_2i is independent of e_1i, then the typical size (standard deviation) of (e_2i - e_1i) will be larger than that of either error alone by a factor of √2.
However, if e_2i and e_1i have a shared error component, then (e_2i - e_1i) will subtract out the shared error, thereby making it easier to study (μ_2 - μ_1).
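A small simulation can make the √2 claim and the benefit of shared error concrete. Everything numeric here is an assumption chosen for illustration: normally distributed errors with σ = 1, and, in the dependent case, a shared component carrying half of each error's variance.

```python
import random
from math import sqrt
from statistics import stdev

random.seed(601)   # fixed seed so the illustration is repeatable
sigma = 1.0        # assumed s.d. of each individual error
n = 20000          # number of simulated pairs

# Independent samples: the two errors share nothing.
indep_diffs = [random.gauss(0, sigma) - random.gauss(0, sigma)
               for _ in range(n)]

# Dependent samples: each error = shared part + unshared part.
# Assumed split: half of the variance is shared, so each error
# still has total variance sigma ** 2.
part_sd = sigma / sqrt(2)
paired_diffs = []
for _ in range(n):
    shared = random.gauss(0, part_sd)
    e1 = shared + random.gauss(0, part_sd)
    e2 = shared + random.gauss(0, part_sd)
    paired_diffs.append(e2 - e1)   # the shared part cancels

sd_indep = stdev(indep_diffs)
sd_paired = stdev(paired_diffs)

print(f"s.d. of difference, independent errors: {sd_indep:.3f}")
print(f"s.d. of difference, shared component:   {sd_paired:.3f}")
```

Under these assumptions the independent case should come out near √2 ≈ 1.41 and the paired case near 1.0; a larger shared component would shrink the paired figure further.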

Error in a difference between scores
In the example of pairs of geography test scores, classify each of the following as "shared error", "unshared error", or something else.
o Some of the respondents have a strong prior geography background.
o One of the respondents was feeling sick the day of the 2002 test.
o One respondent guessed at all the answers both times.
o Three of the respondents read some books about the Middle East and South Asia between 2001 and 2002.
o In 2002, two of the respondents vaguely remembered the test questions from the previous year.

Comparing proportions in dependent samples
A survey asked 340 registered voters their opinions about government spending.
o 90% (306) favored more spending on law enforcement.
o 93.24% (317) favored more spending on health care.
You can break this pattern down further:
o 292 favored more spending for both
o 9 favored less spending for both
o 25 favored more for health, less for law
o 14 favored more for law, less for health

Comparing proportions in dependent samples
Assumptions:
o The observations were drawn as a random sample.
o We are working with categorical data.
o We need to assume a minimum sample size for a normal sampling distribution: n_12 + n_21 > 20
Hypothesis:
o There is no difference in the proportion supporting either social program.
o H_0: π_law – π_health = 0

Comparing proportions in dependent samples
Test statistic:
o The test statistic for a comparison of paired proportions is a z-score, estimated using McNemar's Test.
z = (n_12 - n_21) / √(n_12 + n_21)
z = (25 – 14) / √(25 + 14) = 11 / 6.245 = 1.76

                      1st item
                 Yes            No             Total
2nd item  Yes    n_11           n_21           n_11 + n_21
          No     n_12           n_22           n_12 + n_22
          Total  n_11 + n_12    n_21 + n_22    n_total

Comparing proportions in dependent samples
P-value: for z = 1.76, p = .08 (two-tailed)
Conclusion: I would not reject the null hypothesis. In the sample it appears that increased health spending may be more popular than increased spending on law enforcement, but this difference could have occurred by chance alone.
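The z-score and its two-tailed p-value can be reproduced from the two discordant counts alone. The sketch below assumes the cell assignment implied by the slide's own calculation (25 respondents favoring more for health but less for law, 14 the reverse) and uses the standard normal tail via `math.erfc`; the variable names are illustrative.

```python
from math import erfc, sqrt

# Discordant pairs from the survey breakdown.
n12 = 25   # more for health, less for law
n21 = 14   # more for law, less for health

# McNemar's z: only the respondents who answered differently matter.
z = (n12 - n21) / sqrt(n12 + n21)

# Two-tailed p-value from the standard normal distribution:
# P(|Z| > |z|) = erfc(|z| / sqrt(2)).
p_two_sided = erfc(abs(z) / sqrt(2))

print(f"z = {z:.2f}, two-tailed p = {p_two_sided:.3f}")
```

The 301 respondents who gave the same answer to both questions do not enter the statistic at all, which is why the test needs a minimum number of discordant pairs rather than a minimum total sample size.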

How is McNemar's test derived?
P_1 = (n_11 + n_12) / n_total (proportion answering Yes on the 1st item)
P_2 = (n_11 + n_21) / n_total (proportion answering Yes on the 2nd item)
P_ij = n_ij / n_total
Difference of proportions:
P_1 – P_2 = (1*n_12 + (-1)*n_21 + 0*n_11 + 0*n_22) / n_total = (n_12 - n_21) / n_total
Standard error of the difference (the tough one):
s.e. = SQRT( [P_1(1-P_1) + P_2(1-P_2) - 2(P_11*P_22 - P_12*P_21)] / n_total )
which, under the null hypothesis of equal proportions, reduces to
s.e. = SQRT(n_12 + n_21) / n_total
Z-score:
z = (P_1 – P_2) / s.e. = (n_12 - n_21) / SQRT(n_12 + n_21)
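To see how the long standard error formula relates to the short one, both can be evaluated on the survey counts. This is a sketch, not the deck's own calculation: the cell labels (health as the first item, law as the second) are inferred from the slide's z computation, and the variable names are hypothetical.

```python
from math import sqrt

# Cell counts: first index = health (1 yes, 2 no), second = law.
n11, n12, n21, n22 = 292, 25, 14, 9
n = n11 + n12 + n21 + n22

p11, p12, p21, p22 = (count / n for count in (n11, n12, n21, n22))
p_health = p11 + p12   # share favoring more health spending
p_law = p11 + p21      # share favoring more law enforcement spending

# Long form: two binomial variances minus twice the covariance,
# all divided by the number of respondents.
var_full = (p_health * (1 - p_health) + p_law * (1 - p_law)
            - 2 * (p11 * p22 - p12 * p21)) / n
se_full = sqrt(var_full)

# Algebraically, the numerator equals p12 + p21 - (p12 - p21) ** 2.
numerator_check = p12 + p21 - (p12 - p21) ** 2

# Short form used by McNemar's test: drop the squared-difference
# term, which is zero when the null hypothesis holds.
se_null = sqrt(n12 + n21) / n

print(f"s.e. (long form)  = {se_full:.5f}")
print(f"s.e. (short form) = {se_null:.5f}")
```

With these counts the two values agree to about three decimal places; the short form is slightly larger because it imposes the null hypothesis instead of using the observed difference.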

An alternative to McNemar's Test in STATA
The TTEST command can also be used for comparing dependent samples when the samples are proportions.
o Assume you have downloaded a data set with case-by-case data for variables "health" and "law".
. ttest health=law
Paired t test
Variable | Obs Mean Std. Err. Std. Dev. [95% Conf. Interval]
health |
law |
diff |
Ho: mean(health - law) = mean(diff) = 0
Ha: mean(diff) < 0    Ha: mean(diff) != 0    Ha: mean(diff) > 0
t =    t =    t =
P < t =    P > |t| =    P > t =
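The same comparison can be sketched outside Stata by rebuilding case-level 0/1 data from the four cell counts and running a paired t test by hand. The coding (1 = favors more spending, 0 = does not) and the variable names are assumptions for illustration; the sketch is not meant to reproduce Stata's output layout.

```python
from math import sqrt
from statistics import mean, stdev

# (health, law) response pattern -> number of respondents.
cells = {(1, 1): 292, (1, 0): 25, (0, 1): 14, (0, 0): 9}

# Expand the table into one record per respondent.
health, law = [], []
for (h, l), count in cells.items():
    health += [h] * count
    law += [l] * count

# Paired t test on the difference of the two 0/1 variables.
diffs = [h - l for h, l in zip(health, law)]
n = len(diffs)
t_stat = mean(diffs) / (stdev(diffs) / sqrt(n))

print(f"n = {n}, mean(health) = {mean(health):.4f}, "
      f"mean(law) = {mean(law):.4f}, t = {t_stat:.3f}")
```

With 340 respondents the t statistic lands very close to McNemar's z of 1.76; the small gap arises because the t test estimates the standard error from the observed differences rather than under the null hypothesis.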