Comparing Two Groups’ Means or Proportions

Slides:



Advertisements
Similar presentations
Comparing Two Groups’ Means or Proportions: Independent Samples t-tests.
Advertisements

Chapter 16 Inferential Statistics
Statistics Review.
Inferential Statistics
Comparing Two Groups’ Means or Proportions Independent Samples t-tests.
Chapter 10 Section 2 Hypothesis Tests for a Population Mean
Objectives (BPS chapter 24)
Inference1 Data Analysis Inferential Statistics Research Methods Gail Johnson.
1 Test a hypothesis about a mean Formulate hypothesis about mean, e.g., mean starting income for graduates from WSU is $25,000. Get random sample, say.
1 SOC 3811 Basic Social Statistics. 2 Reminder  Hand in your Assignment 4.
1 Practicals, Methodology & Statistics II Laura McAvinue School of Psychology Trinity College Dublin.
1 The Basics of Regression Regression is a statistical technique that can ultimately be used for forecasting.
Chapter Sampling Distributions and Hypothesis Testing.
Inference.ppt - © Aki Taanila1 Sampling Probability sample Non probability sample Statistical inference Sampling error.
Independent Sample T-test Often used with experimental designs N subjects are randomly assigned to two groups (Control * Treatment). After treatment, the.
Chapter 9 Hypothesis Testing II. Chapter Outline  Introduction  Hypothesis Testing with Sample Means (Large Samples)  Hypothesis Testing with Sample.
The one sample t-test November 14, From Z to t… In a Z test, you compare your sample to a known population, with a known mean and standard deviation.
Data Analysis Statistics. Levels of Measurement Nominal – Categorical; no implied rankings among the categories. Also includes written observations and.
Hypothesis Testing: Two Sample Test for Means and Proportions
INFERENTIAL STATISTICS – Samples are only estimates of the population – Sample statistics will be slightly off from the true values of its population’s.
Estimation and Hypothesis Testing Faculty of Information Technology King Mongkut’s University of Technology North Bangkok 1.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 12 Analyzing the Association Between Quantitative Variables: Regression Analysis Section.
Week 9 Chapter 9 - Hypothesis Testing II: The Two-Sample Case.
1 GE5 Lecture 6 rules of engagement no computer or no power → no lesson no SPSS → no lesson no homework done → no lesson.
Lecture 14 Testing a Hypothesis about Two Independent Means.
Section #4 October 30 th Old: Review the Midterm & old concepts 1.New: Case II t-Tests (Chapter 11)
Significance Tests …and their significance. Significance Tests Remember how a sampling distribution of means is created? Take a sample of size 500 from.
Bivariate Relationships Analyzing two variables at a time, usually the Independent & Dependent Variables Like one variable at a time, this can be done.
Section 10.1 ~ t Distribution for Inferences about a Mean Introduction to Probability and Statistics Ms. Young.
Correlation and Regression. The test you choose depends on level of measurement: IndependentDependentTest DichotomousContinuous Independent Samples t-test.
More About Significance Tests
Estimation Statistics with Confidence. Estimation Before we collect our sample, we know:  -3z -2z -1z 0z 1z 2z 3z Repeated sampling sample means would.
F OUNDATIONS OF S TATISTICAL I NFERENCE. D EFINITIONS Statistical inference is the process of reaching conclusions about characteristics of an entire.
A Sampling Distribution
Two Sample Tests Nutan S. Mishra Department of Mathematics and Statistics University of South Alabama.
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 17 Inferential Statistics.
Copyright © 2008 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 22 Using Inferential Statistics to Test Hypotheses.
Week 8 Chapter 8 - Hypothesis Testing I: The One-Sample Case.
Chapter 9 Hypothesis Testing and Estimation for Two Population Parameters.
Chapter 9: Testing Hypotheses
ANOVA Analysis of Variance Most Useful when Conducting Experiments.
● Final exam Wednesday, 6/10, 11:30-2:30. ● Bring your own blue books ● Closed book. Calculators and 2-page cheat sheet allowed. No cell phone/computer.
Agresti/Franklin Statistics, 1 of 122 Chapter 8 Statistical inference: Significance Tests About Hypotheses Learn …. To use an inferential method called.
Copyright ©2011 Nelson Education Limited Large-Sample Estimation CHAPTER 8.
Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests.
Statistical Significance. Office Hour Sign Up I’d like to meet with everybody 1 on 1 re papers Please sign up during office hours, or let me know If those.
Jeopardy Statistics Edition. Terms Calculator Commands Sampling Distributions Confidence Intervals Hypothesis Tests: Proportions Hypothesis Tests: Means.
Section 9.3 ~ Hypothesis Tests for Population Proportions Introduction to Probability and Statistics Ms. Young.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 13 Multiple Regression Section 13.3 Using Multiple Regression to Make Inferences.
5.1 Chapter 5 Inference in the Simple Regression Model In this chapter we study how to construct confidence intervals and how to conduct hypothesis tests.
Chapter 22: Comparing Two Proportions. Yet Another Standard Deviation (YASD) Standard deviation of the sampling distribution The variance of the sum or.
Copyright © 2010, 2007, 2004 Pearson Education, Inc Section 8-2 Basics of Hypothesis Testing.
Introduction to Inference: Confidence Intervals and Hypothesis Testing Presentation 8 First Part.
Introduction to Inference: Confidence Intervals and Hypothesis Testing Presentation 4 First Part.
Chapter 8 Parameter Estimates and Hypothesis Testing.
Fall 2002Biostat Statistical Inference - Confidence Intervals General (1 -  ) Confidence Intervals: a random interval that will include a fixed.
Review - Confidence Interval Most variables used in social science research (e.g., age, officer cynicism) are normally distributed, meaning that their.
© Copyright McGraw-Hill 2004
Statistical Inference Drawing conclusions (“to infer”) about a population based upon data from a sample. Drawing conclusions (“to infer”) about a population.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Cross Tabs and Chi-Squared Testing for a Relationship Between Nominal/Ordinal Variables.
Lecture 19 Dustin Lueker.  The p-value for testing H 1 : µ≠100 is p=.001. This indicates that… 1.There is strong evidence that μ=100 2.There is strong.
What statistical tests have we learned so far? Descriptive statistics (chp. 12) –Mean, median, mode –Frequency of each response (frequencies), range, standard.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Statistical principles: the normal distribution and methods of testing Or, “Explaining the arrangement of things”
 Confidence Intervals  Around a proportion  Significance Tests  Not Every Difference Counts  Difference in Proportions  Difference in Means.
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 7 l Hypothesis Tests 7.1 Developing Null and Alternative Hypotheses 7.2 Type I & Type.
Comparing Two Proportions Chapter 21. In a two-sample problem, we want to compare two populations or the responses to two treatments based on two independent.
CHAPTER 15: THE NUTS AND BOLTS OF USING STATISTICS.
Chapter Outline Inferences About the Difference Between Two Population Means: s 1 and s 2 Known.
Presentation transcript:

Comparing Two Groups’ Means or Proportions Independent Samples t-tests

Review Y-bar? Y-bar? µ? Y-bar µo=guess 95% CI: Y-bar +/- 1.96 *(s.e.) Confidence Interval for a Mean Slap a sampling distribution* over a sample mean to determine a range in which the population mean has a particular probability of being—such as 95% CI. If our sample is one of the middle 95%, we know that the mean of the population is within the CI. Significance Test for a Mean Slap a sampling distribution* over a guess of the population mean to determine if the sample has a very low probability of having come from a population where the guess is true—such as α-level = .05. If our sample mean is in the outer 5%, we know to reject the guess, our sample has a low chance of having come from a population with the mean we guessed. Y-bar? Y-bar? µ? 2.5% 2.5% 2.5% 2.5% -1.96z +1.96z Y-bar 95% CI: Y-bar +/- 1.96 *(s.e.) -1.96z +1.96z µo=guess z or t = (Y-bar - µo)/ s.e. *sampling distribution: the way statistics for samples of a certain size would stack up or be distributed after all possible samples are collected

Review Let’s collect some data on educational aspirations and produce a 95% confidence interval to tell us where the population parameter likely falls and then let’s do a test of significance where we guess that average aspiration will be 16 years. I collected a sample of 625 kids who reported their educational aspirations where 12 = high school, 16 equals 4 years of college and so forth. The average for the sample was 15 years with a standard deviation of 2 years. 95% confidence interval 95% CI = Sample Mean +/- z * s.e. Find the standard error of the sampling distribution: s / n = 2/√625 = 2/25 = 0.08 Build the width of the Interval. 95% corresponds with a z of +/- 1.96. +/- z * s.e = 1.96 * 0.08 = 0.157 Insert the mean to build the interval: 95% CI = Sample Mean +/- z * s.e = 15 +/- 0.157 The interval: 14.84 to 15.16 We are 95% confident that the population mean falls between these values. (What does this say about my guess???)

Review Let’s collect some data on educational aspirations and produce a 95% confidence interval to tell us where the population parameter likely falls and then let’s do a test of significance where we guess that average aspiration will be 16 years. I collected a sample of 625 kids who reported their educational aspirations where 12 = high school, 16 equals 4 years of college and so forth. The average for the sample was 15 years with a standard deviation of 2 years. Significance Test z or t = (Y-bar - µo)/ s.e. Decide -level ( = .05) and nature of test (two-tailed) Set critical z or t: (+/- 1.96) Make guess or null hypothesis, Ho:  = 16 Ha:   16 Collect and analyze data Calculate Z or t: z/t = Y-bar - o (s.e. = s/√n = 2/√625 = 2/25 = .08) s.e. z/t = (15 – 16)/.08 = -1/.08 = -12.5 Make a decision about the null hypothesis (reject the null: -12.5 < -1.96) Find the P-value (look up 12.5 in z or t table). P < .0001 It is extremely unlikely that our sample came from a population where the mean is 16.

I love sophisticated statistics! Comparing Two Groups We’re going to move forward to more sophisticated statistics, building on what we have learned about confidence intervals and significance tests. Sociologists look for relationships between concepts in the social world. For example: Does one’s sex affect income? Focus on the relationship between the concepts: Sex and Income Does one’s race affect educational attainment? Focus on the relationship between the concepts: Race and Educational Attainment I love sophisticated statistics!

Comparing Two Groups In this section of the course, you will learn ways to infer from a sample whether two concepts are related in a population. Independent variable (X): That which causes another variable to change when it changes. Dependent variable (Y): That which changes in response to change in another variable. X  Y (X= Sex or Race) (Y= Income or Education) The statistical technique you use will depend of the level of measurement of your independent and dependent variables—the statistical test must match the variables! Levels of Measurement: Nominal, Ordinal, Interval-Ratio

Comparing Two Groups The test you choose depends on level of measurement: Independent Dependent Statistical Test Dichotomous Interval-ratio Independent Samples t-test Dichotomous Nominal Nominal Cross Tabs Ordinal Ordinal Dichotomous Dichotomous Nominal Interval-ratio ANOVA Ordinal Dichotomous Interval-ratio Interval-ratio Correlation and OLS Regression

Comparing Two Groups Independent Dependent Statistical Test Dichotomous Interval-ratio Independent Samples t-test Dichotomous An independent samples t-test is concerned with whether a mean or proportion is equal between two groups. For example, does sex affect income? ♀ Income ♂ Income µ µ Women’s mean = Men’s Mean ???

Comparing Two Groups Independent Samples t-tests: Earlier, our focus was on the mean. We used the mean of the sample (statistic) to infer a range for what our population mean (parameter) might be (confidence interval) or whether it was like some guess or not (significance test). Now, our focus is on the difference in the mean for two groups. We will use the difference of the sample (statistic) to infer a range for what our population difference (parameter) might be (confidence interval) or whether it is like some guess (significance test).

Comparing Two Groups The difference will be calculated as such: D-bar = Y-bar2 – Y-bar1 For example: Average Difference in Income by Sex = Male Average Income – Female Average Income (What would it mean if men’s income minus women’s income equaled zero?)

Comparing Two Groups Like the mean, if one were to take random sample after random sample from two groups and calculate and record the difference between groups each time, one would see the formation of a Sampling Distribution for D-bar that was normal and centered on the two populations’ difference. average difference between two groups’ samples = Sampling Distribution of D-bar Z -3 -2 -1 0 1 2 3 95% Range

Comparing Two Groups It’s Power Time Again! Using just a sample, our statistics will allow us to pinpoint the difference between two groups in the population (confidence interval) or to determine whether our sample could have come from a population with a difference between two groups that we guessed (significance test).

Comparing Two Groups So the rules and techniques we learned for means and proportions apply to the differences in groups’ means and proportions. One creates sampling distributions to create confidence intervals and do significance tests in the same ways. However, the standard error of D-bar has to be calculated slightly differently. For Means: (s1)2 (s2)2 s.e. (s.d. of the sampling distribution) = n1 + n2 For Proportions: s.e. = 1 (1 - 1) 2 (1 - 2) n1 + n2

Comparing Two Groups Calculating a Confidence Interval for the Difference between Two Groups’ Means By slapping the sampling distribution for the difference over our sample’s difference between groups, D-bar, we can find the values between which the population difference is likely to be. 95% C.I. = D-bar +/- 1.96 * (s.e.) = (Y-bar2 – Y-bar1) +/- 1.96 * (s.e.) Or = (2 – 1) +/- 1.96 * (s.e.) 99% C.I. = D-bar +/- 2.58 * (s.e.) = (Y-bar2 – Y-bar1) +/- 2.58 * (s.e.) Or = (2 – 1) +/- 2.58 * (s.e.)

Comparing Two Groups EXAMPLE: We want to know what the likely difference is between male and female GPAs in a population of college students with 95% confidence. Sample: 50 men, average gpa = 2.9, s.d. = 0.5 50 women, average gpa = 3.1, s.d. = 0.4 95% C.I. = Y-bar2 – Y-bar1 +/- 1.96 * s.e. Find the standard error of the sampling distribution: s.e. = (.5)2/ 50 + (.4)2/50 =  .005 + .003 =  .008 = 0.089 Build the width of the Interval. 95% corresponds with a z or t of +/- 1.96. +/- z * s.e = +/- 1.96 * 0.089 = +/- 0.174 Insert the mean difference to build the interval: 95% C.I. = (Y-bar2 – Y-bar1) +/- 1.96 * s.e. = 3.1 - 2.9 +/- 0.174 = 0.2 +/- 0.174 The interval: 0.026 to 0.374 We are 95% confident that the difference between men’s and women’s GPAs in the population is between .026 and 0.374. If we had guessed zero difference, would the difference be a significant difference?

Comparing Two Groups Conducting a Test of Significance for the Difference between Two Groups’ Means By slapping the sampling distribution for the difference over a guess of the difference between groups, Ho, we can find out whether our sample could have been drawn from a population where the difference is equal to our guess. Two-tailed significance test for -level = .05 Critical z or t = +/- 1.96 To find if there is a difference in the population, Ho: 2 - 1 = 0 Ha: 2 - 1  0 Collect Data Calculate z or t: z or t = (Y-bar2 – Y-bar1) – (2 - µ1) s.e. Make decision about the null hypothesis (reject or fail to reject) Report P-value

Comparing Two Groups EXAMPLE: We want to know whether there is a difference in male and female GPAs in a population of college students. Two-tailed significance test for -level = .05 Critical z or t = +/- 1.96 To find if there is a difference in the population, Ho: 2 - 1 = 0 Ha: 2 - 1  0 Collect Data Sample: 50 men, average gpa = 2.9, s.d. = 0.5 50 women, average gpa = 3.1, s.d. = 0.4 s.e. = (.5)2/ 50 + (.4)2/50 =  .005 + .003 =  .008 = 0.089 Calculate z or t: z or t = 3.1 – 2.9 – 0 = 0.2 = 2.25 0.089 0.089 Make decision about the null hypothesis: Reject the null. There is enough difference between groups in our sample to say that there is a difference in the population. 2.25 >1.96 Find P-value: p or (sig.) = .0122 We have a 1.2 % chance that the difference in our sample could have come from a population where there is no difference between men and women. That chance is low enough to reject the null, for sure!

The steps outlined above for Comparing Two Groups The steps outlined above for Confidence intervals And Significance tests for differences in means are the same you would use for differences in proportions. Just note the difference in calculation of the standard error for the difference.

Comparing Two Groups Now let’s do an example with SPSS, using the General Social Survey.