Section 4.2: How to Look for Differences. Cross-Tabulations College student binge drinkers experienced many personal and social problems, the researchers.

Slides:



Advertisements
Similar presentations
Small differences. Two Proportion z-Interval and z-Tests.
Advertisements

Hypothesis Testing and Comparing Two Proportions Hypothesis Testing: Deciding whether your data shows a “real” effect, or could have happened by chance.
SPSS Session 5: Association between Nominal Variables Using Chi-Square Statistic.
Hypothesis Testing IV Chi Square.
© 2010 Pearson Prentice Hall. All rights reserved The Chi-Square Test of Independence.
1-1 Copyright © 2015, 2010, 2007 Pearson Education, Inc. Chapter 25, Slide 1 Chapter 25 Comparing Counts.
PSY 340 Statistics for the Social Sciences Chi-Squared Test of Independence Statistics for the Social Sciences Psychology 340 Spring 2010.
11-2 Goodness-of-Fit In this section, we consider sample data consisting of observed frequency counts arranged in a single row or column (called a one-way.
Cross-Tabulations.
Comparing Population Parameters (Z-test, t-tests and Chi-Square test) Dr. M. H. Rahbar Professor of Biostatistics Department of Epidemiology Director,
Chi-square Goodness of Fit Test
1 Nominal Data Greg C Elvers. 2 Parametric Statistics The inferential statistics that we have discussed, such as t and ANOVA, are parametric statistics.
Presentation 12 Chi-Square test.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 11 Analyzing the Association Between Categorical Variables Section 11.5 Small Sample.
1 Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. Analysis of Categorical Data Test of Independence.
Cross Tabulation and Chi-Square Testing. Cross-Tabulation While a frequency distribution describes one variable at a time, a cross-tabulation describes.
CHP400: Community Health Program - lI Research Methodology. Data analysis Hypothesis testing Statistical Inference test t-test and 22 Test of Significance.
CJ 526 Statistical Analysis in Criminal Justice
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Inference on Categorical Data 12.
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 13: Nominal Variables: The Chi-Square and Binomial Distributions.
For testing significance of patterns in qualitative data Test statistic is based on counts that represent the number of items that fall in each category.
6.1 - One Sample One Sample  Mean μ, Variance σ 2, Proportion π Two Samples Two Samples  Means, Variances, Proportions μ 1 vs. μ 2.
Chi-Square Test of Independence Practice Problem – 1
Chi-Square as a Statistical Test Chi-square test: an inferential statistics technique designed to test for significant relationships between two variables.
Chapter 9: Non-parametric Tests n Parametric vs Non-parametric n Chi-Square –1 way –2 way.
Social Science Research Design and Statistics, 2/e Alfred P. Rovai, Jason D. Baker, and Michael K. Ponton Pearson Chi-Square Contingency Table Analysis.
Chi-Square Procedures Chi-Square Test for Goodness of Fit, Independence of Variables, and Homogeneity of Proportions.
Section 9-1: Inference for Slope and Correlation Section 9-3: Confidence and Prediction Intervals Visit the Maths Study Centre.
Slide 26-1 Copyright © 2004 Pearson Education, Inc.
FPP 28 Chi-square test. More types of inference for nominal variables Nominal data is categorical with more than two categories Compare observed frequencies.
Learning Objectives Copyright © 2002 South-Western/Thomson Learning Statistical Testing of Differences CHAPTER fifteen.
Statistical Significance for a two-way table Inference for a two-way table We often gather data and arrange them in a two-way table to see if two categorical.
Copyright © 2010 Pearson Education, Inc. Slide
Logic and Vocabulary of Hypothesis Tests Chapter 13.
Copyright © 2010 Pearson Education, Inc. Warm Up- Good Morning! If all the values of a data set are the same, all of the following must equal zero except.
Testing Hypotheses about a Population Proportion Lecture 29 Sections 9.1 – 9.3 Fri, Nov 12, 2004.
Section 12.2: Tests for Homogeneity and Independence in a Two-Way Table.
Making Comparisons All hypothesis testing follows a common logic of comparison Null hypothesis and alternative hypothesis – mutually exclusive – exhaustive.
Introduction to Hypothesis Testing
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Chapter 14 – 1 Chi-Square Chi-Square as a Statistical Test Statistical Independence Hypothesis Testing with Chi-Square The Assumptions Stating the Research.
Chapter 12 Chi-Square Tests and Nonparametric Tests.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 11 Inference for Distributions of Categorical.
Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n.
Class Seven Turn In: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 For Class Eight: Chapter 20: 18, 20, 24 Chapter 22: 34, 36 Read Chapters 23 &
Comparing Observed Distributions A test comparing the distribution of counts for two or more groups on the same categorical variable is called a chi-square.
Textbook Section * We already know how to compare two proportions for two populations/groups. * What if we want to compare the distributions of.
Chi Square Test of Homogeneity. Are the different types of M&M’s distributed the same across the different colors? PlainPeanutPeanut Butter Crispy Brown7447.
Comparing Counts Chi Square Tests Independence.
Lecture #8 Thursday, September 15, 2016 Textbook: Section 4.4
Warm Up Check your understanding on p You do NOT need to calculate ALL the expected values by hand but you need to do at least 2. You do NOT need.
CHAPTER 11 Inference for Distributions of Categorical Data
Hypothesis Testing Review
CHAPTER 11 Inference for Distributions of Categorical Data
Hypothesis Testing and Comparing Two Proportions
CHAPTER 11 Inference for Distributions of Categorical Data
Chapter 10 Analyzing the Association Between Categorical Variables
Contingency Tables: Independence and Homogeneity
Inference on Categorical Data
CHAPTER 11 Inference for Distributions of Categorical Data
CHAPTER 11 Inference for Distributions of Categorical Data
CHAPTER 11 Inference for Distributions of Categorical Data
CHAPTER 11 Inference for Distributions of Categorical Data
CHAPTER 11 Inference for Distributions of Categorical Data
CHAPTER 11 Inference for Distributions of Categorical Data
CHAPTER 11 Inference for Distributions of Categorical Data
CHAPTER 11 Inference for Distributions of Categorical Data
Hypothesis Testing - Chi Square
Presentation transcript:

Section 4.2: How to Look for Differences

Cross-Tabulations College student binge drinkers experienced many personal and social problems, the researchers said. For instance, 90% of those who binged more than once a week said they experienced a hangover and 63% said they did something they regretted. --- Newsday, 12/94

Making Sense of the Data Were binge drinkers more likely to do things they regretted than students who did not binge drink? Defn: Binge drinking – consuming 5 or more dinks in a row for men or four or more in a row for women.

Wechsler Study on American College Students Nonbinge drinkers (n=6894) Infrequent binge drinkers (n=4090) Frequent binge drinkers (n=3291) Did something you regret 14%37%63% Missed a class 8%30%61% Got behind in schoolwork 6%21%46%

Cross-Tab Tables Categories of one measurement are listed on the top and the categories of the other measurement are listed on the side. Also, list the number of people that appear in each cell.

Wechsler Cross Tab NonbingeInfrequent Binge Frequent Binge Kept up with schoolwork 6,4803,2311,777 Got behind with schoolwork ,514

Marginal Totals and Proportions Cross Tabs will often show the totals (proportions) of each column in the bottom margin under the column and the total (proportions) of each row in the margin to the right of the row.

Cross-Tab with Column Marginals NonbingeInfrequent Binge Frequent Binge Kept up with schoolwork 6,4803,2311,777 Got behind with schoolwork ,514 Column Count 6,8944,0903,291 Column %48%29%23%

Correlation A cross-tab is a useful way of checking for correlation between two categorical variables. Two variables are correlated if knowing an observation’s value for one of the measurements helps you to better estimate the likelihoods of that observation’s value on the other measurement.

Correlation – Another View If the counts you would expect if there was no correlation are the counts you observed, then there is no correlation.

Computing Expected Counts Expected counts can be thought of as the count that would be if the row proportion applied equally to all the data in it; that is, if there were no correlation between the observations in the cross-tab.

Correlation in the Sample vs. Population Even though the expected counts may show a correlation in the sample data, how does one show there is correlation in the population?  Perform a hypothesis test.

Step 1: State the Null Hypothesis H 0 : No correlation in the population data. Remember…. H 0 is always the hypothesis of “no difference”. In this instance “no difference” means no difference between expected and observed counts – or no correlation.

Step 2: Collect Sample Data Use the Wechsler data. Is the sample data statistically significant? That is, assuming the null hypothesis is correct (i.e. there is no correlation between the two variables), how likely is it to get the sample data that we observed?

Step 3: Compute the Appropriate Test Statistic In testing a claim about one sample proportion, the test statistic is the z-test statistic. Since we have several sample proportions we’ll need a new test statistic – the chi- squared test statistic.

To compute the test statistic: Calculate the expected counts for each cell. We’ll actually calculate the chi-squared test statistic tomorrow.

Step 4: Find the p-value A p-value means just what it did earlier…. Assuming the null hypothesis is true (that is, there is no correlation between the two variables), it is the probability of getting sample data as extreme or more extreme than we observed.

Or more simply…. If there is no correlation (association) between the two variables, could the differences in the observed and expected cell counts be attributed to chance? With this test we’ll use the TI-83 to find the p-value.

Step 4: Reject or retain the null hypothesis

Step 5: State Your Conclusions There is statistical evidence to suggest that there is (is not) correlation between the two variables…..