1 Always be mindful of the kindness and not the faults of others.

Slides:



Advertisements
Similar presentations
Categorical Data Analysis
Advertisements

Chi Squared Tests. Introduction Two statistical techniques are presented. Both are used to analyze nominal data. –A goodness-of-fit test for a multinomial.
Inference about the Difference Between the
Discrete (Categorical) Data Analysis
1 1 Slide © 2009 Econ-2030-Applied Statistics-Dr. Tadesse. Chapter 11: Comparisons Involving Proportions and a Test of Independence n Inferences About.
Chi-square test Chi-square test or  2 test. Chi-square test countsUsed to test the counts of categorical data ThreeThree types –Goodness of fit (univariate)
Chapter Ten Comparing Proportions and Chi-Square Tests McGraw-Hill/Irwin Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
Stat 512 – Lecture 14 Analysis of Variance (Ch. 12)
12.The Chi-square Test and the Analysis of the Contingency Tables 12.1Contingency Table 12.2A Words of Caution about Chi-Square Test.
Cross Tabulation and Chi Square Test for Independence.
1 Pertemuan 09 Pengujian Hipotesis Proporsi dan Data Katagorik Matakuliah: A0392 – Statistik Ekonomi Tahun: 2006.
Chapter 16 Chi Squared Tests.
Stat 512 – Lecture 13 Chi-Square Analysis (Ch. 8).
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 14 Goodness-of-Fit Tests and Categorical Data Analysis.
11-2 Goodness-of-Fit In this section, we consider sample data consisting of observed frequency counts arranged in a single row or column (called a one-way.
Chi-Square and F Distributions Chapter 11 Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania.
Chapter 11a: Comparisons Involving Proportions and a Test of Independence Inference about the Difference between the Proportions of Two Populations Hypothesis.
1 1 Slide © 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
1 Categorical Data (Chapter 10) Inference about one population proportion (§10.2). Inference about two population proportions (§10.3). Chi-square goodness-of-fit.
Testing for a Relationship Between 2 Categorical Variables The Chi-Square Test …
The Chi-Square Test Used when both outcome and exposure variables are binary (dichotomous) or even multichotomous Allows the researcher to calculate a.
1 Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. Analysis of Categorical Data Test of Independence.
Analysis of Categorical Data
1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.
Introduction to Hypothesis Testing: One Population Value Chapter 8 Handout.
Chapter 11: Applications of Chi-Square. Count or Frequency Data Many problems for which the data is categorized and the results shown by way of counts.
Chapter 11: Applications of Chi-Square. Chapter Goals Investigate two tests: multinomial experiment, and the contingency table. Compare experimental results.
Maximum Likelihood Estimator of Proportion Let {s 1,s 2,…,s n } be a set of independent outcomes from a Bernoulli experiment with unknown probability.
Copyright © 2009 Cengage Learning 15.1 Chapter 16 Chi-Squared Tests.
Chapter 12 The Analysis of Categorical Data and Goodness-of-Fit Tests.
Chapter 13: Categorical Data Analysis Statistics.
1 Always be mindful of the kindness and not the faults of others.
1 1 Slide © 2006 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 In this case, each element of a population is assigned to one and only one of several classes or categories. Chapter 11 – Test of Independence - Hypothesis.
1 1 Slide Chapter 11 Comparisons Involving Proportions n Inference about the Difference Between the Proportions of Two Populations Proportions of Two Populations.
Introduction Many experiments result in measurements that are qualitative or categorical rather than quantitative. Humans classified by ethnic origin Hair.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Slide 26-1 Copyright © 2004 Pearson Education, Inc.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Chapter 16 Chi-Squared Tests.
Data Analysis for Two-Way Tables. The Basics Two-way table of counts Organizes data about 2 categorical variables Row variables run across the table Column.
Contingency Tables 1.Explain  2 Test of Independence 2.Measure of Association.
AP Statistics Section 14.. The main objective of Chapter 14 is to test claims about qualitative data consisting of frequency counts for different categories.
Copyright © 2010 Pearson Education, Inc. Slide
1 1 Slide © 2009 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
Inference for Distributions of Categorical Variables (C26 BVD)
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Chapter 13 Inference for Counts: Chi-Square Tests © 2011 Pearson Education, Inc. 1 Business Statistics: A First Course.
Chapter Outline Goodness of Fit test Test of Independence.
1 Chapter 10. Section 10.1 and 10.2 Triola, Elementary Statistics, Eighth Edition. Copyright Addison Wesley Longman M ARIO F. T RIOLA E IGHTH E DITION.
Slide 1 Copyright © 2004 Pearson Education, Inc..
Copyright © Cengage Learning. All rights reserved. Chi-Square and F Distributions 10.
Dan Piett STAT West Virginia University Lecture 12.
Section 12.2: Tests for Homogeneity and Independence in a Two-Way Table.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 10 Comparing Two Groups Section 10.1 Categorical Response: Comparing Two Proportions.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 11 Analyzing the Association Between Categorical Variables Section 11.2 Testing Categorical.
Statistics 300: Elementary Statistics Section 11-2.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial.
Categorical Data Analysis
Statistics 300: Elementary Statistics Section 11-3.
Objectives (BPS chapter 12) General rules of probability 1. Independence : Two events A and B are independent if the probability that one event occurs.
Slide 1 Copyright © 2004 Pearson Education, Inc. Chapter 11 Multinomial Experiments and Contingency Tables 11-1 Overview 11-2 Multinomial Experiments:
Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n.
Chi-Två Test Kapitel 6. Introduction Two statistical techniques are presented, to analyze nominal data. –A goodness-of-fit test for the multinomial experiment.
Goodness-of-Fit and Contingency Tables Chapter 11.
CHI SQUARE DISTRIBUTION. The Chi-Square (  2 ) Distribution The chi-square distribution is the probability distribution of the sum of several independent,
©2006 Thomson/South-Western 1 Chapter 12 – Analysis of Categorical Data Slides prepared by Jeff Heyl Lincoln University ©2006 Thomson/South-Western Concise.
St. Edward’s University
Presentation transcript:

1 Always be mindful of the kindness and not the faults of others.

Categorical Data Sections 10.1 to 10.5 Estimation for proportions Tests for proportions Chi-square tests

3 Example Researchers in the development of new treatments for cancer patients often evaluate the effectiveness of new therapies by reporting the proportion of patients who survive for a specified period of time after completion of the treatment. A new treatment of 870 patients with lung cancer resulted in 330 survived at least 5 years.

4 Example Estimate , the proportion of all patients with lung cancer who would survive at least 5 years after being administered this treatment How much would you estimate the proportion as?

5 Distribution of Sample Proportion Y: the number of successes in the n trials (independent and identical trials) What’s the distribution of Y? Sample proportion,

6 Distribution of Sample Proportion When n  ≥ 5 and n(1-  ≥ 5, the distribution of Y can be approximated by a normal distribution. (approximate) (1-  ) Confidence Interval for  : Optional: (exact) C.I. for  for small sample

7 Sample Size Where E is the largest tolerable error at (1-  confidence level.

8 Test for a Large Sample When n    ≥ 5 and n(1-    ≥ 5, the test statistic is:

9 Inference about 2 Proportions Notation: Population 1Population 2 Proportion  Sample sizen1n2 # of successesy1y2 Sample proportion

10 Estimation for  Point estimate:

11 Estimation for  (1-  ) Confidence Interval for two large samples:

12 Example 10.6 A company markets a new product in the Grand Rapids and Wichita. In Grand Rapids, the company’s advertising is based entirely on TV commercials. In Wichita, based on a balanced mix of TV, radio, newspaper, and magazine. 2 months after the ad campaign begins, the company conducts surveys to determine consumer awareness of the product.

13 Example 10.6: Data Set Grand RapidsWichita # of interviewed # of aware Q: Calculate a 95% C.I. for the regional difference in the proportion of all consumers who are aware of the product.

14 Example 10.6 (conti.) Conduct a test at  =0.05 to verify if there are >10% more Wichita consumers than Grand Rapids consumers aware of the product.

15 Test for  Large Samples) When n1  ≥ 5 and n1(1-  ≥ 5; n2  ≥ 5 and n2(1-  ≥ 5, the test statistic of Ho: p1-p2=d is Optional: Fisher Exact Test (p.511)

Minitab Z procedure for one proportion: Stat >> Basic Statistics >>1 proportion Z procedure for two proportions: Stat >> Basic Statistics >>2 proportion Sample size calculation: Stat >>Power & Sample size>>1 proportion or 2 proportion Stat >>Power & Sample size>>sample size for estimation 16

17 Chi-Square Goodness of Fit Test More than two possible outcomes per trial  the multinomial experiment 1. The experiment consists of n identical trials. 2. Each trial results in one of k outcomes with probabilities     ...  k. Y=(Y 1,…,Y k ); Y i = the # of outcome i.

18 Chi-square Goodness of Fit Test Goal:We are interested in testing a hypothesized distribution of Y (i.e. a set of  i ’s values). Hypotheses: Ho:  i =  io for all ivs. Ha: Ho is false

19 Chi-square Goodness of Fit Test Test Statistic: ni = the observed Yi Ei = the expected Yi = n  io

20 Chi-square Goodness of Fit Test Rejection Region: Reject Ho if where df=k-1. Note: This test can be trusted only when 80% of more cells of the Ei’s are at least 5.

21 Example CategoryHypothesized %Observed counts Marked decrease50120 Moderate decrease2560 Slight decrease10 Stationary of slight increase 1510

Minitab: Stat >> Tables >> Chi-Square Goodness-of-Fit Test(One Variable) 22 Example 10.11

23 Contingency Table(Example 10.12) n ij Age Category Severity of skin disease 1234 Total n i* Total n *j = n

24 Contingency Table 2 categorical variables: row and column indexed by i and j, respectively If they are independent, then

25 Test for Independence of 2 Var’s Hypotheses: Ho: the row and column variables are independent Ha: they are dependent Test Statistic:

26 Test for Independence of 2 Var’s Rejection Region: Reject Ho if where df=(r-1)(c-1). Note: This test can be trusted only when 80% of more cells of the are at least 5.

Minitab: Stat >> Tables >> Cross Tabulation and Chi-square Tabulated Statistics: C1, Worksheet columns Rows: C1 Columns: Worksheet columns All A B C All Cell Contents: Count % of Total Expected count Pearson Chi-Square = , DF = 8, P-Value = Likelihood Ratio Chi-Square = , DF = 8, P-Value = Example 10.12