Secondary Data, Measures, Hypothesis Formulation, Chi-Square Market Intelligence Julie Edell Britton Session 3 August 21, 2009.

Presentation transcript:


Today’s Agenda
- Announcements
- Secondary data quality
- Measure types
- Hypothesis Testing and Chi-Square

Announcements
National Insurance Case for Sat. 8/22
- Stephen will do a tutorial today, Friday, 8/21, from 1:00 to 2:15 in the MBA PC Lab and will be available tonight from 7 to 9 pm in the MBA PC Lab to answer questions
- Submit slides by 8:00 am on Sat. 8/22
- 2 slides with your conclusions; you may add appendices to support your conclusions

Primary vs. Secondary Data
- Primary: collected anew for current purposes
- Secondary: exists already, was collected for some other purpose
- Finding Secondary Data Fuqua

Primary vs. Secondary Data

Evaluating Sources of Secondary Data
- If you can’t find the source of a number, don’t use it. Look for further data.
- Always give sources when writing a report.
- This applies to focus group write-ups too.
- Be skeptical.

Secondary Data: Pros & Cons
Advantages
- cheap
- quick
- often sufficient
- there is a lot of data out there
Disadvantages
- there is a lot of data out there
- numbers sometimes conflict
- categories may not fit your needs

Types of Secondary Data *IRI = Information Resources, Inc. (

Secondary Data Quality: KAD p. 120 & “What’s Behind the Numbers?”
- Data consistent with other independent sources?
- What are the classifications? Do they fit needs?
- When were numbers collected? Obsolete?
- Who collected the numbers? Bias, resources?
- Why were the data collected? Self-interest?
- How were the numbers generated?
    - Sample size
    - Sampling method
    - Measure type
    - Causality (MBA Marketing Timing & Internship)

It is Hard to Infer Causality from Secondary Data

Took Core Marketing   Got Desired Marketing Internship   Did Not Get Desired Marketing Internship
Term 1                76%                                24%
Term 3                51%                                49%

Today’s Agenda
- Announcements
- Secondary data quality
- Measure types
- Hypothesis Testing and Chi-Square

Measure Types
- Nominal: unordered categories
    Male = 1; Female = 2
- Ordinal: ordered categories; intervals can’t be assumed to be equal
    I-95 is east of I-85; I-80 is north of I-40; preference data
- Interval: equally spaced categories; 0 is arbitrary and units are arbitrary
    Fahrenheit temperature (each degree is equal); attitudes
- Ratio: equally spaced categories; 0 on the scale means 0 of the underlying quantity
    $ sales, market share

Meaningful Statistics & Permissible Transformations

Means and Medians with Ordinal Data

Gender   Measure 1   Measure 2
M         1            1
M         2            2
F         3            3
F         4            4
F         5            5
F         6            6
M         7          107
M         8          108
M         9          109
F        10          110

Means
  Measure 1: M = 5.4 < F = 5.6
  Measure 2: M = 65.4 > F = 25.6
Medians
  Measure 1: M = 7 > F = 5
  Measure 2: M = 107 > F = 5
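The comparison on this slide can be reproduced with a short Python sketch. It uses the ten rows from the table; the helper name `summarize` is made up for illustration. Measure 2 ranks every respondent in the same order as Measure 1, so the two are interchangeable as ordinal data. The medians keep the same direction under both measures, but the comparison of means flips.

```python
from statistics import mean, median

# Rows from the slide: (gender, measure_1, measure_2).
rows = [
    ("M", 1, 1), ("M", 2, 2), ("F", 3, 3), ("F", 4, 4), ("F", 5, 5),
    ("F", 6, 6), ("M", 7, 107), ("M", 8, 108), ("M", 9, 109), ("F", 10, 110),
]

def summarize(stat, column):
    """Apply `stat` (mean or median) to one measure, split by gender."""
    return {
        g: stat([r[column] for r in rows if r[0] == g])
        for g in ("M", "F")
    }

# Check that both measures put the respondents in the same rank order.
order_1 = [r[1] for r in sorted(rows, key=lambda r: r[1])]
order_2 = [r[1] for r in sorted(rows, key=lambda r: r[2])]
same_order = order_1 == order_2

means_1, means_2 = summarize(mean, 1), summarize(mean, 2)
medians_1, medians_2 = summarize(median, 1), summarize(median, 2)

print("same rank order:", same_order)
print("means   measure 1:", means_1, "| measure 2:", means_2)
print("medians measure 1:", medians_1, "| measure 2:", medians_2)
```

Because only the ordering is meaningful for ordinal data, a statistic whose conclusion depends on the spacing between values (the mean) is not a safe summary here, while the median is.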

Ratio Scales & Index Numbers

Today’s Agenda
- Announcements
- Southwestern Conquistador Beer Case
- Backward Market Research
- Secondary data quality
- Measure types
- Hypothesis Testing and Chi-Square

Cross Tabs of MBA Acceptance by Gender A. Raw FrequenciesB. Cell Percentages

C. Row Percentages D. Column Percentages

Rule of Thumb  If a potential causal interpretation exists, make numbers add up to 100% at each level of the causal factor.  Above: it is possible that gender (row) causes or influences acceptance (column), but not that acceptance influences gender. Hence, row percentages (format C) would be desirable.
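A minimal Python sketch of casting percentages by row, as the rule of thumb suggests. The crosstab panels did not survive in this transcript, so the counts below are hypothetical: they are chosen only to be consistent with the 1,800 applicants and the 60 accepted women that appear in the chi-square slides. The function name `row_percentages` is likewise an illustration.

```python
# Hypothetical counts (assumption, not taken from the crosstab panels).
counts = {
    "M": {"accept": 140, "reject": 860},
    "F": {"accept": 60, "reject": 740},
}

def row_percentages(table):
    """Return percentages that sum to 100 within each row (each gender)."""
    result = {}
    for row, cells in table.items():
        total = sum(cells.values())
        result[row] = {col: 100 * n / total for col, n in cells.items()}
    return result

row_pct = row_percentages(counts)
for gender, cells in row_pct.items():
    print(gender, {col: round(p, 1) for col, p in cells.items()})
```

With percentages adding to 100% within each gender, the acceptance rates for men and women can be compared directly, which is the comparison the causal question calls for.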

Hypothesis Formulation and Testing
Hypothesis: what you believe the relationship is between the measures.
Based on: theory, empirical evidence, beliefs, experience.
Here: we believe that acceptance is related to gender.
Null hypothesis: acceptance is not related to gender.
Logic of hypothesis testing: negative inference. The null hypothesis is rejected by showing that a given observation would be quite improbable if the null hypothesis were true. We want to see whether we can reject the null.

Steps in Hypothesis Testing
1. State the hypothesis in null and alternative form
   – Ho: There is no relationship between gender and MBA acceptance
   – Ha1: Gender and acceptance are related (2-sided)
   – Ha2: Fewer women are accepted (1-sided)
2. Choose a test statistic
3. Construct a decision rule

Chi-Square Test
- Used for nominal data, to compare the observed frequency of responses to what would be “expected” under the null hypothesis.
- Two types of tests
    - Contingency (or relationship): tests if the variables are independent, i.e., no significant relationship exists between the two variables
    - Goodness of fit: compares whether the data sampled is proportionate to some standard

Chi-Square Test

$$\chi^2 = \sum_{i=1}^{k} \frac{(O_i - E_i)^2}{E_i}$$

with (r-1)*(c-1) degrees of freedom, where
  O_i = observed number in cell i
  E_i = expected number in cell i under independence = column proportion * row proportion * total number observed
  k = number of cells
  r = number of rows
  c = number of columns

MBA Acceptance Data: Contingency
A. Observed Frequencies   B. Cell Percentages

C. Expected Frequencies
     Accept                    Reject
M    .111*.556*1800 = 111      .889*.556*1800 = 890
F    .111*.444*1800 = 89       .889*.444*1800 = 710

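Chi-Square Test
With (r-1)*(c-1) degrees of freedom, and writing O_1, O_2, O_4 for the observed counts from panel A that did not survive in this transcript:

$$\chi^2 = \frac{(O_1 - 111)^2}{111} + \frac{(O_2 - 890)^2}{890} + \frac{(60 - 89)^2}{89} + \frac{(O_4 - 710)^2}{710} = 19.3$$

So?
3. Construct a decision rule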
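The contingency calculation can be sketched in Python as follows. Only one observed count (60 accepted women) survives in the transcript; the other three are assumptions, back-calculated from the stated proportions (.111 accepted, .556 male) and the total of 1,800. The function names are illustrative.

```python
# Observed counts: rows are M, F; columns are accept, reject.
# Only the 60 is in the transcript; 140, 860 and 740 are assumed.
observed = [[140, 860], [60, 740]]

def expected_under_independence(obs):
    """Expected count per cell = row total * column total / grand total."""
    n = sum(sum(row) for row in obs)
    row_totals = [sum(row) for row in obs]
    col_totals = [sum(col) for col in zip(*obs)]
    return [[rt * ct / n for ct in col_totals] for rt in row_totals]

def chi_square(obs, exp):
    """Sum of (observed - expected)^2 / expected over all cells."""
    return sum(
        (o - e) ** 2 / e
        for o_row, e_row in zip(obs, exp)
        for o, e in zip(o_row, e_row)
    )

expected = expected_under_independence(observed)
stat_exact = chi_square(observed, expected)

# The slide works with expected counts rounded to whole numbers.
expected_rounded = [[111, 890], [89, 710]]
stat_slide = chi_square(observed, expected_rounded)

df = (len(observed) - 1) * (len(observed[0]) - 1)
print("expected:", [[round(e, 1) for e in row] for row in expected])
print("chi-square (unrounded expected):", round(stat_exact, 2))
print("chi-square (slide's rounded expected):", round(stat_slide, 1))
print("degrees of freedom:", df)
```

Under these assumed counts, the rounded expected values reproduce the slide's 19.3; using unrounded expected values gives a slightly smaller statistic, which does not change the conclusion.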

Decision Rule
1. Significance level (α): the probability of rejecting the null hypothesis when it is true.
2. Degrees of freedom: the number of unconstrained data used in calculating a test statistic. For chi-square it is (r-1)*(c-1), so here that would be 1. When the number of cells is larger, we need a larger test statistic to reject the null.
3. Two-tailed or one-tailed test: significance tables are (unless otherwise specified) two-tailed tables. Chi-sq is on pg 517.
   – Ha1: Gender and acceptance are related (2-sided). Critical value = 3.84
   – Ha2: Fewer women are accepted (1-sided). Critical value = 2.71
Decision rule: Reject Ho if the calculated chi-sq value (19.3) > the test critical value: 3.84 for Ha1 or 2.71 for Ha2.
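A short Python sketch of this decision rule. The critical values 3.84 and 2.71 are taken from the slide. The tail-area helper relies on a standard fact rather than anything in the deck: with 1 degree of freedom a chi-square variable is a squared standard normal, so its upper-tail probability is `erfc(sqrt(x / 2))`. The function names are illustrative.

```python
import math

def upper_tail_df1(x):
    """P(chi-square with 1 df > x), using the squared-normal identity."""
    return math.erfc(math.sqrt(x / 2))

def reject_null(statistic, critical_value):
    """Decision rule from the slide: reject Ho if statistic > critical value."""
    return statistic > critical_value

chi_sq = 19.3  # calculated value from the slide
critical_values = {"Ha1 (2-sided)": 3.84, "Ha2 (1-sided)": 2.71}

decisions = {
    name: reject_null(chi_sq, crit) for name, crit in critical_values.items()
}
for name, crit in critical_values.items():
    print(name, "| critical value", crit,
          "| table tail area ~", round(upper_tail_df1(crit), 3),
          "| reject Ho:", decisions[name])
```

The tail areas come out near .05 for 3.84 and near .10 for 2.71, which suggests the one-sided test is read from the .10 column of the table. Either way, 19.3 is far beyond both critical values.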

Chi-Square Table

Chi-Square Test
- Used for nominal data, to compare the observed frequency of responses to what would be “expected” under some specific null hypothesis.
- Two types of tests
    - Contingency (or relationship): tests if the variables are independent, i.e., no significant relationship exists
    - Goodness of fit: compares whether the data sampled is proportionate to some standard

Goodness of Fit: Chi-Square
Ho: Car color preferences have not shifted
Ha: Car color preferences have shifted

Color     Data   Historic Distribution   Expected # = Prob*n
Red        680   30%                     750
Green      520   25%                     625
Black      675   25%                     625
White      625   20%                     500
Tot (n)   2500

Do we observe what we expected?

Chi-Square Test
With (k-1) degrees of freedom:

$$\chi^2 = \frac{(680 - 750)^2}{750} + \frac{(520 - 625)^2}{625} + \frac{(675 - 625)^2}{625} + \frac{(625 - 500)^2}{500} = 59.42$$

So?
3. Construct a decision rule

Decision Rule
1. Significance level (α): the probability of rejecting the null hypothesis when it is true.
2. Degrees of freedom: the number of unconstrained data used in calculating a test statistic. For chi-square goodness of fit it is (k-1), so here that would be 3. When the number of cells is larger, we need a larger test statistic to reject the null.
3. Two-tailed or one-tailed test: significance tables are (unless otherwise specified) two-tailed tables. Chi-sq is on pg 517.
   – Ha: Preferences have changed (2-sided). Critical value = 7.81
Decision rule: Reject Ho if the calculated chi-sq value (59.42) > the test critical value (7.81).
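The goodness-of-fit test on the car color data can be sketched in Python. The observed counts, historic proportions, and the 7.81 critical value all come from the slides; the variable names are illustrative.

```python
# Observed counts and historic distribution from the slide.
observed = {"Red": 680, "Green": 520, "Black": 675, "White": 625}
historic = {"Red": 0.30, "Green": 0.25, "Black": 0.25, "White": 0.20}

n = sum(observed.values())

# Expected count per color = historic proportion * sample size.
expected = {color: historic[color] * n for color in observed}

# Chi-square statistic: sum of (observed - expected)^2 / expected.
chi_sq = sum(
    (observed[color] - expected[color]) ** 2 / expected[color]
    for color in observed
)

df = len(observed) - 1       # k - 1 categories
CRITICAL_VALUE = 7.81        # table value quoted on the slide for 3 df
reject = chi_sq > CRITICAL_VALUE

print("n:", n)
print("expected:", {c: round(e) for c, e in expected.items()})
print("chi-square:", round(chi_sq, 2), "| df:", df, "| reject Ho:", reject)
```

Most of the statistic comes from White (observed well above expected) and Green (well below), so the shift in preferences is concentrated in those two colors.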

Chi-Square Table

Recap
- Finding & Evaluating Secondary Data
- Measure Types
    - Permissible transformations
    - Meaningful statistics
    - Index #s
- Crosstabs
    - Casting right direction
- Chi-square statistic
    - Contingency Test
    - Goodness of Fit Test