Inferential Statistics 3: The Chi Square Test

Slides:



Advertisements
Similar presentations
Advanced Higher Geography
Advertisements

Inferential Statistics 3: The Chi Square Test Advanced Higher Geography Statistics Ollie Bray – Knox Academy, East Lothian.
Inferential Statistics 3: The Chi Square Test
What is Chi-Square? Used to examine differences in the distributions of nominal data A mathematical comparison between expected frequencies and observed.
CHI-SQUARE(X2) DISTRIBUTION
Finish Anova And then Chi- Square. Fcrit Table A-5: 4 pages of values Left-hand column: df denominator df for MSW = n-k where k is the number of groups.
Chi-Square Test Chi-square is a statistical test commonly used to compare observed data with data we would expect to obtain according to a specific hypothesis.
Chi Square Example A researcher wants to determine if there is a relationship between gender and the type of training received. The gender question is.
Hypothesis Testing IV Chi Square.
Chapter 13: The Chi-Square Test
PSY 340 Statistics for the Social Sciences Chi-Squared Test of Independence Statistics for the Social Sciences Psychology 340 Spring 2010.
Hypothesis Testing - the scientists' moral imperative moral imperative moral imperative To tell whether our data supports or rejects our ideas, we use.
1 Psych 5500/6500 Chi-Square (Part Two) Test for Association Fall, 2008.
Phi Coefficient Example A researcher wishes to determine if a significant relationship exists between the gender of the worker and if they experience pain.
Two Variable Statistics
Chi-square (χ 2 ) Fenster Chi-Square Chi-Square χ 2 Chi-Square χ 2 Tests of Statistical Significance for Nominal Level Data (Note: can also be used for.
Chi-Square. All the tests we’ve learned so far assume that our data is normally distributed z-test t-test We test hypotheses about parameters of these.
Chi-Square X 2. Parking lot exercise Graph the distribution of car values for each parking lot Fill in the frequency and percentage tables.
Chi Square Classifying yourself as studious or not. YesNoTotal Are they significantly different? YesNoTotal Read ahead Yes.
© Copyright McGraw-Hill CHAPTER 11 Other Chi-Square Tests.
Chapter 11: Chi-Square  Chi-Square as a Statistical Test  Statistical Independence  Hypothesis Testing with Chi-Square The Assumptions Stating the Research.
STATISTICS Advanced Higher Chi-squared test. Advanced Higher STATISTICS Chi-squared test Finding if there is a significant association between sets of.
Week 13a Making Inferences, Part III t and chi-square tests.
ContentFurther guidance  Hypothesis testing involves making a conjecture (assumption) about some facet of our world, collecting data from a sample,
Chi Squared Statistical test used to see if the results of an experiment support a theory or to check that categorical data is independent of each other.
Chapter 14 – 1 Chi-Square Chi-Square as a Statistical Test Statistical Independence Hypothesis Testing with Chi-Square The Assumptions Stating the Research.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial.
Chi-squared Association Index. What does it do? Looks for “links” between two factors  Do dandelions and plantains tend to grow together?  Does the.
Class Seven Turn In: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 For Class Eight: Chapter 20: 18, 20, 24 Chapter 22: 34, 36 Read Chapters 23 &
Basic Statistics The Chi Square Test of Independence.
Geog4B The Chi Square Test.
Statistical Analysis: Chi Square
CHI-SQUARE(X2) DISTRIBUTION
Chi-Square Test.
Test of independence: Contingency Table
Chapter 9: Non-parametric Tests
Lecture Slides Elementary Statistics Twelfth Edition
Chapter Fifteen McGraw-Hill/Irwin
Hypothesis Testing Review
Chi-squared Distribution
Hypothesis Testing Using the Chi Square (χ2) Distribution
Inferential Statistics
Chapter 11 Goodness-of-Fit and Contingency Tables
Chi-Square Test.
Is a persons’ size related to if they were bullied
Consider this table: The Χ2 Test of Independence
Chi Square Two-way Tables
Chi-Square Test.
Reasoning in Psychology Using Statistics
Chapter 11: Inference for Distributions of Categorical Data
Chapter 10 Analyzing the Association Between Categorical Variables
Contingency Tables (cross tabs)
Contingency Tables: Independence and Homogeneity
Overview and Chi-Square
Chi-Square Test.
Chi-squared Association Index
Lecture 42 Section 14.4 Wed, Apr 17, 2007
Lecture 37 Section 14.4 Wed, Nov 29, 2006
Chi – square Dr. Anshul Singh Thapa.
The 2 (chi-squared) test for independence
12.2 The Chi-Square Distribution
Lecture 43 Sections 14.4 – 14.5 Mon, Nov 26, 2007
11E The Chi-Square Test of Independence
Inference for Two Way Tables
Skills 5. Skills 5 Standard deviation What is it used for? This statistical test is used for measuring the degree of dispersion. It is another way.
Chi-Square Test A fundamental problem in Science is determining whether the experiment data fits the results expected. How can you tell if an observed.
Quadrat sampling & the Chi-squared test
Quadrat sampling & the Chi-squared test
Analysis of two-way tables
What is Chi-Square and its used in Hypothesis? Kinza malik 1.
Presentation transcript:

Inferential Statistics 3: The Chi Square Test Advanced Higher Geography Statistics Inferential Statistics 3: The Chi Square Test

Introduction (1) We often have occasions to make comparisons between two characteristics of something to see if they are linked or related to each other. One way to do this is to work out what we would expect to find if there was no relationship between them (the usual null hypothesis) and what we actually observe.

Introduction (2) The test we use to measure the differences between what is observed and what is expected according to an assumed hypothesis is called the chi-square test.

For Example Some null hypotheses may be: ‘there is no relationship between the height of the land and the vegetation cover’. ‘there is no difference in the location of superstores and small grocers shops’ ‘there is no connection between the size of farm and the type of farm’

Important The chi square test can only be used on data that has the following characteristics: The frequency data must have a precise numerical value and must be organised into categories or groups. The data must be in the form of frequencies The expected frequency in any one cell of the table must be greater than 5. The total number of observations must be greater than 20.

Formula χ 2 = ∑ (O – E)2 E χ2 = The value of chi square O = The observed value E = The expected value ∑ (O – E)2 = all the values of (O – E) squared then added together

Worked Example Write down the NULL HYPOTHESIS and ALTERNATIVE HYPOTHESIS and set the LEVEL OF SIGNIFICANCE. NH ‘ there is no difference in the distribution of old established industries and food processing industries in the postal district of Leicester’ AH ‘There is a difference in the distribution of old established industries and food processing industries in the postal district of Leicester’ We will set the level of significance at 0.05.

Table Time! Observed Frequencies (O) Construct a table with the information you have observed or obtained. Observed Frequencies (O) Post Codes LE1 LE2 LE3 LE4 LE5 & LE6 Row Total Old Industry 9 13 10 8 50 Food Industry 4 3 5 21 42 Column Total 16 15 19 29 92 (Note: that although there are 3 cells in the table that are not greater than 5, these are observed frequencies. It is only the expected frequencies that have to be greater than 5.)

Now Work out the expected frequency. Expected frequency = row total x column total Grand total Eg: expected frequency for old industry in LE1 = (50 x 13) / 92 = 7.07 Post Codes LE1 LE2 LE3 LE4 LE5 & LE6 Row Total Old Industry 7.07 Food Industry Column Total

Check your answers Post Codes LE1 LE2 LE3 LE4 LE5 & LE6 Row Total Old Industry 7.07 8.70 8.15 10.33 15.76 50 Food Industry 5.93 7.30 6.85 8.67 13.24 42 Column Total 13 16 15 19 29 92

Now (O – E)2 E For each of the cells calculate. Eg: Old industry in LE1 is (9 – 7.07)2 / 7.07 = 0.53 Post Codes LE1 LE2 LE3 LE4 LE5 & LE6 Row Total Old Industry 0.53 Food Industry Column Total

Check your answers Then Post Codes LE1 LE2 LE3 LE4 LE5 &L E6 Old Industry 0.53 2.13 0.42 0.01 3.82 Food Industry 0.63 2.54 0.50 4.55 Then Add up all of the above numbers to obtain the value for chi square: χ2 = 15.14.

Finally Look up the significance tables. These will tell you whether to accept the null hypothesis or reject it. The number of degrees of freedom to use is: the number of rows in the table minus 1, multiplied by the number of columns minus 1. This is (2-1) x (5-1) = 1 x 4 = 4 degrees of freedom. We find that our answer of 15.14 is greater than the critical value of 9.49 (for 4 degrees of freedom and a significance level of 0.05) and so we reject the null hypothesis.

Now you have to look for geographical factors to explain your findings In other words ‘The distribution of old established industry and food processing industries in Leicester is significantly different.’ The hard bit! Now you have to look for geographical factors to explain your findings

Your Turn Read page 46, 47 and 48 of ‘Geographical Measurements and Techniques: Statistical Awareness, by LT Scotland, June 2000. Answer Task 1 on page 48.