Degrees of Freedom COM 631/731.

Slides:



Advertisements
Similar presentations
What is Chi-Square? Used to examine differences in the distributions of nominal data A mathematical comparison between expected frequencies and observed.
Advertisements

CHI-SQUARE(X2) DISTRIBUTION
SPSS Session 5: Association between Nominal Variables Using Chi-Square Statistic.
Hypothesis Testing IV Chi Square.
Chapter 13: The Chi-Square Test
PSY 340 Statistics for the Social Sciences Chi-Squared Test of Independence Statistics for the Social Sciences Psychology 340 Spring 2010.
Crosstabs and Chi Squares Computer Applications in Psychology.
Chi-square notes. What is a Chi-test used for? Pronounced like kite, not like cheese! This test is used to check if the difference between expected and.
The Chi-square Statistic. Goodness of fit 0 This test is used to decide whether there is any difference between the observed (experimental) value and.
Statistics for the Behavioral Sciences (5 th ed.) Gravetter & Wallnau Chapter 17 The Chi-Square Statistic: Tests for Goodness of Fit and Independence University.
HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Section 10.7.
Chi-square (χ 2 ) Fenster Chi-Square Chi-Square χ 2 Chi-Square χ 2 Tests of Statistical Significance for Nominal Level Data (Note: can also be used for.
Chapter 9: Non-parametric Tests n Parametric vs Non-parametric n Chi-Square –1 way –2 way.
Data Analysis for Two-Way Tables. The Basics Two-way table of counts Organizes data about 2 categorical variables Row variables run across the table Column.
Chi Square Classifying yourself as studious or not. YesNoTotal Are they significantly different? YesNoTotal Read ahead Yes.
Section 10.2 Independence. Section 10.2 Objectives Use a chi-square distribution to test whether two variables are independent Use a contingency table.
Reasoning in Psychology Using Statistics Psychology
Statistics for Business and Economics 8 th Edition Chapter 7 Estimation: Single Population Copyright © 2013 Pearson Education, Inc. Publishing as Prentice.
Chapter 11: Chi-Square  Chi-Square as a Statistical Test  Statistical Independence  Hypothesis Testing with Chi-Square The Assumptions Stating the Research.
July, 2000Guang Jin Statistics in Applied Science and Technology Chapter 12. The Chi-Square Test.
ContentFurther guidance  Hypothesis testing involves making a conjecture (assumption) about some facet of our world, collecting data from a sample,
Chapter 14 – 1 Chi-Square Chi-Square as a Statistical Test Statistical Independence Hypothesis Testing with Chi-Square The Assumptions Stating the Research.
Section 10.2 Objectives Use a contingency table to find expected frequencies Use a chi-square distribution to test whether two variables are independent.
Chi Square Test of Homogeneity. Are the different types of M&M’s distributed the same across the different colors? PlainPeanutPeanut Butter Crispy Brown7447.
Test of independence: Contingency Table
Chapter 9: Non-parametric Tests
Lecture #19 Tuesday, October 25, 2016 Textbook: Sections 12.3 to 12.4
Effect Sizes (continued)
Chapter Fifteen McGraw-Hill/Irwin
Hypothesis Testing Review
Chapter 12 Tests with Qualitative Data
Qualitative data – tests of association
Chapter 13 (1e), (Ch. 11 2/3e) Association Between Variables Measured at the Nominal Level: Phi, Cramer’s V, and Lambda.
Goodness of Fit Tests The goal of goodness of fit tests is to test if the data comes from a certain distribution. There are various situations to which.
Goodness of Fit Tests The goal of χ2 goodness of fit tests is to test is the data comes from a certain distribution. There are various situations to which.
What goes in a results section?
Data Analysis for Two-Way Tables
Section 6-4 – Confidence Intervals for the Population Variance and Standard Deviation Estimating Population Parameters.
Chi-Square Test.
The Chi-Square Distribution and Test for Independence
Is a persons’ size related to if they were bullied
Consider this table: The Χ2 Test of Independence
Chi Square Two-way Tables
X 2 (Chi-Square) ·. one of the most versatile. statistics there is ·
Ch. 15: The Analysis of Frequency Tables
Chi-Square Test.
Are you planning on doing a project in Math 124
Reasoning in Psychology Using Statistics
Data Case Hospitalized? Gender Age Insured Count of Patients 1 Yes
Chapter 6 Confidence Intervals.
Frequency Tables Statistics 2126.
Extra Brownie Points! Lottery To Win: choose the 5 winnings numbers from 1 to 49 AND Choose the "Powerball" number from 1 to 42 What is the probability.
Inference on Categorical Data
Chi-Square Test.
Contingency tables and goodness of fit
Lecture 37 Section 14.4 Wed, Nov 29, 2006
CHI SQUARE TEST OF INDEPENDENCE
Calculating the 1-way χ2 and 2-way χ2 Statistics
Reasoning in Psychology Using Statistics
Comparing Two Variables
Copyright © Cengage Learning. All rights reserved.
Inference for Two Way Tables
Inferential Statistical Tests
Quadrat sampling & the Chi-squared test
Contingency Tables (cross tabs)
Quadrat sampling & the Chi-squared test
Chi Square Test of Homogeneity
Mortality Analysis.
What is Chi-Square and its used in Hypothesis? Kinza malik 1.
MATH 2311 Section 8.6.
Presentation transcript:

Degrees of Freedom COM 631/731

The Freedom To Vary First, forget about statistics. Imagine you’re a person who loves to wear hats. You couldn't care less what a degree of freedom is. You believe that variety is the spice of life. Unfortunately, you have constraints. You have only 7 hats. Yet you want to wear a different hat every day of the week.

On the first day, you can wear any of the 7 hats On the first day, you can wear any of the 7 hats. On the second day, you can choose from the 6 remaining hats, on day 3 you can choose from 5 hats, and so on. When day 6 rolls around, you still have a choice between 2 hats that you haven’t worn yet that week. But after you choose your hat for day 6, you have no choice for the hat that you wear on Day 7. You must wear the one remaining hat. You had 7-1 = 6 days of “hat” freedom—in which the hat you wore could vary!

That’s kind of the idea behind degrees of freedom in statistics That’s kind of the idea behind degrees of freedom in statistics. Degrees of freedom are often broadly defined as the number of "observations" (pieces of information) in the data that are free to vary when estimating statistical parameters.

Degrees of Freedom for a Mean Imagine you have measured number of cats a person owns. You have done this with 5 people. What are your degrees of freedom?

We know the total, and the n, so we know the mean (2.0). Case ID Number of Cats Person 1 Person 2 Person 3 Person 4 Person 5 TOTAL CATS 10 MEAN CATS PER PERSON 10/5 = 2 We know the total, and the n, so we know the mean (2.0). If we know the total, do we have “freedom” for case 1?

Yes. Let‘s say person 1 has 2 cats. Case ID Number of Cats Person 1 2 Person 2 Person 3 Person 4 Person 5 TOTAL CATS 10 MEAN CATS PER PERSON 10/5 = 2 Yes. Let‘s say person 1 has 2 cats. Now, do we have freedom for case 2?

Yes. Let‘s say person 2 has no cats. Case ID Number of Cats Person 1 2 Person 2 Person 3 Person 4 Person 5 TOTAL CATS 10 MEAN CATS PER PERSON 10/5 = 2 Yes. Let‘s say person 2 has no cats. Now, do we have freedom for case 3?

Yes. Let‘s say person 3 has 1 cat. Now, do we have freedom for case 4? Case ID Number of Cats Person 1 2 Person 2 Person 3 1 Person 4 Person 5 TOTAL CATS 10 MEAN CATS PER PERSON 10/5 = 2 Yes. Let‘s say person 3 has 1 cat. Now, do we have freedom for case 4?

Yes. Let‘s say person 4 has 4 cats. Case ID Number of Cats Person 1 2 Person 2 Person 3 1 Person 4 4 Person 5 TOTAL CATS 10 MEAN CATS PER PERSON 10/5 = 2 Yes. Let‘s say person 4 has 4 cats. Now, do we have freedom for case 5?

No. Person 5 must have 3 cats. Case ID Number of Cats Person 1 2 Person 2 Person 3 1 Person 4 4 Person 5 3 (NO FREEDOM) TOTAL CATS 10 MEAN CATS PER PERSON 10/5 = 2 No. Person 5 must have 3 cats. Thus, for n = 5, we had 4 degrees of freedom. For the calculation of a mean, df is n – 1.

Degrees of Freedom for a Chi-Square Nuclear family Single parent Blended family TOTAL Permissive 20 Authoritarian 65 Authoritative 15 30 45 25 100 Imagine we have two nominal variables, parenting style and family type. And imagine that the marginal frequencies are as shown above.

Do we have freedom for the cell marked ??? above? Nuclear family Single parent Blended family TOTAL Permissive ??? 20 Authoritarian 60 Authoritative 30 40 100 Do we have freedom for the cell marked ??? above?

Now, do we have freedom for the cell marked ??? above? Nuclear family Single parent Blended family TOTAL Permissive 10 ??? 20 Authoritarian 65 Authoritative 15 30 45 25 100 Yes. Let’s say there are 10 people who have a Nuclear family and practice Permissive parenting. Now, do we have freedom for the cell marked ??? above?

Now, do we have freedom for the cell marked ??? above? Nuclear family Single parent Blended family TOTAL Permissive 10 5 ??? 20 Authoritarian 65 Authoritative 15 30 45 25 100 Yes. Let’s say there are 5 people who are a Single parent and practice Permissive parenting. Now, do we have freedom for the cell marked ??? above?

No. That cell must have 5 cases, because of the marginal total of 20. Nuclear family Single parent Blended family TOTAL Permissive 10 5 5 (NO FREEDOM) 20 Authoritarian ??? 65 Authoritative 15 30 45 25 100 No. That cell must have 5 cases, because of the marginal total of 20. Now, do we have freedom for the cell marked ??? above?

Now, do we have freedom for the cell marked ??? above? Nuclear family Single parent Blended family TOTAL Permissive 10 5 5 (NO FREEDOM) 20 Authoritarian ??? 65 Authoritative 15 30 45 25 100 Yes. Let’s say there are 10 cases that have a Nuclear family and practice Authoritarian parenting. Now, do we have freedom for the cell marked ??? above?

Now, do we have freedom for the cell marked ??? above? Nuclear family Single parent Blended family TOTAL Permissive 10 5 5 (NO FREEDOM) 20 Authoritarian 40 ??? 65 Authoritative 15 30 45 25 100 Yes. Let’s say there are 40 cases that are a Single parent and practice Authoritarian parenting. Now, do we have freedom for the cell marked ??? above?

No. That cell must have 15 cases, because of the marginal total of 65. Nuclear family Single parent Blended family TOTAL Permissive 10 5 5 (NO FREEDOM) 20 Authoritarian 40 15 (NO FREEDOM) 65 Authoritative ??? 15 30 45 25 100 No. That cell must have 15 cases, because of the marginal total of 65. Now, do we have freedom for the cell marked ??? above?

No. That cell must have 10 cases because of the marginal total of 30. Nuclear family Single parent Blended family TOTAL Permissive 10 5 5 (NO FREEDOM) 20 Authoritarian 40 15 (NO FREEDOM) 65 Authoritative 10 (NO FREEDOM) ??? 15 30 45 25 100 No. That cell must have 10 cases because of the marginal total of 30. Do we have freedom for the cell marked ??? above?

No. That cell must have 0 cases because of the marginal total of 45. Nuclear family Single parent Blended family TOTAL Permissive 10 5 5 (NO FREEDOM) 20 Authoritarian 40 15 (NO FREEDOM) 65 Authoritative 10 (NO FREEDOM) 0 (NO FREEDOM) ??? 15 30 45 25 100 No. That cell must have 0 cases because of the marginal total of 45. Finally, do we have freedom for the cell marked ??? above?

So how many degrees of freedom does this chi-square table have? Nuclear family Single parent Blended family TOTAL Permissive 10 5 5 (NO FREEDOM) 20 Authoritarian 40 15 (NO FREEDOM) 65 Authoritative 10 (NO FREEDOM) 0 (NO FREEDOM 15 30 45 25 100 No. That cell must have 5 cases because of the marginal total of 25, and also because of the marginal total of 15. So how many degrees of freedom does this chi-square table have?

The yellow cells were “free”...thus, we have 4 degrees of freedom. Nuclear family Single parent Blended family TOTAL Permissive 10 5 5 (NO FREEDOM) 20 Authoritarian 40 15 (NO FREEDOM) 65 Authoritative 10 (NO FREEDOM) 0 (NO FREEDOM 15 30 45 25 100 The yellow cells were “free”...thus, we have 4 degrees of freedom. For a chi-square test, degrees of freedom are (r – 1) x (c – 1) where r = number of rows and c = number of columns

A B C D TOTAL E F G H I So for a 5 by 4 chi-square table, the df is (5 – 1) x (4 – 1) = 4 x 3 = 12.

end