o We chose to do our project on cars in South's parking lot to see if our parking lot could be considered representative of the entire population of cars.

Slides:



Advertisements
Similar presentations
(Hypothesis test for small sample sizes)
Advertisements

The mystery of the CHI SQUARE Is it CHEE square Or CHAI Square?!
STATISTICS HYPOTHESES TEST (III) Nonparametric Goodness-of-fit (GOF) tests Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering.
Overview of Lecture Parametric vs Non-Parametric Statistical Tests.
Chapter 7 Sampling and Sampling Distributions
Hypothesis Test II: t tests
Statistical Methods Lecture 26
1 T-test for the Mean of a Population: Unknown population standard deviation Here we will focus on two methods of hypothesis testing: the critical value.
Chapter 13: Chi-Square Test
Chapter 16 Goodness-of-Fit Tests and Contingency Tables
Copyright ©2005 Brooks/Cole, a division of Thomson Learning, Inc. Statistical Significance for 2 x 2 Tables Chapter 13.
CB South Student Lunches By: Lindsay, Danielle, Brit.
Chapter 18: The Chi-Square Statistic
Chapter 11: Chi – Square Goodness – of – Fit Tests
Median and Mode Lesson
Chi Squared Tests Hypothesis Tests for Linear Regression
Chi-square test Chi-square test or  2 test. Chi-square test countsUsed to test the counts of categorical data ThreeThree types –Goodness of fit (univariate)
By Josh Spiezle, Emy Chinen, Emily Lopez, Reid Beloff.
CHAPTER 23: Two Categorical Variables: The Chi-Square Test
By: John Marron Nicole Scamuffo
FINAL PROJECT Bliss Brannon, Dan Faulkner, Ryan Dickson.
Chi-square Goodness of Fit Test
1 Nominal Data Greg C Elvers. 2 Parametric Statistics The inferential statistics that we have discussed, such as t and ANOVA, are parametric statistics.
MICHAEL KITA, ANDREW KATSMAN, GEORGE CONNOR. Our project is about the make, model, and style of cars in the Central Bucks parking lot and the parking.
Kayla KrasleyKayla Krasley Mandy SchaferMandy Schafer.
Chapter 13: Inference for Tables – Chi-Square Procedures
Copyright © 2012 Pearson Education. All rights reserved Copyright © 2012 Pearson Education. All rights reserved. Chapter 15 Inference for Counts:
13.1 Goodness of Fit Test AP Statistics. Chi-Square Distributions The chi-square distributions are a family of distributions that take on only positive.
Section 10.1 Goodness of Fit. Section 10.1 Objectives Use the chi-square distribution to test whether a frequency distribution fits a claimed distribution.
Vending Machine Adventures By: Pat Casey, Dan Cardamone, Heejun Yang.
10.1: Multinomial Experiments Multinomial experiment A probability experiment consisting of a fixed number of trials in which there are more than two possible.
Chi Square Fr Chris - St Francis High School March 2006.
Chapter 11 Inference for Tables: Chi-Square Procedures 11.1 Target Goal:I can compute expected counts, conditional distributions, and contributions to.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 11 Inference for Distributions of Categorical.
Chapter 12: The Analysis of Categorical Data and Goodness- of-Fit Test.
Chi-Square Test James A. Pershing, Ph.D. Indiana University.
Tara Levine, Bridget Sanelli, Madeline Stenken Block 3 AP Statistics.
By: Avni Choksi and Brittany Nguyen
Lecture 11. The chi-square test for goodness of fit.
By.  Are the proportions of colors of each M&M stated by the M&M company true proportions?
Chi-Square Test (χ 2 ) χ – greek symbol “chi”. Chi-Square Test (χ 2 ) When is the Chi-Square Test used? The chi-square test is used to determine whether.
+ Section 11.1 Chi-Square Goodness-of-Fit Tests. + Introduction In the previous chapter, we discussed inference procedures for comparing the proportion.
WARM – UP: The Math club and the Spanish club traditionally are composed of a similar distribution of class level. A random sample of this year’s math.
Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n.
Chi Square Test of Homogeneity. Are the different types of M&M’s distributed the same across the different colors? PlainPeanutPeanut Butter Crispy Brown7447.
Inference for Tables: Chi-Squares procedures (2 more chapters to go!)
Check your understanding: p. 684
CHAPTER 11 Inference for Distributions of Categorical Data
CHAPTER 11 Inference for Distributions of Categorical Data
Chapter Nine Hypothesis Testing.
Ch 26 – Comparing Counts Day 1 - The Chi-Square Distribution
Chi-square test or c2 test
Sizes and parking locations of aircraft
Elementary Statistics: Picturing The World
Chi-Square - Goodness of Fit
Goodness of Fit Test - Chi-Squared Distribution
15.1 Goodness-of-Fit Tests
CHAPTER 11 Inference for Distributions of Categorical Data
CHAPTER 11 Inference for Distributions of Categorical Data
CHAPTER 11 Inference for Distributions of Categorical Data
CHAPTER 11 Inference for Distributions of Categorical Data
CHAPTER 11 Inference for Distributions of Categorical Data
CHAPTER 11 Inference for Distributions of Categorical Data
Chapter 26 Part 2 Comparing Counts.
CHAPTER 11 Inference for Distributions of Categorical Data
CHAPTER 11 Inference for Distributions of Categorical Data
CHAPTER 11 Inference for Distributions of Categorical Data
CHAPTER 11 Inference for Distributions of Categorical Data
Inference for Distributions of Categorical Data
Presentation transcript:

o We chose to do our project on cars in South's parking lot to see if our parking lot could be considered representative of the entire population of cars o We studied many aspects of them like color, size, type, etc. to try and make conclusions about the population o We went outside in the parking lot to explore!

o First self-propelled car o Internal combustion engines developed o First road traffic death o Green cars

o Every 5 cars in CB South parking lot o More variables to decide from Color Type (Car, SUV, Truck, Other) Number of Doors

RedSilverGreenBlueWhiteBlackOtherTotal 2-Door Car Door Car SUV Door Truck Door Truck Door Other Door Other Total

o Chose to this graph to display the overall results in a simple and general form o Majority of vehicles are cars o Few trucks, but may be different in population where more workers use trucks to transport heavy items

Colors o Used this graph because it’s more specific but not too specific o Shows silver is the main color of the population o Most colors have about equal amounts of SUVs

Color and Type o Stacked bar graph o Shows all variables o Silver 4- door cars are dominant o Blue has no 2- door cars, so in the population they must be minimal

o Chi- Square Test: Goodness of Fit o Uniform o Our Sample vs. North America o Chi- Square Test for Association: Color vs. Car Size o One Proportion Z Test for SUVs o One Proportion Z Interval for SUVs

o Assumed (we performed the test) o √ (all expected counts are ) o SRS o Sample size large enough that all expected counts are greater than or equal to 5

Goodness Of Fit

We found a website releasing the car colors of 2006 in North America from the DuPont Annual Color Popularity Report.website We decided to see if this distribution (“overall”) fit the distribution of car colors from CB South’s parking lot. We included light brown and yellow/gold in the “other” category, making “other” 11%. We included silver and gray together, as we had in our study, making “silver” 32%. We also combined white pearl with white, making “white” 19%.

Expected: Red - (.11)(116) = Silver - (.32)(116) = Green - (.04)(116) = 4.64 Blue - (.11)(116) = White – (.19)(116) = Black – (.13)(116) = Other – (.11)(116) = Observed: Red - 15 Silver - 32 Green - 12 Blue - 13 White - 19 Black - 15 Other - 10 H o : The observed frequency distribution of car colors fits the expected. H a : The observed frequency distribution of car colors does not fit the expected. Check Assumed Does not check- Since green’s expected count of 4.64 is close to 5 we proceed State SRS Sample size large enough so all expected counts ≥ 5 …= df=6 α=.05

We reject H o in favor of H a because p-value of.0320 < α =.05. We have sufficient evidence that the observed frequency distribution of car colors does not fit the expected distribution. The category of green seemed to be the category furthest off. We suspected that if green were not part of the test, we might have failed to reject. Even though our p-value was still less than.05, it was much higher than our p-value in the previous test, which was We could have improved this test by having a sample size large enough that all expected counts were greater than 5. Perhaps we could have included green in the “other” group in order to avoid this problem, or we could have increased our sample size (every fourth car instead of every fifth, for example). H o : The observed frequency distribution of car colors fits the expected. H a : The observed frequency distribution of car colors does not fit the expected.

State Two Independent SRS Sample size large enough that all expected counts ≥ 5 Check Assumed Does not check- we choose to proceed H o : Car color and size are independent. H a : Car color and size are dependent. RedSilverGreenBlueWhiteBlackOther Small Large …=4.629 df=6 α=.05

Test of Association We fail to reject Ho in favor of Ha because p-value of.5922>α=.05. We have sufficient evidence that car color and size are independent. o Chose to this test to see whether or not there was an association between the size of the car and its color o Thought that it was more likely for a small, sporty car to be a flashy color like red rather than a large truck o Grouped our sample into two categories: o Small, for cars, and large, for SUVs, trucks, and other o Because the samples of SUVs, trucks, and other vehicles were too small to be statistically viable on their own. ***We concluded that there was no association between car color and size. Thus, it is no more or less likely for a large car to be a certain color than a small car. ***

Comparing Size and Color o Displays the amount of cars of each color o Stacked bar graph helps us easily compare the car sizes to the color. For the most part, the number of small cars of each color is greater than the number of large cars because our sample of small cars was so much greater. The exception is blue, where the number of large cars of that color is greater than the number of small cars.

One Proportion Z Test for SUVs H o : p=.3 H a : p<.3 State SRS np ≥10 n(1-p) ≥10 pop ≥ 10n Check Assumed 116×.3 ≥10 116×.7 ≥10 pop ≥1,160 We reject H o in favor of H a because the p-value of is greater than α=.05. We have sufficient evidence that the proportion of SUVs is less than.30. We decided to do this test because we felt that the proportion of SUVs was almost.5, so we used.3 because we knew the proportion wouldn’t be that close to.5. We felt that the proportion of SUVs would be less than.3 because they use a lot of gasoline and the majority of people are trying to become more eco-friendly.

One Proportion Z Interval for SUVs Confidence level of 96% State SRS np ≥10 n(1-p) ≥10 pop ≥ 10n Check Assumed 116×.3 ≥10 116×.7 ≥10 pop ≥1,160 We are 96% confident that the proportion of SUVs lies between and We decided to do interval because we wanted to find out where the actual proportion of SUVs fell. By doing a confidence interval we found that the proportion of SUVs is not that high compared to what we originally thought. People really are becoming more eco-friendly!

o Not every car in the parking lot is there everyday, so we probably missed some o We did not meet all the assumptions, but we continued with the tests o We should have picked the cars by using a software to randomize which cars we looked at by their spot number