Presentation is loading. Please wait.

Presentation is loading. Please wait.

CLASS 5 Normal Distribution Hypothesis Testing Test between means of related groups Test between means of different groups Analysis of Variance (ANOVA)

Similar presentations


Presentation on theme: "CLASS 5 Normal Distribution Hypothesis Testing Test between means of related groups Test between means of different groups Analysis of Variance (ANOVA)"— Presentation transcript:

1 CLASS 5 Normal Distribution Hypothesis Testing Test between means of related groups Test between means of different groups Analysis of Variance (ANOVA) CLASS 6 Tutorial 2 Term Project Introduction – survey area; survey data file; variable definition file; instruction sheets (for the term project, and term project analysis report) Give Back Exercise Set 1 Factorial ANOVA Tutorial 3 for Factorial ANOVA Chi Square test (Time Permitting) One sample/way Two factor/way Correlation Coefficient (Time Permitting) URBP 204 A Class 6 Note: the class notes summarize Salkind (2004) Chapters 12, 13 and 15

2 Five Wounds Brookwood Terrace Project One of 20 neighborhood areas identified in the City of San Jose’s Strong Neighborhood Initiative (SNI) Source: http://www.strongneighborhoods.org/strongneighborhoodareas.htm FWBT Neighborhood Plan at: http://www.sanjoseca.gov/planning/sni/fivewounds/

3 Five Wounds Brookwood Terrace Project Source: http://www.sanjoseca.gov/planning/sni/fivewounds/existing.pdf Coyote Creek to its West Lower Silver Creek to North Highway 280 to South

4 Five Wounds Brookwood Terrace Project Source: Chapter 2. FWBT Neighborhood Plan. Population = 20,000 Hispanic – 73.5% (city avg. 32.8%) Asian/ Pacific Islander – 14.5% White/ Non- Hispanic – 7.6% Other – 5% HH Size = 3.5 (city avg. 3.1) Median age lower than the city’s median age of 33.7 years Income Median HH Income - $49,013 (city median HH Income - $73,804) Substantially less than the city median HH Income Education High School Diploma or less (Age greater or equal to 25 years) – 77.3% (city avg. 43.2%) Associate Degree – 18.4% (city avg. 31.5%) Bachelor’ Degree – 7.9% (city avg. 25.3%) Overall less educated than the city-wide population

5 Five Wounds Brookwood Terrace Project Source: Chapter 2. FWBT Neighborhood Plan. Diverse range of housing types Predominantly SF

6 Five Wounds Brookwood Terrace Project Census tract numbers 5014.00, 50515.01 and 5015.02 lie fully within the FWBT area while part of census tract number 5036.01 falls within the FWBT area. Source: Mike Reilly, Instructor URBP 179, Fall 2004

7 Factorial ANOVA Source: Salkind, pg. 215 Note: the class notes summarize Salkind (2004) Chapters 12, 13 and 15

8 Factorial ANOVA More than 2 factors (independent variables) Education level – low education and high education Gender – male and female 4 groups Low educated male Low educated female High educated female High educated male Dependent variable - minutes biked per day (ratio level variable / continuous variable) Test effect of main independent variables + the interaction term Note: the class notes summarize Salkind (2004) Chapters 12, 13 and 15

9 Steps for testing 1. Statement of null and research hypothesis H 0 : Ц loweduc = Ц higheduc H 0 : Ц male = Ц female H 0 : Ц loweduc X male = Ц higheduc x male = Ц higheduc x female = Ц loweduc x female H 1 : X loweduc = X higheduc H1: X male = X female H1: X loweduc X male = X higheduc x male = X higheduc x female = X loweduc x female 2. Set level of risk Level of risk of type I error = 5%, or level of significance = 0.05 3. Selection of appropriate test statistic Choose Factorial ANOVA Note: the class notes summarize Salkind (2004) Chapters 12, 13 and 15

10 4. Compute the F statistic (Obtained Value) Note: the class notes summarize Salkind (2004) Chapters 12, 13 and 15

11 5. Determination of the value needed for rejection of null hypothesis (critical value) See table B3, pg. 361-363 Critical value = 3.99 (see Salkind, p.363) 6. Comparison of obtained and critical value Obtained value more extreme than the critical value (F= 111.115, 43.211, and 11.24, are all greater than 3.99) 7. Decision Reject all the three null hypotheses (null hypotheses - there is NO difference between the mean of groups) Probability is less than 5% on any one test of the null hypothesis that the difference between the mean of groups is due to chance alone. Df for numerator: Number of factors-1 = 2-1 =1 Df for denominator: No of observations- no if groups = 68-4 = 64 Note: the class notes summarize Salkind (2004) Chapters 12, 13 and 15

12 Correlation How the value of one variable changes if the value of the other variable changes. For example, correlation between: Distance from city center and housing price Both variables need to be ratio or interval level. Note: the class notes summarize Salkind (2004) Chapters 12, 13 and 15

13 Correlation Coefficient Q: How do we know of the correlation is statistically significant? A: Test for significance of the correlation coefficient Source: Salkind, p 230 Note: the class notes summarize Salkind (2004) Chapters 12, 13 and 15

14 Steps for testing 1. Statement of null and research hypothesis H 0 : ρ xy = 0 No relationship between variables x and y H 1 : r xy = 0 There is relationship between variables. 2. Set level of risk Level of risk of type I error = 5%, or level of significance = 0.05 3. Selection of appropriate test statistic Choose t-test for the significance of the correlation coefficient Note: the class notes summarize Salkind (2004) Chapters 12, 13 and 15

15 Coefficient of correlation 4. Obtained value For relationship between Density and distance, r = - 0.74 Density and price, r = 0.81 Distance and price, r = - 0.98 Note: the class notes summarize Salkind (2004) Chapters 12, 13 and 15

16 5. Determination of the value needed for rejection of null hypothesis (critical value) See table B4, pg. 365 Critical value = 0. 35 (see Salkind, p.363) 6. Comparison of obtained and critical value Obtained value more extreme than the critical value for following relationships: Density and distance, r = - 0.74 Density and price, r = 0.81 Distance and price, r = - 0.98 7. Decision Reject the null hypothesis (null hypothesis - there is NO relationship between the variables). The relationship is not due to chance alone. Degree of freedom = n-2 = 30-2 = 28 Note: the class notes summarize Salkind (2004) Chapters 12, 13 and 15

17 Non parametric tests When assumption of normality does not hold (small sample size – less than 30 observations) Need ordinal or nominal level data. Note: the class notes summarize Salkind (2004) Chapters 12, 13 and 15

18 One factor/sample chi square Is the distribution of frequencies what you would expect by chance alone? 1. Statement of null and research hypothesis Proportion of occurrence under each category is equal. H 1 : Proportion of occurrence under each category is not equal. 2. Set level of risk Level of risk of type I error = 5%, or level of significance = 0.05 3. Selection of appropriate test statistic Choose chi square test H 0 : P 1 = P 2 = P 3 = P 4 = P 5 P 1 = P 2 = P 3 = P 4 = P 5

19 4. Obtained value Note: the class notes summarize Salkind (2004) Chapters 12, 13 and 15 Note: Should have minimum of 5 observations under each category

20 5. Determination of the value needed for rejection of null hypothesis (critical value) See table B5, pg. 367 Critical value = 9.49 (see Salkind, p.363) 6. Comparison of obtained and critical value Obtained value more extreme than the critical value 7. Decision Reject the null hypothesis (null hypothesis - that the distribution of frequencies is equal). Degrees of freedom = number of categories of data – 1 = 5-1 = 4

21 Two factor/way chi square Explore relationships when both the dependent and the independent variables are nominal or ordinal level, that is “Categorical data” Does the respondents’ age affect their perception about the condition of street lighting? 1. Statement of null and research hypothesis Proportion of occurrence under each category is equal. H 1 : Proportion of occurrence under each category is not equal. 2. Set level of risk Level of risk of type I error = 5%, or level of significance = 0.05 3. Selection of appropriate test statistic Choose chi square test H 0 : P 1 = P 2 = P 3 = P 4 P 1 = P 2 = P 3 = P 4

22 4. Obtained value Note: Should have minimum of 5 observations under each category Observed Frequency Table Expected Frequency Table λ 2 = (6-9.75) 2 / 9.75 + (20-16.25) 2 /16.25 + (5.25-9) 2 /5.25 + (8.75-5) 2 /8.75 = 6.59 9.75 = (26 x 15) / 40 16.25 = (26 x 25) / 40 5.25 = (14 x 15) / 40 8.75 = (14 x 25) / 40

23 5. Determination of the value needed for rejection of null hypothesis (critical value) See table B5, pg. 367 Critical value = 3.84 (see Salkind, p.363) 6. Comparison of obtained and critical value Obtained value more extreme than the critical value 7. Decision Reject the null hypothesis (null hypothesis - that the distribution of frequencies is equal). URBP 204 A Class 6 Degrees of freedom = (row-1) x (column -1) = (2-1) x (2-1) = 1 x 1 = 1


Download ppt "CLASS 5 Normal Distribution Hypothesis Testing Test between means of related groups Test between means of different groups Analysis of Variance (ANOVA)"

Similar presentations


Ads by Google