1 Power 16
2 Projects
3 Logistics Put the PowerPoint slide show on a high-density floppy disk for a WINTEL machine. Email the slide show as a PowerPoint attachment.
4 Assignments 1. Project choice 2. Data Retrieval 3. Statistical Analysis 4. PowerPoint Presentation 5. Executive Summary 6. Technical Appendix
5 PowerPoint Presentations: Member 4 1. Introduction: Members 1,2, 3 –What –Why –How 2. Executive Summary: Member 5 3. Exploratory Data Analysis: Member 3 4. Descriptive Statistics: Member 3 5. Statistical Analysis: Member 3 6. Conclusions: Members 3 & 5 7. Technical Appendix: Table of Contents, Member 6
6 Executive Summary and Technical Appendix
8 Technical Appendix
–Table of Contents
–Spreadsheet of data used and sources or, if extensive, a subsample of the data
–Descriptive statistics and histograms for the variables in the study
–If time series data, a plot of each variable against time
–If relevant, a plot of the dependent variable vs. each of the explanatory variables
9 Technical Appendix (Cont.)
–Statistical results, for example regression
–Plot of the actual, fitted, and error series, and other diagnostics
–Brief summary of the conclusions and meanings drawn from the exploratory, descriptive, and statistical analysis
10 Slide Show Challenger disaster
11 Challenger Disaster
–Failure of O-rings that sealed grooves on the booster rockets
–Was there any relationship between O-ring failure and temperature?
–Engineers knew that the rubber O-rings hardened and were less flexible at low temperatures
–But was there launch data that showed a problem?
12 Challenger Disaster
–What: Was there a relationship between launch temperature and O-ring failure prior to the Challenger disaster?
–Why: Should the launch have proceeded?
–How: Analyze the relationship between launch temperature and O-ring failure
13 Launches Before Challenger Data –number of o-rings that failed –launch temperature
17 Exploratory Analysis: launches where there was a problem
[Plot: O-ring failures vs. launch temperature]
21 Outline
–ANOVA and Regression
–Nonparametric Statistics
–Goodman Log-Linear Model
22 ANOVA and Regression: One-Way
Salesaj = c(1)*convenience + c(2)*quality + c(3)*price + e
E[Salesaj | convenience=1, quality=0, price=0] = c(1) = mean for city(1)
–c(1) = mean for city(1) (convenience)
–c(2) = mean for city(2) (quality)
–c(3) = mean for city(3) (price)
–Test the null hypothesis that the means are equal, c(1) = c(2) = c(3), using a Wald test
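The claim that the no-intercept dummy regression returns the city means can be checked numerically. The sketch below is only an illustration under stated assumptions: the helper names (`ols`, `dummy_rows`, `group_means`) are hypothetical, the solver is a plain normal-equations routine rather than whatever EViews does internally, and no data from the apple juice study are included.

```python
def ols(X, y):
    """Least-squares coefficients from the normal equations (X'X) b = X'y."""
    k = len(X[0])
    # Augmented matrix [X'X | X'y]
    A = [[sum(row[i] * row[j] for row in X) for j in range(k)]
         + [sum(row[i] * yi for row, yi in zip(X, y))]
         for i in range(k)]
    # Gauss-Jordan elimination with partial pivoting
    for col in range(k):
        pivot = max(range(col, k), key=lambda r: abs(A[r][col]))
        A[col], A[pivot] = A[pivot], A[col]
        p = A[col][col]
        A[col] = [v / p for v in A[col]]
        for r in range(k):
            if r != col:
                f = A[r][col]
                A[r] = [vr - f * vc for vr, vc in zip(A[r], A[col])]
    return [A[i][k] for i in range(k)]


def dummy_rows(groups, levels):
    """One 0/1 column per level and no intercept, as in the slide's first specification."""
    return [[1.0 if g == lev else 0.0 for lev in levels] for g in groups]


def group_means(y, groups, levels):
    """Plain sample mean of y within each level."""
    means = []
    for lev in levels:
        vals = [v for v, g in zip(y, groups) if g == lev]
        means.append(sum(vals) / len(vals))
    return means
```

Calling `ols(dummy_rows(groups, levels), sales)` with levels `["convenience", "quality", "price"]` should return the same three numbers as `group_means(sales, groups, levels)`, which is the sense in which c(1), c(2), and c(3) are the city means.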
One-Way ANOVA and Regression: regression coefficients are the city means; F statistic
24 ANOVA and Regression: One-Way, Alternative Specification
Salesaj = c(1) + c(2)*convenience + c(3)*quality + e
E[Salesaj | convenience=0, quality=0] = c(1) = mean for city(3) (price, the omitted one)
E[Salesaj | convenience=1, quality=0] = c(1) + c(2) = mean for city(1) (convenience)
–c(1) = mean for city(3), the omitted city
–c(2) = mean for city(1) minus mean for city(3)
–Test that the mean for city(1) = mean for city(3) using the t-statistic for c(2)
25 ANOVA and Regression: One-Way, Alternative Specification
Salesaj = c(1) + c(2)*convenience + c(3)*price + e
E[Salesaj | convenience=0, price=0] = c(1) = mean for city(2) (quality, the omitted one)
E[Salesaj | convenience=1, price=0] = c(1) + c(2) = mean for city(1) (convenience)
–c(1) = mean for city(2), the omitted city
–c(2) = mean for city(1) minus mean for city(2)
–Test that the mean for city(1) = mean for city(2) using the t-statistic for c(2)
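In the intercept-plus-dummies form, the intercept is the omitted city's mean and each dummy coefficient is a difference from it, so the t-statistic for c(2) can be sketched without matrix algebra. This is a minimal illustration, assuming the usual OLS result that the residual variance is pooled over all groups; the function name `dummy_t_stat` and its arguments are hypothetical and not taken from the slides.

```python
import math


def dummy_t_stat(y, groups, level, omitted):
    """Return (c1, c2, t) for the dummy on `level` when `omitted` is the base group.

    c1 is the mean of the omitted group, c2 is the difference in means, and t
    uses the residual variance pooled over every group in the regression.
    """
    levels = sorted(set(groups))
    means, counts = {}, {}
    for lev in levels:
        vals = [v for v, g in zip(y, groups) if g == lev]
        counts[lev] = len(vals)
        means[lev] = sum(vals) / len(vals)
    # Residuals of the dummy regression are deviations from each group mean
    ssr = sum((v - means[g]) ** 2 for v, g in zip(y, groups))
    s2 = ssr / (len(y) - len(levels))  # residual variance, n - k degrees of freedom
    c1 = means[omitted]
    c2 = means[level] - means[omitted]
    se = math.sqrt(s2 * (1.0 / counts[level] + 1.0 / counts[omitted]))
    return c1, c2, c2 / se
```

Changing the `omitted` argument corresponds to switching between the two alternative specifications: the fitted values are identical, and only the comparison tested by c(2) changes.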
26 ANOVA and Regression: Two-Way
Series of regressions; compare to Table 11, Lecture 15
Salesaj = c(1) + c(2)*convenience + c(3)*quality + c(4)*television + c(5)*convenience*television + c(6)*quality*television + e, SSR = 501,136.7
Salesaj = c(1) + c(2)*convenience + c(3)*quality + c(4)*television + e, SSR = 502,746.3
Test for interaction effect: F(2, 54) = [(502,746.3 - 501,136.7)/2]/(501,136.7/54) = (1609.6/2)/9280.3 = 0.09
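The interaction test is a comparison of two nested regressions, and the arithmetic can be reproduced from the two SSR values on the slide. The function name `nested_f` is a hypothetical helper, not an EViews command; the inputs are the slide's own figures.

```python
def nested_f(ssr_restricted, ssr_unrestricted, q, df_resid):
    """F statistic for q restrictions, comparing nested regressions by their SSRs."""
    numerator = (ssr_restricted - ssr_unrestricted) / q
    denominator = ssr_unrestricted / df_resid
    return numerator / denominator


# SSRs from the slide: without and with the two interaction terms
f_interaction = nested_f(502746.3, 501136.7, q=2, df_resid=54)
print(round(f_interaction, 2))  # 0.09
```

An F this far below 1 gives no evidence of an interaction between marketing strategy and advertising medium.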
Table of Two-Way ANOVA for Apple Juice Sales
28 ANOVA and Regression: Two-Way
Series of regressions
Salesaj = c(1) + c(2)*convenience + c(3)*quality + e, SSR = 515,918.3
Test for media effect: F(1, 54) = [(515,918.3 - 502,746.3)/1]/(501,136.7/54) = 13172/9280.3 = 1.42
Salesaj = c(1) + e, SSR = SSR0 (value not given)
Test for strategy effect: F(2, 54) = [(SSR0 - 515,918.3)/2]/(501,136.7/54) = [(SSR0 - 515,918.3)/2]/9280.3 = 5.32
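Each effect in the two-way table comes from the drop in SSR between successive regressions, divided by the mean squared error of the full model (501,136.7/54), not of the larger model in each pair. A minimal sketch of the media-effect calculation follows; `effect_f` and the constant names are hypothetical labels, and the SSR values are those reported on the slides.

```python
def effect_f(ssr_without, ssr_with, df_effect, ssr_full, df_full):
    """ANOVA-style F: the effect mean square over the full-model mean squared error."""
    effect_ms = (ssr_without - ssr_with) / df_effect
    mse_full = ssr_full / df_full
    return effect_ms / mse_full


SSR_FULL = 501136.7            # strategy, television, and interactions
SSR_NO_INTERACTION = 502746.3  # strategy and television
SSR_STRATEGY_ONLY = 515918.3   # strategy dummies only

f_media = effect_f(SSR_STRATEGY_ONLY, SSR_NO_INTERACTION, 1, SSR_FULL, 54)
print(round(f_media, 2))  # 1.42
```

The strategy effect follows the same pattern with 2 degrees of freedom, using the SSR of the intercept-only regression as `ssr_without`.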
29 Nonparametric Statistics: what to do when the sample of observations is not normally distributed
30 Three Nonparametric Techniques
Wilcoxon Rank Sum Test for Independent Samples
–Data Analysis Plus
Sign Test for Matched Pairs: Rated Data
–EViews, Descriptive Statistics
Wilcoxon Signed Rank Sum Test for Matched Pairs: Quantitative Data
–EViews
31 Wilcoxon Rank Sum Test for Independent Samples
Testing the difference between the central tendencies of two populations when they are non-normal
A New Painkiller vs. Aspirin, Xm17-02
32 Rating scheme
33 Ratings
34 Rank the 30 Ratings
–30 total ratings for both samples
–3 ratings of 1
–5 ratings of 2
–etc.
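Because many ratings are tied, the ranking step needs a rule for ties. The sketch below assumes the usual convention of giving tied observations the average of the ranks they occupy; the function names `average_ranks` and `rank_sum` are hypothetical, and the Xm17-02 ratings themselves are not reproduced here.

```python
def average_ranks(values):
    """Rank values from 1..n, giving tied values the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        # Extend the block while the next sorted value is tied with this one
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        shared = (start + 1 + end + 1) / 2  # mean of the 1-based positions in the block
        for pos in range(start, end + 1):
            ranks[order[pos]] = shared
        start = end + 1
    return ranks


def rank_sum(sample_a, sample_b):
    """Rank the pooled samples together and sum the ranks belonging to sample_a."""
    ranks = average_ranks(list(sample_a) + list(sample_b))
    return sum(ranks[:len(sample_a)])
```

With three ratings of 1 and five ratings of 2 at the bottom of the pooled list, the three 1s occupy ranks 1 to 3 and each receives rank 2, while the five 2s occupy ranks 4 to 8 and each receives rank 6.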
continued
Rank Sum
38 Rank Sum, T
E(T) = n1(n1 + n2 + 1)/2 = 15*31/2 = 232.5
VAR(T) = n1*n2(n1 + n2 + 1)/12 = 15*15*31/12 = 581.25, so σT = 24.1
For sample sizes larger than 10, T is approximately normal
Z = [T - E(T)]/σT = (T - 232.5)/24.1 = 1.83
Null hypothesis: the central tendency for the two drugs is the same
Alternative hypothesis: the central tendency for the new drug is greater than for aspirin (1-tailed test)
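The normal approximation can be worked through for n1 = n2 = 15. This is a sketch under the slide's own formulas; the function names are hypothetical, and the one-tailed p-value is computed from the reported Z of 1.83 with the standard normal tail, which the slides do not show explicitly.

```python
import math


def rank_sum_moments(n1, n2):
    """Mean and standard deviation of the rank sum T under the null hypothesis."""
    expected = n1 * (n1 + n2 + 1) / 2
    sd = math.sqrt(n1 * n2 * (n1 + n2 + 1) / 12)
    return expected, sd


def rank_sum_z(t, n1, n2):
    """Standardized rank sum, Z = [T - E(T)] / sd(T)."""
    expected, sd = rank_sum_moments(n1, n2)
    return (t - expected) / sd


def upper_tail_p(z):
    """One-tailed p-value P(Z > z) for a standard normal variable."""
    return 0.5 * math.erfc(z / math.sqrt(2))


expected, sd = rank_sum_moments(15, 15)
print(expected, round(sd, 1))        # 232.5 24.1
print(round(upper_tail_p(1.83), 3))  # 0.034
```

At the 5% level, a one-tailed p-value of about 0.034 would reject the null hypothesis in favor of the new painkiller.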