Download presentation
Presentation is loading. Please wait.
1
1 Power 16
2
2 Review Post-Midterm Cumulative
3
3 Projects
4
4 Logistics Put power point slide show on a high density floppy disk, or e-mail as an attachment, for a WINTEL machine. Email Llad@econ.ucsb.edu the slide-show as a PowerPoint attachment
5
5 Assignments 1. Project choice 2. Data Retrieval 3. Statistical Analysis 4. PowerPoint Presentation 5. Executive Summary 6. Technical Appendix 7. Graphics Power_13
6
6 PowerPoint Presentations: Member 4 1. Introduction: Members 1,2, 3 What Why How 2. Executive Summary: Member 5 3. Exploratory Data Analysis: Member 3 4. Descriptive Statistics: Member 3 5. Statistical Analysis: Member 3 6. Conclusions: Members 3 & 5 7. Technical Appendix: Table of Contents, Member 6
7
7 Executive Summary and Technical Appendix
8
8
9
9 Technical Appendix Table of Contents Spreadsheet of data used and sources or if extensive, a subsample of the data Descriptive Statistics and Histograms for the variables in the study If time series data, a plot of each variable against time If relevant, plot of the dependent Vs. each of the explanatory variables
10
10 Technical Appendix (Cont.) Statistical Results, for example regression Plot of the actual, fitted and error and other diagnostics Brief summary of the conclusions, meanings drawn from the exploratory, descriptive, and statistical analysis.
11
11 Post-Midterm Review Project I: Power 16 Contingency Table Analysis: Power 14, Lab 8 ANOVA: Power 15, Lab 9 Survival Analysis: Power 12, Power 11, Lab 7 Multi-variate Regression: Power 11, Lab 6
12
12 Slide Show Challenger disaster
13
13 Project I Number of O-Rings Failing On Launch i: y i (#) = a + b*temp i + e i Biased because of zeros, even if divide equation by 6 Two Ways to Proceed Tobit, non-linear estimation: y i (#) = a + b*temp i + e i Bernoulli variable: probability models Probability Models: y i (0,1) = a + b*temp i + e i
14
14 Project I (Cont.) Probability Models: y i (0,1) = a + b*temp i + e i OLS, Linear Probability Model, linear approximation to the sigmoid Probit, non-linear estimate of the sigmoid Logit, non-linear estimate of the sigmoid Significant Dependence on Temperature t-test (or z-test) on slope, H 0 : b=0 F-test Wald test
15
15 Project I (Cont.) Plots of Number or Probability Vs Temp. Label the axes Answer all parts, a-f The most frequent sins Did not explicitly address significance Did not answer b, 66 0 : all launches at lower temperatures had one or more o-ring failures Did not execute c, estimate linear probability model
16
16 Challenger Disaster Failure of O-rings that sealed grooves on the booster rockets Was there any relationship between o- ring failure and temperature? Engineers knew that the rubber o-rings hardened and were less flexible at low temperatures But was there launch data that showed a problem?
17
17 Challenger Disaster What: Was there a relationship between launch temperature and o-ring failure prior to the Challenger disaster? Why: Should the launch have proceeded? How: Analyze the relationship between launch temperature and o-ring failure
18
18 Launches Before Challenger Data number of o-rings that failed launch temperature
19
19
20
20
21
21
22
22 Exploratory Analysis Launches where there was a problem
23
23 158 157 170 163 170 275 353 Orings temperature
24
.
25
25 Exploratory Analysis All Launches Plot of failures per observation versus temperature range shows temperature dependence: Mean temperature for the 7 launches with o-ring failures was lower, 63.7, than for the 17 launches without o-ring failures, 72.6. - Contingency table analysis
26
26 Launches and O-Ring Failures (Yes/No)
27
27 Launches and O-Ring Failures (Yes/No) Expected/Observed
28
28 Launches and O-Ring Failures Chi- Square, 2dof=9.08, crit( =0.05)=6
29
Number of O-ring Failures Vs. Temperature
30
30 Logit Extrapolated to 31F: Probit extrapolated to 31F:
31
31 Extrapolating OLS to 31F: OLS: Tobit:
32
32 Conclusions From extrapolating the probability models to 31 F, Linear Probability, Probit, or Logit, there was a high probability of one or more o-rings failing From extrapolating the Number of O- rings failing to 31 F, OLS or Tobit, 3 or more o-rings would fail. There had been only one launch out of 24 where as many as 3 o-rings had failed. Decision theory argument: expected cost/benefit ratio:
33
33 Conclusions Decision theory argument: expected cost/benefit ratio:
34
34 Ways to Analyze Challenger Difference in mean temperatures for failures and successes Difference in probability of one or more o-ring failures for high and low temperature ranges Probabilty models: LPM (OLS), probit, logit Number of o-ring failure per launch Vs. Temp. OLS, Tobit Contingency table analysis ANOVA
35
35 Contingency Table Analysis Challenger example
36
36 Launches and O-Ring Failures (Yes/No)
37
37 ANOVA and O-Rings Probability one or more o-rings fail Low temp: 53-62 degrees Medium temp: 63-71 degrees High temp: 72-81 degrees Average number of o-rings failing per launch Low temp: 53-62 degrees Medium temp: 63-71 degrees High temp: 72-81 degrees
38
38 Probability one or more o-rings fails
39
39 Number of o-rings failing per launch
40
40
41
41 Outline ANOVA and Regression (Non-Parametric Statistics) (Goodman Log-Linear Model)
42
42 Anova and Regression: One-Way Salesaj = c(1)*convenience+c(2)*quality+c(3)*price+ e E[salesaj/(convenience=1, quality=0, price=0)] =c(1) = mean for city(1) c(1) = mean for city(1) (convenience) c(2) = mean for city(2) (quality) c(3) = mean for city(3) (price) Test the null hypothesis that the means are equal using a Wald test: c(1) = c(2) = c(3)
43
43 One-Way ANOVA and Regression Regression Coefficients are the City Means; F statistic
44
44 Anova and Regression: One-Way Alternative Specification Salesaj = c(1) + c(2)*convenience+c(3)*quality+e E[Salesaj/(convenience=0, quality=0)] = c(1) = mean for city(3) (price, the omitted one) E[Salesaj/(convenience=1, quality=0)] = c(1) + c(2) = mean for city(1) (convenience) c(1) = mean for city(3), the omitted city c(2) = mean for city(1) minus mean for city(3) Test that the mean for city(1) = mean for city(3) Using the t-statistic for c(2)
45
45 Anova and Regression: One-Way Alternative Specification Salesaj = c(1) + c(2)*convenience+c(3)*price+e E[Salesaj/(convenience=0, price=0)] = c(1) = mean for city(2) (quality, the omitted one) E[Salesaj/(convenience=1, price=0)] = c(1) + c(2) = mean for city(1) (convenience) c(1) = mean for city(2), the omitted city c(2) = mean for city(1) minus mean for city(2) Test that the mean for city(1) = mean for city(2) Using the t-statistic for c(2)
46
46 ANOVA and Regression: Two-Way Series of Regressions; Compare to Table 11, Lecture 15 Salesaj = c(1) + c(2)*convenience + c(3)* quality + c(4)*television + c(5)*convenience*television + c(6)*quality*television + e, SSR=501,136.7 Salesaj = c(1) + c(2)*convenience + c(3)* quality + c(4)*television + e, SSR=502,746.3 Test for interaction effect: F 2, 54 = [(502746.3-501136.7)/2]/(501136.7/54) = (1609.6/2)/9280.3 = 0.09
47
Table of Two-Way ANOVA for Apple Juice Sales
48
48 ANOVA and Regression: Two-Way Series of Regressions Salesaj = c(1) + c(2)*convenience + c(3)* quality + e, SSR=515,918.3 Test for media effect: F 1, 54 = [(515918.3- 502746.3)/1]/(501136.7/54) = 13172/9280.3 = 1.42 Salesaj = c(1) +e, SSR = 614757 Test for strategy effect: F 2, 54 = [(614757- 515918.3)/2]/(501136.7/54) = (98838.7/2)/(9280.3) = 5.32
49
49 Survival Analysis Density, f(t) Cumulative distribution function, CDF, F(t) Probability you failed up to time t* =F(t*) Survivor Function, S(t) = 1-F(t) Probability you survived longer than t*, S(t*) Kaplan-Meier estimates: (#at risk- # ending)/# at risk Applications Testing a new drug
50
50 Chemotherapy Drug Taxol Current standard for ovarian cancer is taxol and a platinate such as cisplatin Previous standard was cyclophosphamide and cisplatin Kaplan-Meier Survival curves comparing the two regimens Lab 7: ( # at risk- #ending)/# at riak
51
51 Taxol ( Bristol-Myers Squibb) interrupts cell division (mitosis) It is a cyclical hydrocarbon
52
52 Top Panel: European Canadian and Scottish, 342 at risk for Tc, 292 Survived 1 year Bottom Panel: Gynecological Oncology Group, 196 at risk For Tc, 168 survived 1 year
53
53 2003 Final
54
54 Nonparametric Statistics What to do when the sample of observations is not distributed normally?
55
55 3 Nonparametric Techniques Wilcoxon Rank Sum Test for independent samples Data Analysis Plus Signs Test for Matched Pairs: Rated Data Eviews, Descriptive Statistics Wilcoxon Signed Rank Sum Test for Matched Pairs: Quantitative Data Eviews
56
56 Wilcoxon Rank Sum Test for Independent Samples Testing the difference between the means of two populations when they are non-normal A New Painkiller Vs. Aspirin, Xm17-02
57
57 Rating scheme
58
58 Ratings
59
59 Rank the 30 Ratings 30 total ratings for both samples 3 ratings of 1 5 ratings of 2 etc
60
60 3 15 12
61
61 5 30 27 continued
62
62 4 19.5 5 27 Rank Sum 276.5 188.5
63
63 Rank Sum, T E (T )= n 1 (n 1 + n 2 + 1)/2 = 15*31/2 = 232.5 VAR (T) = n 1 * n 2 (n 1 + n 2 + 1)/12 VAR (T) = 15*31/12, T = 24.1 For sample sizes larger than 10, T is normal Z = [T-E(T)]/ T = (276.5 - 232.5)/24.1 = 1.83 Null Hypothesis is that the central tendency for the two drugs is the same Alternative hypothesis: central tendency for the new drug is greater than for aspirin: 1- tailed test
64
1.645 5%
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.