Pearson Group Assignment 3 Coffee, Stress and Heart Example.

Slides:



Advertisements
Similar presentations
“Students” t-test.
Advertisements

Regression and correlation methods
Multiple Regression W&W, Chapter 13, 15(3-4). Introduction Multiple regression is an extension of bivariate regression to take into account more than.
1 1 Chapter 5: Multiple Regression 5.1 Fitting a Multiple Regression Model 5.2 Fitting a Multiple Regression Model with Interactions 5.3 Generating and.
U.S. Food and Drug Administration Notice: Archived Document The content in this document is provided on the FDA’s website for reference purposes only.
FTP Biostatistics II Model parameter estimations: Confronting models with measurements.
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
Linear Regression. PSYC 6130, PROF. J. ELDER 2 Correlation vs Regression: What’s the Difference? Correlation measures how strongly related 2 variables.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 13 Nonlinear and Multiple Regression.
CORRELATION. Overview of Correlation u What is a Correlation? u Correlation Coefficients u Coefficient of Determination u Test for Significance u Correlation.
Regression Analysis. Unscheduled Maintenance Issue: l 36 flight squadrons l Each experiences unscheduled maintenance actions (UMAs) l UMAs costs $1000.
GRA 6020 Multivariate Statistics The regression model OLS Regression Ulf H. Olsson Professor of Statistics.
1 A heart fills with loving kindness is a likeable person indeed.
Lecture 24: Thurs. Dec. 4 Extra sum of squares F-tests (10.3) R-squared statistic (10.4.1) Residual plots (11.2) Influential observations (11.3,
Chapter Topics Types of Regression Models
Topics: Regression Simple Linear Regression: one dependent variable and one independent variable Multiple Regression: one dependent variable and two or.
Part 18: Regression Modeling 18-1/44 Statistics and Data Analysis Professor William Greene Stern School of Business IOMS Department Department of Economics.
Chapter 9 Hypothesis Testing.
Introduction to Regression Analysis, Chapter 13,
Pearson Group Assignment 3 Coffee, Stress and Heart Example.
ECONOMETRICS I CHAPTER 5: TWO-VARIABLE REGRESSION: INTERVAL ESTIMATION AND HYPOTHESIS TESTING Textbook: Damodar N. Gujarati (2004) Basic Econometrics,
1 Doing Statistics for Business Doing Statistics for Business Data, Inference, and Decision Making Marilyn K. Pelosi Theresa M. Sandifer Chapter 11 Regression.
Regression Analysis Regression analysis is a statistical technique that is very useful for exploring the relationships between two or more variables (one.
Simple Linear Regression Models
T tests comparing two means t tests comparing two means.
Power and Sample Size Determination Anwar Ahmad. Learning Objectives Provide examples demonstrating how the margin of error, effect size and variability.
Understanding the Variability of Your Data: Dependent Variable Two "Sources" of Variability in DV (Response Variable) –Independent (Predictor/Explanatory)
Statistical Analysis Mean, Standard deviation, Standard deviation of the sample means, t-test.
Decisions from Data: The Role of Simulation Gail Burrill Gail Burrill Michigan State University
Correlation & Regression
Lesson Multiple Regression Models. Objectives Obtain the correlation matrix Use technology to find a multiple regression equation Interpret the.
TEKS (6.10) Probability and statistics. The student uses statistical representations to analyze data. The student is expected to: (B) identify mean (using.
Multiple Regression Petter Mostad Review: Simple linear regression We define a model where are independent (normally distributed) with equal.
Jeopardy Statistics Edition. Terms Calculator Commands Sampling Distributions Confidence Intervals Hypothesis Tests: Proportions Hypothesis Tests: Means.
Research Seminars in IT in Education (MIT6003) Quantitative Educational Research Design 2 Dr Jacky Pow.
1 1 Slide Simple Linear Regression Estimation and Residuals Chapter 14 BA 303 – Spring 2011.
Introduction to Inference: Confidence Intervals and Hypothesis Testing Presentation 8 First Part.
Statistical planning and Sample size determination.
What is a Confidence Interval?. Sampling Distribution of the Sample Mean The statistic estimates the population mean We want the sampling distribution.
Correlation & Regression Analysis
Chapter 8: Simple Linear Regression Yang Zhenlin.
T tests comparing two means t tests comparing two means.
Introducing Communication Research 2e © 2014 SAGE Publications Chapter Seven Generalizing From Research Results: Inferential Statistics.
Confidence Intervals for a Population Proportion Excel.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
T tests comparing two means t tests comparing two means.
Ex St 801 Statistical Methods Inference about a Single Population Mean (CI)
Chapter 9: Introduction to the t statistic. The t Statistic The t statistic allows researchers to use sample data to test hypotheses about an unknown.
Statistical Criteria for Establishing Safety and Efficacy of Allergenic Products Tammy Massie, PhD Mathematical Statistician Team Leader Bacterial, Parasitic.
732G21/732G28/732A35 Lecture 3. Properties of the model errors ε 4. ε are assumed to be normally distributed
Statistics for science fairs
GS/PPAL Section N Research Methods and Information Systems
Regression Analysis AGEC 784.
AP Biology Intro to Statistics
B&A ; and REGRESSION - ANCOVA B&A ; and
BIVARIATE REGRESSION AND CORRELATION
Linear Regression Models
Statistical Methods For Engineers
Inferences and Conclusions from Data
Multiple Regression Models
Simple Linear Regression
Putting It All Together: Which Method Do I Use?
Using Significance Tests
STATISTICS Topic 1 IB Biology Miss Werba.
Pearson Group Assignment 3 Coffee, Stress and Health Example
Simple Linear Regression
Pearson Group Assignment 3 Coffee, Stress and Heart Example
Simple Linear Regression
Essentials of Statistics for Business and Economics (8e)
Effect of Sample size on Research Outcomes
Presentation transcript:

Pearson Group Assignment 3 Coffee, Stress and Heart Example

Candidate Models Heart ~ Coffee Heart ~ Coffee Heart ~ Stress Heart ~ Stress Heart ~ Coffee + Stress Heart ~ Coffee + Stress

Data Ellipses…

Data Ellipses A data ellipse shows the spread of data (good visualization tool) and informs us about the variance and covariance structure of the data. Simply put, the distribution of our data. Red ellipses for r=1 (1 standard deviation) Green ellipses for r=2 (2 standard deviations)

Conditional Model BetaStress = BetaCoffee = Marginal Models BetaStress = BetaCoffee =

95% Confidence Ellipses and Intervals for theCoefficients for Stress and Coffee Green Ellipse: Scheffé Red Ellipse and Its 'Shadow' Intervals: Bonferroni Blue Interval: Bonferroni (marginal model)‏

Scheff Scheffé vs. Bonferroni Scheff CI Scheffé CI Larger than Bonferroni CI with minimal number of parameters Larger than Bonferroni CI with minimal number of parameters Allows for data snooping Allows for data snooping Based on F-statistic Based on F-statistic Bonferroni CI Incorporates a penalty for examining several coefficients simultaneously Incorporates a penalty for examining several coefficients simultaneously Based on t-statistic Based on t-statistic

Conditional Model Coefficent for Coffee Bonferonni 95% CI: ( , ) Bonferonni 95% CI: ( , ) Scheffé 95% CI: ( , )‏ Scheffé 95% CI: ( , )‏ –Both intervals include 0, thus not significant Coefficient for Stress Bonferonni 95% CI: ( , )‏ Bonferonni 95% CI: ( , )‏ Scheffé 95% CI: ( , ) Scheffé 95% CI: ( , ) –Both intervals do not include 0, thus are highly significant

Conditional Model In this case, either confidence interval we use will give us the same conclusion In this case, either confidence interval we use will give us the same conclusion If we move the ellipses upward introducing measurement error on Stress, we can make the coefficient of Coffee significant according to Bonferroni’s method. If we move the ellipses upward introducing measurement error on Stress, we can make the coefficient of Coffee significant according to Bonferroni’s method. Although, it is still insignificant according to Scheffé method. Although, it is still insignificant according to Scheffé method.

Conclusions Both Coffee and Stress alone are excellent predictors of Heart for this data set (positive linear relationship)‏ Both Coffee and Stress alone are excellent predictors of Heart for this data set (positive linear relationship)‏ Scenario 1: Coffee (consumption) may be a cause of Heart (condition) with (occupational) Stress as a mediating factor: use Heart ~ Coffee Scenario 1: Coffee (consumption) may be a cause of Heart (condition) with (occupational) Stress as a mediating factor: use Heart ~ Coffee Scenario 2: Stress may be a common cause for both Coffee and Heart: use Heart ~ Coffee + Stress Scenario 2: Stress may be a common cause for both Coffee and Heart: use Heart ~ Coffee + Stress Scenarios 1 and 2 and other possible scenarios cannot be decided from this small data set with highly confounded variables Scenarios 1 and 2 and other possible scenarios cannot be decided from this small data set with highly confounded variables To determine the proper scenario, we need to collect more data, less confounded if possible, and consult the relevant medical literature and health researchers for helpful insights To determine the proper scenario, we need to collect more data, less confounded if possible, and consult the relevant medical literature and health researchers for helpful insights Statistician take into consideration the significance of all possible models, and then decide on an appropriate model Statistician take into consideration the significance of all possible models, and then decide on an appropriate model Statistician must try to insure all important factors and predictors are considered in the model fitting process Statistician must try to insure all important factors and predictors are considered in the model fitting process