Exercise 1: Gestational age and birthweight

Slides:



Advertisements
Similar presentations
Qualitative predictor variables
Advertisements

Inference for Regression
AP Statistics Chapter 3 Practice Problems
Overview Correlation Regression -Definition
Chapter 15 (Ch. 13 in 2nd Can.) Association Between Variables Measured at the Interval-Ratio Level: Bivariate Correlation and Regression.
Some Terms Y =  o +  1 X Regression of Y on X Regress Y on X X called independent variable or predictor variable or covariate or factor Which factors.
Stat 217 – Day 25 Regression. Last Time - ANOVA When?  Comparing 2 or means (one categorical and one quantitative variable) Research question  Null.
Correlation and Regression Analysis
Chapter 5 Regression. Chapter 51 u Objective: To quantify the linear relationship between an explanatory variable (x) and response variable (y). u We.
Inference for regression - Simple linear regression
LEARNING PROGRAMME Hypothesis testing Intermediate Training in Quantitative Analysis Bangkok November 2007.
Review of Statistical Models and Linear Regression Concepts STAT E-150 Statistical Methods.
Regression Analysis. Scatter plots Regression analysis requires interval and ratio-level data. To see if your data fits the models of regression, it is.
Soc 3306a Lecture 9: Multivariate 2 More on Multiple Regression: Building a Model and Interpreting Coefficients.
1 Regression Analysis The contents in this chapter are from Chapters of the textbook. The cntry15.sav data will be used. The data collected 15 countries’
Correlation and Regression: The Need to Knows Correlation is a statistical technique: tells you if scores on variable X are related to scores on variable.
A first order model with one binary and one quantitative predictor variable.
SIMPLE LINEAR REGRESSION AND CORRELLATION
Correlation & Simple Linear Regression Chung-Yi Li, PhD Dept. of Public Health, College of Med. NCKU 1.
Copyright © 2017, 2014 Pearson Education, Inc. Slide 1 Chapter 4 Regression Analysis: Exploring Associations between Variables.
Yandell – Econ 216 Chap 15-1 Chapter 15 Multiple Regression Model Building.
Stats Methods at IC Lecture 3: Regression.
Predicting Energy Consumption in Buildings using Multiple Linear Regression Introduction Linear regression is used to model energy consumption in buildings.
Chapter 13 Simple Linear Regression
Chapter 15 Multiple Regression Model Building
Examining Relationships
Regression Analysis.
Simple Linear Regression
Inference for Least Squares Lines
Modeling in R Sanna Härkönen.
*Bring Money for Yearbook!
Correlation & Regression
Regression Inferential Methods
A little VOCAB.
Examining the Relationship Between Two Variables
SCATTERPLOTS, ASSOCIATION AND RELATIONSHIPS
Correlation and Regression
Regression Analysis Simple Linear Regression
The scatterplot shows the advertised prices (in thousands of dollars) plotted against ages (in years) for a random sample of Plymouth Voyagers on several.
Chapter 4 Correlation.
Regression Analysis.
INFERENTIAL STATISTICS: REGRESSION ANALYSIS AND STANDARDIZATION
Applied Statistical Analysis
Chapter 14: Correlation and Regression
Elementary Statistics
Interpret Scatterplots & Use them to Make Predictions
(Residuals and
Lecture Slides Elementary Statistics Thirteenth Edition
Week 5 Lecture 2 Chapter 8. Regression Wisdom.
Multiple Regression A curvilinear relationship between one variable and the values of two or more other independent variables. Y = intercept + (slope1.
CHAPTER 26: Inference for Regression
Unit 3 – Linear regression
Chapter 5 Correlation.
Inference for Regression
CORRELATION ANALYSIS.
Simple Linear Regression
The greatest blessing in life is
Examining the Relationship Between Two Variables
3 4 Chapter Describing the Relation between Two Variables
Warm Up The following data lists the prices of 12 different bicycle helmets along with the quality rating. Make a scatterplot of the data (price is the.
Correlation.
Objective: Interpret Scatterplots & Use them to Make Predictions.
Homework: pg. 180 #6, 7 6.) A. B. The scatterplot shows a negative, linear, fairly weak relationship. C. long-lived territorial species.
Adequacy of Linear Regression Models
Learning outcomes By the end of this session you should know about:
Correlation & Trend Lines
Exercise 1: Entering data into SPSS
BUS-221 Quantitative Methods
Exercise 1: Gestational age and birthweight
Presentation transcript:

Exercise 1: Gestational age and birthweight Draw a line of best fit through the data (with roughly half the points above and half below). Describe the relationship Is the relationship: strong/ weak? positive/ negative? linear?

Exercise 2: Interpretation Interpret the following correlation coefficients using Cohen’s classification and explain what they mean. Which correlations seem meaningful? Relationship Correlation Average IQ and chocolate consumption 0.27 Road fatalities and Nobel winners 0.55 Gross Domestic Product and Nobel winners 0.70 Mean temperature and Nobel winners -0.60

Exercise 3 Investigate whether mother’s pre-pregnancy weight and birth weight are associated using a scatterplot, correlation and simple regression

Exercise 3: correlation Pearson’s correlation coefficient = Describe the relationship using the scatterplot and correlation coefficient

Exercise 3: regression Adjusted R2 = Does the model result in reliable predictions? ANOVA p-value = Is the model an improvement on the null model (where every baby is predicted to be the mean weight)?

Exercise 3: Regression Pre-pregnancy weight coefficient and p-value: Regression equation: Interpretation:

Exercise 4: correlations Which variables are most strongly related to each other?

Exercise 5 Run a multiple regression model for Birth weight with gestational age, mothers weight and smoking status as independent variables Check the assumptions and interpret the output Does the model give more reliable predictions than the model with just gestational age? Add mother’s height to the model. Does anything change? Note you will need to create a variable for smoking status based on the number of cigarettes that the mother smokes (assuming that 0 cigarettes indicates someone who does not smoke)

Exercise 5: model 1 summary Variable Coefficient (β) P-value Significant? Constant Gestation Smoker Pre-pregnancy weight Adjusted R2 = Interpretation:

Exercise 5: model 2 summary Variable Coefficient (β) P-value Significant? Constant Gestation Smoker Pre-pregnancy weight Height Adjusted R2 = Interpretation:

Exercise 5: Compare p-values Model Gestation Smoking Weight Height P < 0.001 + Smoker 0.028 + Weight + Height

Exercise 5: Compare R2 Model R2 Adjusted R2 Gestation 0.499 0.486 + Smoker 0.558 0.535 + Weight + Height

Exercise 1: Gestational age and birthweight There is a strong positive relationship which is linear

Exercise 2: Interpretation Relationship Correlation Interpretation Average IQ and chocolate consumption 0.27 Weak positive relationship. More chocolate per capita = higher average IQ Road fatalities and Nobel winners 0.55 Strong positive. More accidents = more prizes! Gross Domestic Product and Nobel winners 0.7 Strong positive. Wealthy countries = more prizes Mean temperature and Nobel winners -0.6 Strong negative. Colder countries = more prizes.

Exercise 3: scatterplot Assumption 1: Is there a linear relationship? Yes!

Exercise 3: correlation Pearson’s correlation = 0.40 Describe the relationship using the scatterplot and correlation coefficient: There is a moderate positive relationship between mothers’ pre-pregnancy weight and birth weight (r = 0.40). Generally, birth weight increases as mothers weight increases

Exercise 3: regression Adjusted R2 = 0.14 Does the model result in reliable predictions? Not really. The adjusted R2 value is 0.14. ANOVA p-value = 0.009 Is the model an improvement on the null model (where every baby is predicted to be the mean weight)? Yes as p < 0.05

Exercise 3: regression Pre-pregnancy weight coefficient & p-value: 0.034 (p = 0.009) Regression equation: y = 1.379 + 0.034 Interpretation: There is a significant relationship between a mothers’ pre-pregnancy weight and the weight of her baby (p = 0.009). Pre-pregnancy weight has a positive affect on a baby’s weight with an increase of 0.034 kg for each extra kg a mother weighs.

Exercise 3: normality of the residuals? Yes – histogram roughly peaks in the middle

Exercise 3: homoscedasticity? Yes – no patterns in residuals

Exercise 4: correlations Which variables are most strongly related? Gestation and birth weight (0.708) Mothers height and weight (0.681) Mothers height and weight are strongly related. They don’t exceed 0.8 but try the model with and without height in case it’s a problem

Exercise 5: Model 1 residual assumptions Assumptions are met

Exercise 5: Model 1 Adjusted R2 Does the model result in reliable predictions? Yes – the adjusted R2 is reasonably high

Exercise 5: Model 1 ANOVA ANOVA p-value < 0.001 Is the model an improvement on the null model (where every baby is predicted to be the mean weight)? Yes as p < 0.05

Exercise 5: Model 1 regression coefficients Gestational age (p < 0.001), mother’s pre-pregnancy weight and smoking status (p=0.016) are significant predictors of birth weight after controlling for the weight and height of the mother. Weight increases by 0.14 kgs for each week of gestation, by 0.02 kgs for each extra kg a mother weighs and decreases by 0.31 kgs for smokers.

Exercise 5: Model 2 (including height) There’s a moderate positive linear relationship between maternal height and birth weight.

Exercise 5: Model 2 residual assumptions Assumptions are met

Exercise 5: Model 2 Adjusted R2 Does the model result in reliable predictions? Yes – the adjusted R2 is reasonably high

Exercise 5: Model 2 ANOVA ANOVA p-value < 0.001 Is the model an improvement on the null model (where every baby is predicted to be the mean weight)? Yes as p < 0.05

Exercise 5: Model 2 regression coefficients Interpretation: Gestational age (p < 0.001) and smoking status (p=0.016) are significant predictors of birth weight after controlling for the weight and height of the mother. Weight increases by 0.14 kgs for each week of gestation and decreases by 0.31 kgs for smokers.

Exercise 5: compare p-values Model Gestation Smoking Weight Height P < 0.001 + Smoker 0.024 + Weight 0.016 0.025 + Height 0.263 0.366 Smoking gets more significant as variables are added. Mothers’ weight becomes non-significant once height has been added. They are strongly related and are sharing some of the variation in birth weight when both in the model.

Exercise 5: compare Adjusted R2 Model R2 Adjusted R2 Gestation 0.502 0.489 + Smoker 0.563 0.541 + Weight 0.618 0.588 + Height 0.627 0.586 Adding smoker and weight improves the fit a little bit Adding height has not improved the fit of the model at all as the adjusted R2 decreases