N318b Winter 2002 Nursing Statistics Specific statistical tests: Regression Lecture 11.

Slides:



Advertisements
Similar presentations
Managerial Economics in a Global Economy
Advertisements

Regression and correlation methods
1 1 Chapter 5: Multiple Regression 5.1 Fitting a Multiple Regression Model 5.2 Fitting a Multiple Regression Model with Interactions 5.3 Generating and.
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 12 l Multiple Regression: Predicting One Factor from Several Others.
Inference for Regression
FTP Biostatistics II Model parameter estimations: Confronting models with measurements.
Linear Regression. PSYC 6130, PROF. J. ELDER 2 Correlation vs Regression: What’s the Difference? Correlation measures how strongly related 2 variables.
Regression Analysis. Unscheduled Maintenance Issue: l 36 flight squadrons l Each experiences unscheduled maintenance actions (UMAs) l UMAs costs $1000.
Lecture 23: Tues., Dec. 2 Today: Thursday:
PPA 501 – Analytical Methods in Administration Lecture 8 – Linear Regression and Correlation.
PPA 415 – Research Methods in Public Administration
Lecture 19: Tues., Nov. 11th R-squared (8.6.1) Review
Stat 217 – Week 10. Outline Exam 2 Lab 7 Questions on Chi-square, ANOVA, Regression  HW 7  Lab 8 Notes for Thursday’s lab Notes for final exam Notes.
Lecture 24: Thurs., April 8th
REGRESSION AND CORRELATION
Multiple Regression Research Methods and Statistics.
Correlation and Regression Analysis
Relationships Among Variables
Basic Analysis of Variance and the General Linear Model Psy 420 Andrew Ainsworth.
Review Guess the correlation. A.-2.0 B.-0.9 C.-0.1 D.0.1 E.0.9.
Example of Simple and Multiple Regression
Regression and Correlation Methods Judy Zhong Ph.D.
Chapter 13: Inference in Regression
LEARNING PROGRAMME Hypothesis testing Intermediate Training in Quantitative Analysis Bangkok November 2007.
Simple Linear Regression
N318b Winter 2002 Nursing Statistics Specific statistical tests: Correlation Lecture 10.
EQT 272 PROBABILITY AND STATISTICS
OPIM 303-Lecture #8 Jose M. Cruz Assistant Professor.
MULTIPLE REGRESSION Using more than one variable to predict another.
Week 4: Multiple regression analysis Overview Questions from last week What is regression analysis? The mathematical model Interpreting the β coefficient.
INTRODUCTORY LINEAR REGRESSION SIMPLE LINEAR REGRESSION - Curve fitting - Inferences about estimated parameter - Adequacy of the models - Linear.
N318b Winter 2002 Nursing Statistics Hypothesis and Inference tests, Type I and II errors, p-values, Confidence Intervals Lecture 5.
Chapter 3: Introductory Linear Regression
Correlation and Regression PS397 Testing and Measurement January 16, 2007 Thanh-Thanh Tieu.
EQT 373 Chapter 3 Simple Linear Regression. EQT 373 Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value.
Applied Quantitative Analysis and Practices LECTURE#23 By Dr. Osman Sadiq Paracha.
N318b Winter 2002 Nursing Statistics Lecture 2: Measures of Central Tendency and Variability.
Multiple Linear Regression. Purpose To analyze the relationship between a single dependent variable and several independent variables.
Linear correlation and linear regression + summary of tests
CHAPTER 3 INTRODUCTORY LINEAR REGRESSION. Introduction  Linear regression is a study on the linear relationship between two variables. This is done by.
SW388R6 Data Analysis and Computers I Slide 1 Multiple Regression Key Points about Multiple Regression Sample Homework Problem Solving the Problem with.
CORRELATION: Correlation analysis Correlation analysis is used to measure the strength of association (linear relationship) between two quantitative variables.
Education 793 Class Notes Multiple Regression 19 November 2003.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Lecture 10: Correlation and Regression Model.
Multiple Regression. Simple Regression in detail Y i = β o + β 1 x i + ε i Where Y => Dependent variable X => Independent variable β o => Model parameter.
© 2006 by The McGraw-Hill Companies, Inc. All rights reserved. 1 Chapter 12 Testing for Relationships Tests of linear relationships –Correlation 2 continuous.
N318b Winter 2002 Nursing Statistics Specific statistical tests Chi-square (  2 ) Lecture 7.
Statistics for the Social Sciences Psychology 340 Spring 2010 Introductions & Review of some basic research methods.
Handout Twelve: Design & Analysis of Covariance
26134 Business Statistics Week 4 Tutorial Simple Linear Regression Key concepts in this tutorial are listed below 1. Detecting.
N318b Winter 2002 Nursing Statistics Specific statistical tests: Tests for means when there are more than 2 groups ANOVA Lecture 9.
N318b Winter 2002 Nursing Statistics Specific statistical tests: The T-test for means Lecture 8.
Biostatistics Regression and Correlation Methods Class #10 April 4, 2000.
Regression Analysis: A statistical procedure used to find relations among a set of variables B. Klinkenberg G
Lecture 7: Bivariate Statistics. 2 Properties of Standard Deviation Variance is just the square of the S.D. If a constant is added to all scores, it has.
REGRESSION AND CORRELATION SIMPLE LINEAR REGRESSION 10.2 SCATTER DIAGRAM 10.3 GRAPHICAL METHOD FOR DETERMINING REGRESSION 10.4 LEAST SQUARE METHOD.
PSY 325 AID Education Expert/psy325aid.com FOR MORE CLASSES VISIT
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 9 l Simple Linear Regression 9.1 Simple Linear Regression 9.2 Scatter Diagram 9.3 Graphical.
Multiple Regression Scott Hudson January 24, 2011.
Correlation & Simple Linear Regression Chung-Yi Li, PhD Dept. of Public Health, College of Med. NCKU 1.
26134 Business Statistics Week 4 Tutorial Simple Linear Regression Key concepts in this tutorial are listed below 1. Detecting.
Stats Methods at IC Lecture 3: Regression.
Simple Linear Regression
Bivariate & Multivariate Regression Analysis
10.2 Regression If the value of the correlation coefficient is significant, the next step is to determine the equation of the regression line which is.
Simple Linear Regression
Correlation and Regression
Applied Statistical Analysis
Correlation and Regression
Presentation transcript:

N318b Winter 2002 Nursing Statistics Specific statistical tests: Regression Lecture 11

School of Nursing Institute for Work & Health Nur 318b 2002 Lecture 11: page 2  Discussion of final exam  Regression >  Applying knowledge to assigned reading  Ferketich & Mercer (1995) Followed by small groups 12-2 PM Focus on interpreting Regression results Today’s Class

School of Nursing Institute for Work & Health Nur 318b 2002 Lecture 11: page 3 Focuses on Ferketich & Mercer (1995) Q1. Chance to interpret regression findings Q2. Know main advantage of regression Key points about regression relating to group work will be covered in the 2 nd part of lecture “In Group” Session

School of Nursing Institute for Work & Health Nur 318b 2002 Lecture 11: page 4 Last year’s exam is now on class web page Answers will go up next week ! 3 parts (60 marks): (3 hours) Section A: true/false (8 marks) Section B: multiple choice (22 marks) Section C: short answer (30 marks) on interpreting results of a research study (4 questions – choose 10 marks each) INCLUDES ALL LECTURES (TODAY) ! Outline of Final Exam

School of Nursing Institute for Work & Health Nur 318b 2002 Lecture 11: page 5 First part focuses on answering select questions from each of the 3 sections Second part focuses on addressing questions directly from students in class Work group will be a chance to review own mid-term results and ask further questions Outline of Review Lecture

School of Nursing Institute for Work & Health Nur 318b 2002 Lecture 11: page 6 CorrelationRegression Used for?associationprediction Variables?two onlytwo or more Statistics?“r” (+’ve or –’ve)“b” (+’ve or –’ve) and “R 2 ” Tells you?direction & strength direction, strength & magnitude Correlation vs. Regression

School of Nursing Institute for Work & Health Nur 318b 2002 Lecture 11: page 7 Multiple linear regression – one dependent variable and >1 independent variables Simple linear regression – one dependent variable and one independent variable Multiple logistic regression – one binary (e.g. Y/N) dependent variable and >1 independent variables (especially useful for case-control) Like ANOVA, regression is a family of methods Plus MANY others ! Types of Regression

School of Nursing Institute for Work & Health Nur 318b 2002 Lecture 11: page 8 Correlation tells you only if there is a relationship between two variables – strength (“r”) and direction (positive or negative) Simple linear regression tells you if there is a relationship between two variables and uses score on one to predict score on the other Multiple linear regression tells you if there is a relationship between two variables and uses score on one to predict score on the other and controls for other factors that might influence (“confound”) the relationship being studied Main Purpose of Regression

School of Nursing Institute for Work & Health Nur 318b 2002 Lecture 11: page 9 How do you known when to use which test? Helps to ask some basic questions: 1. What kind of data are used? 2. What kind of relationship is of interest? 3. How many groups (samples) involved? - one, two, or more than two - prediction, association or difference? - ratio/interval or categorical (ordinal/nominal) - dependent (e.g. follow-up) or independent Statistical Tests – Review

School of Nursing Institute for Work & Health Nur 318b 2002 Lecture 11: page 10 How do you known when to use regression? 1.What kind of data are used? 2. What kind of relationship is of interest? 3. How many groups (samples) involved? - most often numeric/continuous (ratio/interval) - can also be binary (logistic regression) - prediction (and/or association) - often just one sample involved (in some ways regression tries to “find” groups ) Referring back to the 3 “basic questions”: Regression – when to use it?

School of Nursing Institute for Work & Health Nur 318b 2002 Lecture 11: page The relationship under study is linear – important non-linear relationships can be overlooked with simple regression analysis 2. Data are (approximately) normally distributed 3. Data in the (two) variables have roughly equal range of variability (i.e. homoscedasticity) BASICALLY SAME AS FOR CORRELATION Regression (linear) assumptions

School of Nursing Institute for Work & Health Nur 318b 2002 Lecture 11: page 12 Although useful for understanding concepts behind regression it is not used very often in research articles (as main analytic tool) – why? Example – can use simple linear regression to determine if there is a relationship between mother’s age and baby’s birth weight Analysis shows a significant relationship between age and birth weight (for each 1 yr of age there was a 12.3 g increase in birth weight) Any doubts about this result? Simple Linear Regression

School of Nursing Institute for Work & Health Nur 318b 2002 Lecture 11: page 13 What other things might be related to child’s birth weight other than just mother’s age? Mother’s weight? Smoking? Gestational age? Diabetes? Income? Simple linear regression can be used to separately examine relationship between birth weight and each of these but not the combined effects of any of them together Need multiple regression to do this ! Simple Linear Regression – cont’d

School of Nursing Institute for Work & Health Nur 318b 2002 Lecture 11: page 14 Regression quantifies a (linear) relationship that minimizes variation in predicted versus observed values around the regression line Y' = a + bX + e Y' = predicted value (of dependent variable) a = intercept (value of Y' when X = 0) b = slope of regression line (“beta” coefficient) X = value of independent variable (“predictor”) e = error (how much Y' differs from observed) The Simple Regression Model

School of Nursing Institute for Work & Health Nur 318b 2002 Lecture 11: page 15 Very similar to ANOVA – how was that done? F = Between-group variability Within-group variability Variance attributed to model term(s) Variance from random error For regression, we also use variance from two sources: model (variables) and error (chance) F = larger F-statistic = smaller p-value Testing Fit of Regression Model

School of Nursing Institute for Work & Health Nur 318b 2002 Lecture 11: page 16 What does a significant F-statistic tell you? independent variable(s), X significantly predicts value of dependent variable, Y' (i.e. a “good” fit) What if F is non-significant? X not associated with Y' or there is a non-linear relationship (i.e. model not a “good” fit for data) What does a significant beta coefficient tell you? amount Y' increases for each unit change in X What does a significant “R 2 ” tell you? total variation in Y' attributable to model terms Interpreting Regression Output

School of Nursing Institute for Work & Health Nur 318b 2002 Lecture 11: page 17 Regression Output – Example Systolic BPAge Is there a relationship between systolic BP and age? Sample of ten subjects

School of Nursing Institute for Work & Health Nur 318b 2002 Lecture 11: page 18 Regression Table Same as correlation coefficient seen before ! (true only for simple reg.)

School of Nursing Institute for Work & Health Nur 318b 2002 Lecture 11: page 19 Plot of BP and Age Is age is a good predictor of systolic BP?

School of Nursing Institute for Work & Health Nur 318b 2002 Lecture 11: page minute break !

School of Nursing Institute for Work & Health Nur 318b 2002 Lecture 11: page 21 What about other factors – e.g. weight? Systolic BPAgeWeight Does weight affect the relationship between systolic BP and age? Same ten subjects Multiple Linear Regression

School of Nursing Institute for Work & Health Nur 318b 2002 Lecture 11: page 22 Multiple Regression Table Note R 2 increases since more terms are in this model (thus less “unexplained” variance)

School of Nursing Institute for Work & Health Nur 318b 2002 Lecture 11: page 23 Multiple Regression Summary What does a significant F-statistic tell you? together age and weight are strong predictors of systolic BP (i.e. model is a “good” fit of data) Why did effects of age change so much? age and weight are strongly correlated with each other (note “r” = 0.93 !) What do non-significant beta coefficients tell you? each has non-significant (independent) effect

School of Nursing Institute for Work & Health Nur 318b 2002 Lecture 11: page 24 Model Building in Regression How does this happen? Statistical models are developed that (should) fit the theoretical model proposed by researchers The main advantage of regression is that is allows you to “control” for multiple factors Different procedures available but all essentially try to “weed out” non-significant factors to explain the most variation with least number of variables As terms are added (or deleted) in model process, you look for significant changes in R 2 – if not, then variable has no significant (independent) effect

School of Nursing Institute for Work & Health Nur 318b 2002 Lecture 11: page 25 Part 2: Application to the Assigned Readings

School of Nursing Institute for Work & Health Nur 318b 2002 Lecture 11: page 26 Quick summary of the paper: – a follow-up study examining the effects of previous fatherhood experience on paternal competence – also looked at “predictors” of competence – 172 subjects (79 exp., 93 inexp.) were recruited during partner’s pregnancy – analysis included t-tests, ANOVA and multiple regression Ferketich & Mercer (1995)

School of Nursing Institute for Work & Health Nur 318b 2002 Lecture 11: page 27 “Are there any differences in paternal competence between first-time (inexp) and experienced (exp) fathers in the transition to the father role?.” Is this question related to regression results? Not directly – more specific if rephrased as: “Are there any differences in “predictors” of paternal competence between … e.g. last sentence of 1 st paragraph under “Method” see first sentence of article – page 53 of syllabus Main study research question

School of Nursing Institute for Work & Health Nur 318b 2002 Lecture 11: page 28 How did the researchers “structure” analysis? 1. Comparison of the two main groups on several demographic characteristics Started with ….. 2. Comparison of the two main groups on key independent variables  How Then ….. (t-tests, RANOVA) 3. Identification of predictors of competence for the two main groups  How Then ….. (multiple regression) Overview of analysis

School of Nursing Institute for Work & Health Nur 318b 2002 Lecture 11: page 29 See Table 1 page 92 of paper Look at structure of the table … 4. Was it statistically significant – e.g. p< How is it being tested – e.g. test used? 2. What is being tested – e.g. dep/indep? 1. Who is in the table – e.g. groups ? - only the experienced fathers - predictors of competence (many factors) - multiple regression (at 4 time points) - everything ?! (What happened to rest?) Interpreting Table 1 …

School of Nursing Institute for Work & Health Nur 318b 2002 Lecture 11: page 30 What do all the columns mean (at each time)? Unique R 2 = variance for that term only Why does Table 1 seem to have different variables being tested at each time point? Only those that were significant shown ! Cumulative R 2 = variance for all model terms Beta = change in Y per unit of X F = test statistic from regression output (model) p = significance of test statistic, F (model) Interpreting Table 1 - cont’d

School of Nursing Institute for Work & Health Nur 318b 2002 Lecture 11: page 31 Questions for interpreting results in workshop … 4. What seems to be related? 3. Was it in the expected direction? 2. How was model developed? 1. What is the relative contribution of the terms in each model? - i.e. what is its change in R 2 ? - i.e. where are the other terms? - i.e. is there a +’ve or –’ve relationship? - i.e. how can a term be significant in one model but not in another? For Table 1 (cont’d)

School of Nursing Institute for Work & Health Nur 318b 2002 Lecture 11: page 32 For next week’s class please review: 1.Last year’s final exam (web page) Next Week: Review