The greatest blessing in life is

Slides:



Advertisements
Similar presentations
Forecasting Using the Simple Linear Regression Model and Correlation
Advertisements

Copyright © 2009 Pearson Education, Inc. Chapter 29 Multiple Regression.
Inference for Regression
AP Statistics Chapters 3 & 4 Measuring Relationships Between 2 Variables.
Class 16: Thursday, Nov. 4 Note: I will you some info on the final project this weekend and will discuss in class on Tuesday.
LINEAR REGRESSION: Evaluating Regression Models Overview Assumptions for Linear Regression Evaluating a Regression Model.
Chapter Topics Types of Regression Models
RESEARCH STATISTICS Jobayer Hossain Larry Holmes, Jr November 6, 2008 Examining Relationship of Variables.
Quantitative Business Analysis for Decision Making Simple Linear Regression.
1 Simple Linear Regression Linear regression model Prediction Limitation Correlation.
Chapter 7 Forecasting with Simple Regression
Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. More About Regression Chapter 14.
Correlation & Regression
Regression and Correlation Methods Judy Zhong Ph.D.
Introduction to Linear Regression and Correlation Analysis
Inference for regression - Simple linear regression
AP Statistics Section 15 A. The Regression Model When a scatterplot shows a linear relationship between a quantitative explanatory variable x and a quantitative.
Review of Statistical Models and Linear Regression Concepts STAT E-150 Statistical Methods.
1 Everyday is a new beginning in life. Every moment is a time for self vigilance.
Examining Relationships in Quantitative Research
Chapter 14 Inference for Regression © 2011 Pearson Education, Inc. 1 Business Statistics: A First Course.
Relationships If we are doing a study which involves more than one variable, how can we tell if there is a relationship between two (or more) of the.
Copyright ©2011 Brooks/Cole, Cengage Learning Inference about Simple Regression Chapter 14 1.
1 Regression Analysis The contents in this chapter are from Chapters of the textbook. The cntry15.sav data will be used. The data collected 15 countries’
REGRESSION DIAGNOSTICS Fall 2013 Dec 12/13. WHY REGRESSION DIAGNOSTICS? The validity of a regression model is based on a set of assumptions. Violation.
Chapter 2 Examining Relationships.  Response variable measures outcome of a study (dependent variable)  Explanatory variable explains or influences.
What Do You See?. A scatterplot is a graphic tool used to display the relationship between two quantitative variables. How to Read a Scatterplot A scatterplot.
AP Statistics Section 15 A. The Regression Model When a scatterplot shows a linear relationship between a quantitative explanatory variable x and a quantitative.
Regression Analysis Presentation 13. Regression In Chapter 15, we looked at associations between two categorical variables. We will now focus on relationships.
Chapter 12: Correlation and Linear Regression 1.
Chapter 11 Linear Regression and Correlation. Explanatory and Response Variables are Numeric Relationship between the mean of the response variable and.
Stats Methods at IC Lecture 3: Regression.
Lecture #25 Tuesday, November 15, 2016 Textbook: 14.1 and 14.3
Lecture 9 Sections 3.3 Objectives:
Regression Analysis AGEC 784.
Linear Regression.
AP Statistics Chapter 14 Section 1.
Statistics for Managers using Microsoft Excel 3rd Edition
Examining Relationships Least-Squares Regression & Cautions about Correlation and Regression PSBE Chapters 2.3 and 2.4 © 2011 W. H. Freeman and Company.
Inferences for Regression
Inference for Regression
SIMPLE LINEAR REGRESSION MODEL
Happiness comes not from material wealth but less desire.
Correlation and Simple Linear Regression
CHAPTER 29: Multiple Regression*
Linear Regression/Correlation
No notecard for this quiz!!
Unit 3 – Linear regression
Treat everyone with sincerity,
CHAPTER 3 Describing Relationships
PENGOLAHAN DAN PENYAJIAN
Correlation and Simple Linear Regression
Simple Linear Regression
Simple Linear Regression and Correlation
Linear Regression and Correlation
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Linear Regression and Correlation
CHAPTER 3 Describing Relationships
Chapter 13 Additional Topics in Regression Analysis
Linear Regression and Correlation
CHAPTER 3 Describing Relationships
Inferences for Regression
Regression and Correlation of Data
Honors Statistics Review Chapters 7 & 8
CHAPTER 3 Describing Relationships
Correlation and Simple Linear Regression
Correlation and Simple Linear Regression
Presentation transcript:

The greatest blessing in life is in giving and not taking.

Statistical Package Usage Topic: Simple Linear Regression By Prof Kelly Fan, Cal State Univ, East Bay

Overview Correlation analysis Linear regression model Goodness of fit of the model Model assumption checking How to handle outliers

Example: Weight vs. Height Gender Height (inches) Weight (pounds) Age M 68.5 155 23 F 61.2 99 20 63.0 115 21 70.0 205 45 68.6 170 -- 65.1 125 30 72.4 220 48

Correlation between X and Y X and Y might be related to each other in many ways: linear or curved. Correlation analysis provided here is only for the linear association.

(Pearson) Correlation Coefficient of X and Y A measurement of the strength of the “LINEAR” association between X and Y Sx: the standard deviation of the data values in X, Sy: the standard deviation of the data values in Y; the correlation coefficient of X and Y is:

Correlation Coefficient of X and Y The magnitude of r measures the strength of the linear association of X and Y The sign of r indicate the direction of the association: “-”  negative association “+”  positive association

Examples of Different Levels of Correlation Median Linearity r=.98 Strong Linearity

Examples of Different Levels of Correlation Nearly Uncorrelated r=.00 Nearly Curved

Example: Weight vs. Height

Graphical Summary of Two Quantitative Variable Scatterplot of response variable against explanatory variable What is the overall (average) pattern? What is the direction of the pattern? How much do data points vary from the overall (average) pattern? Any potential outliers?

Summary for Height and Weight Some Simple Conclusions Scatterplot (Weight vs. Height) Weight is somewhat Linearly related with Height Weight is Increasing as Height increases. Data points are more or less around the line. No potential outlier.

Regression Equation The regression line models the relationship between X and Y on average. The math equation of a regression line is called regression equation.

The Usage of Regression Equation Predict the value of Y for a given X value Eg. What is the weight for a student with 60” high?

Predicted Value (Fitted Value) is called “predicted Y,” pronounced as “y hat,” which estimates the average Y value for a specified X value. Eg. The predicted weight for a given height

The Limitation of the Regression Equation The regression equation cannot be used to predict Y value for the X values which are (far) beyond the range in which data are observed. Eg. The predicted WT of a given HT: Given HT of 50”, the regression equation will give us WT of -587.1+11.1x50 = -32.1 pounds!!

The Unpredicted Part The value is the part the regression equation (model) cannot predict, and it is called “residual.”

} residual

Goodness of Fit R^2 is the proportion of Y variance explained/accounted by the model we use to fit the data When there is only one X (simple linear regression) R^2 = r^2.

SAS Output

Confidence Interval of Mean Y

Prediction Interval

Model Checking Scatter plot shows linear pattern Normality: The residuals must follow a normal distribution Equal Variances: The variance of residuals must be constant over the different values of X

Checking the Normality

Tests for Normality

Checking the Equal Variances

Identify Outliers using Residual Plots Use “studentized” residuals!! The cases with studentized residuals of size 3 or more are outliers

Transformation If data show non-normal or unequal variances, then transform Y. If data show a non-linear pattern, then add X^2 into the model

The Influence of Outliers Type I Outlier The slope becomes bigger (toward outliers) The r value becomes smaller (less linear)

The Influence of Outliers Type II Outlier The slope becomes clear (toward outliers) The | r | value becomes larger (more linear: 0.1590.935)

How to Handle Outliers Use Spearman correlation coef (the r of rank of X and rank of Y) For Type I outliers: Try to find the special factor behind the outliers. Report two models with and without outliers. For Type II outliers: Report the model without outliers and mention the existence of outliers. Whenever possible, collect more data in the range of X where outliers appear.

SAS Output