Regression Statistics

Slides:



Advertisements
Similar presentations
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Advertisements

Computational Statistics. Basic ideas  Predict values that are hard to measure irl, by using co-variables (other properties from the same measurement.
Kin 304 Regression Linear Regression Least Sum of Squares
Correlation and regression
Chapter 12 Simple Linear Regression
Forecasting Using the Simple Linear Regression Model and Correlation
BA 275 Quantitative Business Methods
Regression Analysis Module 3. Regression Regression is the attempt to explain the variation in a dependent variable using the variation in independent.
Simple Linear Regression. G. Baker, Department of Statistics University of South Carolina; Slide 2 Relationship Between Two Quantitative Variables If.
Definition  Regression Model  Regression Equation Y i =  0 +  1 X i ^ Given a collection of paired data, the regression equation algebraically describes.
Simple Linear Regression
Chapter 12 Simple Linear Regression
1-1 Regression Models  Population Deterministic Regression Model Y i =  0 +  1 X i u Y i only depends on the value of X i and no other factor can affect.
The Simple Regression Model
Chapter Topics Types of Regression Models
Probability & Statistics for Engineers & Scientists, by Walpole, Myers, Myers & Ye ~ Chapter 11 Notes Class notes for ISE 201 San Jose State University.
This Week Continue with linear regression Begin multiple regression –Le 8.2 –C & S 9:A-E Handout: Class examples and assignment 3.
RLR. Purpose of Regression Fit data to model Known model based on physics P* = exp[A - B/(T+C)] Antoine eq. Assumed correlation y = a + b*x1+c*x2 Use.
Simple Linear Regression Analysis
1 1 Slide Simple Linear Regression Chapter 14 BA 303 – Spring 2011.
Correlation & Regression
Introduction to Linear Regression and Correlation Analysis
Regression Analysis Regression analysis is a statistical technique that is very useful for exploring the relationships between two or more variables (one.
1 Least squares procedure Inference for least squares lines Simple Linear Regression.
Stats for Engineers Lecture 9. Summary From Last Time Confidence Intervals for the mean t-tables Q Student t-distribution.
1 1 Slide © 2005 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 Dr. Jerrell T. Stracener EMIS 7370 STAT 5340 Probability and Statistics for Scientists and Engineers Department of Engineering Management, Information.
1 1 Slide © 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Regression. Population Covariance and Correlation.
Thomas Knotts. Engineers often: Regress data  Analysis  Fit to theory  Data reduction Use the regression of others  Antoine Equation  DIPPR.
2014. Engineers often: Regress data  Analysis  Fit to theory  Data reduction Use the regression of others  Antoine Equation  DIPPR We need to be.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Regression: Checking the Model Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
Regression Analysis Relationship with one independent variable.
Regression Analysis Part C Confidence Intervals and Hypothesis Testing
Prediction, Goodness-of-Fit, and Modeling Issues Prepared by Vera Tabakova, East Carolina University.
Simple Linear Regression (SLR)
AP STATISTICS LESSON 14 – 1 ( DAY 1 ) INFERENCE ABOUT THE MODEL.
Chapter 10: Determining How Costs Behave 1 Horngren 13e.
Curve Fitting Pertemuan 10 Matakuliah: S0262-Analisis Numerik Tahun: 2010.
Engineers often: Regress data to a model  Used for assessing theory  Used for predicting  Empirical or theoretical model Use the regression of others.
Statistics Review ChE 475 Fall 2015 Dr. Harding. 1. Repeated Data Points Use t-test based on measured st dev (s) true mean measured mean In Excel, =T.INV(
Regression Analysis1. 2 INTRODUCTION TO EMPIRICAL MODELS LEAST SQUARES ESTIMATION OF THE PARAMETERS PROPERTIES OF THE LEAST SQUARES ESTIMATORS AND ESTIMATION.
Dependent (response) Variable Independent (control) Variable Random Error XY x1x1 y1y1 x2x2 y2y2 …… xnxn ynyn Raw data: Assumption:  i ‘s are independent.
Chapter 12 Simple Linear Regression n Simple Linear Regression Model n Least Squares Method n Coefficient of Determination n Model Assumptions n Testing.
1 1 Slide The Simple Linear Regression Model n Simple Linear Regression Model y =  0 +  1 x +  n Simple Linear Regression Equation E( y ) =  0 + 
Bivariate Regression. Bivariate Regression analyzes the relationship between two variables. Bivariate Regression analyzes the relationship between two.
Lecture Slides Elementary Statistics Twelfth Edition
The simple linear regression model and parameter estimation
Regression Analysis AGEC 784.
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
AP Statistics Chapter 14 Section 1.
Ch12.1 Simple Linear Regression
Relationship with one independent variable
Simple Linear Regression
Slides by JOHN LOUCKS St. Edward’s University.
Simple Linear Regression - Introduction
Linear Regression Models
Statistical Methods For Engineers
CHAPTER 29: Multiple Regression*
Chapter 12 Inference on the Least-squares Regression Line; ANOVA
Prediction of new observations
BASIC REGRESSION CONCEPTS
Chapter 14 Inference for Regression
Simple Linear Regression
Relationship with one independent variable
Simple Linear Regression
Lesson 2.2 Linear Regression.
Regression and Correlation of Data
St. Edward’s University
Presentation transcript:

Regression Statistics 2013

Engineers and Regression Engineers often: Regress data Analysis Fit to theory Data reduction Use the regression of others Antoine Equation DIPPR We need to be able to report uncertainties associated with regression. Do the data fit the model? What are the errors in the prediction? What are the errors in the parameters?

Linear Regression There are two classes of regressions Non-linear “Linear” refers to the parameters, not the functional dependence of the independent variable You can use the Mathcad function “linfit” on linear equations

Quiz Linear Regression There are two classes of regressions Non-linear “Linear” refers to the parameters, not the functional dependence of the independent variable You can use the Mathcad function “linfit” on linear equations 1. 2. 3. 4. 5.

Straight Line Model residual (error) slope Intercept “X” data “Y” Measured “Y” Predicted

residual (error) “X” data “Y” predicted “Y” data Straight Line Model intercept 0.92291455 slope 0.516173934   1 2.749032178 1.439088483 1.309943694 2 3.719910224 2.362003033 1.357907192 3 0.925995017 3.284917582 -2.35892257 4 2.623482686 4.207832132 -1.58434945 5 6.539797342 5.130746681 1.409050661 6 6.779909177 6.053661231 0.726247946 7 4.946150401 6.976575781 -2.03042538 8 9.674178069 7.89949033 1.774687739 9 7.61959821 8.82240488 -1.20280667 10 7.650020996 9.745319429 -2.09529843 11 11.514 10.66823398 0.845766021 12 13.18285068 11.59114853 1.591702152 13 13.28173635 12.51406308 0.767673275 14 13.60444592 13.43697763 0.16746829 15 12.79535218 14.35989218 -1.56454 16 17.82374778 15.28280673 2.540941056 17 14.55068379 16.20572128 -1.65503748 42.76608602 SSE residual (error) “X” data “Y” predicted “Y” data Number of fitted parameters: 2 for a two-parameter model sum squared error mean squared error

The R2 Statistic A useful statistic but not definitive Tells you how well the data fit the model. It does not tell you if the model is correct. How much of the distribution of the data about the mean is described by the model.

Problems with R2

Residuals (ei) should be normally distributed

Residuals (ei) should be normally distributed

Other Questions About Fit How well does the line fit the data at each point (not just the mean)? In what range should the data lie (are there outliers)? What are the confidence intervals on the slope and intercept?

Confidence Intervals from Example Green is confidence interval How well does the model fit the data? Red is the prediction band Where should the data fall? Can I throw out any points?

Statistics on the Slope/Intercept When you fit data to a straight line, the slope and intercept are only estimates of the true slope and intercept. (1-a)100% Confidence Intervals Standard Errors (slope) (intercept)

Confidence Interval on the Prediction & Expected Range of Data Prediction Band Confidence Interval on the Prediction Given a specific value for x, 𝑋 0 , what is the error in 𝑌 0 ? Prediction Band: Expected Range of Data If I take more data, where will the data fall? (Where will 𝑌 0 be found if measured at 𝑋 0 ?)

Confidence Interval vs. Prediction Band Shows possible errors in where the line will go Interval narrows with increasing number of data points See equation on previous slide Prediction Band Shows where the data should lie Interval does not narrow much with increasing number of data points See equation on previous slide

In-Class Assignment Good News: IGOR! Igor has the equations already programmed and will Plot the confidence intervals Plot the prediction bands Print the  confidence intervals on the slope and intercept In-Class Assignment

Answers

Example Using Excel

Generalized Linear Regression Linear regression can be written in matrix form. X Y 21 186 24 214 32 288 47 425 50 455 59 539 68 622 74 675 62 562 453 41 370 30 274 Straight Line Model Quadratic Model

Statistics with Matrices Parameter Confidence Intervals Predicted Variable Confidence Intervals Standard Error of bi is the square root of the i-th diagonal term of the matrix