Why Model? Make predictions or forecasts where we don’t have data.

Slides:



Advertisements
Similar presentations
Inference in the Simple Regression Model
Advertisements

Lecture 17: Tues., March 16 Inference for simple linear regression (Ch ) R2 statistic (Ch ) Association is not causation (Ch ) Next.
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Regression and correlation methods
Computational Statistics. Basic ideas  Predict values that are hard to measure irl, by using co-variables (other properties from the same measurement.
The Multiple Regression Model.
Chap 12-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 12 Simple Regression Statistics for Business and Economics 6.
Forecasting Using the Simple Linear Regression Model and Correlation
Stat 112: Lecture 7 Notes Homework 2: Due next Thursday The Multiple Linear Regression model (Chapter 4.1) Inferences from multiple regression analysis.
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 12 l Multiple Regression: Predicting One Factor from Several Others.
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
11 Simple Linear Regression and Correlation CHAPTER OUTLINE
Regression Analysis Simple Regression. y = mx + b y = a + bx.
Regression Analysis Once a linear relationship is defined, the independent variable can be used to forecast the dependent variable. Y ^ = bo + bX bo is.
Linear regression models
Objectives (BPS chapter 24)
Introduction to Regression Analysis
© 2010 Pearson Prentice Hall. All rights reserved Least Squares Regression Models.
The Multiple Regression Model Prepared by Vera Tabakova, East Carolina University.
1-1 Regression Models  Population Deterministic Regression Model Y i =  0 +  1 X i u Y i only depends on the value of X i and no other factor can affect.
Chapter 10 Simple Regression.
Regression Analysis. Unscheduled Maintenance Issue: l 36 flight squadrons l Each experiences unscheduled maintenance actions (UMAs) l UMAs costs $1000.
Stat 112 – Notes 3 Homework 1 is due at the beginning of class next Thursday.
1 Pertemuan 13 Uji Koefisien Korelasi dan Regresi Matakuliah: A0392 – Statistik Ekonomi Tahun: 2006.
Gordon Stringer, UCCS1 Regression Analysis Gordon Stringer.
Pengujian Parameter Koefisien Korelasi Pertemuan 04 Matakuliah: I0174 – Analisis Regresi Tahun: Ganjil 2007/2008.
Chapter Topics Types of Regression Models
Lecture 16 – Thurs, Oct. 30 Inference for Regression (Sections ): –Hypothesis Tests and Confidence Intervals for Intercept and Slope –Confidence.
Quantitative Business Analysis for Decision Making Simple Linear Regression.
© 2000 Prentice-Hall, Inc. Chap Forecasting Using the Simple Linear Regression Model and Correlation.
Pertemua 19 Regresi Linier
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Simple Linear Regression and Correlation
Review for Exam 2 Some important themes from Chapters 6-9 Chap. 6. Significance Tests Chap. 7: Comparing Two Groups Chap. 8: Contingency Tables (Categorical.
Chapter 7 Forecasting with Simple Regression
Introduction to Regression Analysis, Chapter 13,
Simple Linear Regression Analysis
Linear Regression/Correlation
Forecasting Revenue: An Example of Regression Model Building Setting: Possibly a large set of predictor variables used to predict future quarterly revenues.
Correlation & Regression
Regression and Correlation Methods Judy Zhong Ph.D.
Regression Analysis Regression analysis is a statistical technique that is very useful for exploring the relationships between two or more variables (one.
Chapter 11 Simple Regression
CPE 619 Simple Linear Regression Models Aleksandar Milenković The LaCASA Laboratory Electrical and Computer Engineering Department The University of Alabama.
Simple Linear Regression Models
1 Least squares procedure Inference for least squares lines Simple Linear Regression.
Chapter 14 Simple Regression
OPIM 303-Lecture #8 Jose M. Cruz Assistant Professor.
© 2003 Prentice-Hall, Inc.Chap 13-1 Basic Business Statistics (9 th Edition) Chapter 13 Simple Linear Regression.
EQT 373 Chapter 3 Simple Linear Regression. EQT 373 Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Part 2: Model and Inference 2-1/49 Regression Models Professor William Greene Stern School of Business IOMS Department Department of Economics.
Lesson Multiple Regression Models. Objectives Obtain the correlation matrix Use technology to find a multiple regression equation Interpret the.
EMIS 7300 SYSTEMS ANALYSIS METHODS FALL 2005 Dr. John Lipp Copyright © Dr. John Lipp.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 13 Multiple Regression Section 13.3 Using Multiple Regression to Make Inferences.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Lecture 10: Correlation and Regression Model.
© Copyright McGraw-Hill 2004
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Simple Linear Regression Analysis Chapter 13.
26134 Business Statistics Week 4 Tutorial Simple Linear Regression Key concepts in this tutorial are listed below 1. Detecting.
Hypothesis Testing and Statistical Significance
Lecturer: Ing. Martina Hanová, PhD.. Regression analysis Regression analysis is a tool for analyzing relationships between financial variables:  Identify.
Linear model. a type of regression analyses statistical method – both the response variable (Y) and the explanatory variable (X) are continuous variables.
26134 Business Statistics Week 4 Tutorial Simple Linear Regression Key concepts in this tutorial are listed below 1. Detecting.
Lecturer: Ing. Martina Hanová, PhD..  How do we evaluate a model?  How do we know if the model we are using is good?  assumptions relate to the (population)
The simple linear regression model and parameter estimation
Why Model? Make predictions or forecasts where we don’t have data.
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Statistics for Managers using Microsoft Excel 3rd Edition
Linear Regression/Correlation
Presentation transcript:

Why Model? Make predictions or forecasts where we don’t have data

Linear Regression wikipedia

Modeling Process Observe Define Theory/ Type of Model Design Experiment Collect Data Select Model Evaluate the Model Qualify Data Estimate Parameters Publish Results

Bouncing Balls Observation: balls bounce more when dropped from higher height Theory: there is a linear relationship between the height of a drop and the number of bounces people.rit.edu

Bounding Balls (con’t) Experimental Design? Collect Data? Qualify Data? Select Model: –Start with linear regression

Parameter Estimation Excel spreadsheet X, Y columns Add “trend line”

Definitions Horizontal axis: Used to create prediction –Independent variable –Predictor variable –Covariate –Explanatory variable –Control variable –Typically a raster –Examples: Temperature, aspect, SST, precipitation Vertical axis: What we are trying to predict –Dependent variable –Response variable –Measured value –Explained –Outcome –Typically an attribute of points –Examples: Height, abundance, percent, diversity, …

Linear Regression: Assumptions Predictors are error free Linearity of response to predictors Constant variance within and for all predictors (homoscedasticity) Independence of errors Lack of multi-colinearity Also: –All points are equally important –Residuals are normally distributed (or close).

Linear Regression

Normal Distribution To positive infinity To negative infinity

Linear Data Fitted w/Linear Model Should be a diagonal line for normally distributed data

Non-Linear Data Fitted with a Linear Model This shows the residuals are not normally distributed

Homoscedasticity Residuals have the same normal distribution throughout the range of the data

Ordinary Least Squares

Linear Regression Residual

Parameter Estimation

Evaluate the Model

Evaluation Find the highest performing model in Excel for the golf ball data 1XMMIYhttps:// 1XMMIY

“Goodness of fit”

Good Model?

Two Approaches Hypothesis Testing –Is a hypothesis supported or not? –What is the chance that what we are seeing is random? Which is the best model? –Assumes the hypothesis is true (implied) –Model may or may not support the hypothesis Data mining –Discouraged in spatial modeling –Can lead to erroneous conclusions

Significance (p-value) H0 – Null hypothesis (flat line) Hypothesis – regression line not flat The smaller the p-value, the more evidence we have against H0 –Our hypothesis is probably true It is also a measure of how likely we are to get a certain sample result or a result “more extreme,” assuming H0 is true The chance the relationship is random

Confidence Intervals 95 percent of the time, values will fall within a 95% confidence interval Methods: –Moments (mean, variance) –Likelihood –Significance tests (p-values) –Bootstrapping

Model Evaluation Parameter sensitivity Ground truthing Uncertainty in data AND predictors –Spatial –Temporal –Attributes/Measurements Alternative models Alternative parameters

Model Evaluation?

Robust models Domain/scope is well defined Data is well understood Uncertainty is documented Model can be tied to phenomenon Model validated against other data Sensitivity testing completed Conclusions are within the domain/scope or are “possibilities” See: HuyMQ-S9jGshttps:// HuyMQ-S9jGs

Modeling Process II Investigate Find Data Select Model Evaluate the Model Qualify Data Estimate Parameters Publish Results

Research Papers Introduction –Background –Goal Methods –Area of interest –Data “sources” –Modeling approaches –Evaluation methods Results –Figures –Tables –Summary results Discussion –What did you find? –Broader impacts –Related results Conclusion –Next steps Acknowledgements –Who helped? References –Include long URLs