Trip Generation II Meeghat Habibian Transportation Demand Analysis


Content
Linear regression
Statistical tests
Aggregate vs. disaggregate approach
The dummy variable
Transferability and temporal stability of a model
Accessibility

Introduction
Calibrating models to forecast the trips produced from (or attracted to) each zone in the future.
Methods:
Graphs
Land-use-based factors (ITE)
Growth factor
Cross-classification
Linear regression

Transportation Demand Analysis Trip Generation 2 Linear Regression

Linear Regression
An approach for modeling the relationship between a dependent variable (y) and one or more independent variables ($x_k$):
$y = \beta_0 + \beta_1 x_1 + \beta_2 x_2 + \beta_3 x_3 + \beta_4 x_4 + \dots + \beta_k x_k + \varepsilon$
Goal: to calculate the coefficients ($\beta_i$) so as to minimize the sum of squared errors (the differences between the observations and the estimates).
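A minimal sketch of the least squares fit in Python with NumPy. The data and variable names are made up for illustration and are not from the slides. `np.linalg.lstsq` returns the coefficients that minimize the sum of squared errors.

```python
import numpy as np

# Hypothetical data: 6 zones, 2 independent variables per zone.
x = np.array([[1.0, 2.0], [2.0, 1.0], [3.0, 4.0],
              [4.0, 3.0], [5.0, 6.0], [6.0, 5.0]])
y = np.array([5.1, 5.9, 11.2, 11.8, 17.1, 18.0])

# Design matrix with a leading column of ones for the intercept beta_0.
X = np.column_stack([np.ones(len(y)), x])

# Least squares: beta minimizes sum((y - X @ beta) ** 2).
beta, *_ = np.linalg.lstsq(X, y, rcond=None)

residuals = y - X @ beta
ess = float(residuals @ residuals)   # error sum of squares at the minimum
```

At the minimum the residuals are orthogonal to every column of X, which is a quick way to check a fit.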

Matrix Notation
$Y = X\beta + \varepsilon$
n: number of observations
k: number of independent variables
Y and $\varepsilon$ are n×1 vectors, X is an n×(k+1) matrix whose first column of ones multiplies $\beta_0$, and $\beta$ is a (k+1)×1 vector.

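Assumptions for Error Term
1- $\varepsilon_i$ has a normal distribution: $\varepsilon_i \sim \text{Normal}$
2- The mean of $\varepsilon_i$ is 0: $E(\varepsilon_i) = 0$
3- The expectation of $\varepsilon_i^2$ is finite: $E(\varepsilon_i^2) = \sigma^2$
$Var(\varepsilon_i) = E[\varepsilon_i - E(\varepsilon_i)]^2 = E[\varepsilon_i - 0]^2 = E(\varepsilon_i^2) = \sigma^2$
Therefore: $\varepsilon_i \sim N(0, \sigma^2)$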

Assumptions for Error Term
4- Errors are independent: $\forall i \neq j$: $E(\varepsilon_i \varepsilon_j) = E(\varepsilon_i) E(\varepsilon_j) = 0$ (no autocorrelation)
5- The $X_i$ are deterministic, such that …
6- The number of observations is greater than the number of coefficients: $n > k+1$
7- The $X_i$ are independent of each other

Population (Society) and Sample
Calculations are based on the sample, but the results should reflect the population (society).
$E(y_i) = \beta_0 + \beta_1 x_i$
$\hat{\beta}_0 \sim N(\beta_0, \sigma_{\hat{\beta}_0}^2)$
$\hat{\beta}_1 \sim N(\beta_1, \sigma_{\hat{\beta}_1}^2)$

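Dependent Variable Distribution (y)
$y = \beta_0 + \beta_1 x_1 + \beta_2 x_2 + \beta_3 x_3 + \beta_4 x_4 + \dots + \beta_K x_K + \varepsilon$
Therefore: $y \sim N(\beta_0 + \beta_1 x_1 + \beta_2 x_2 + \beta_3 x_3 + \beta_4 x_4 + \dots + \beta_K x_K, \sigma^2)$
$E(y) = E\left(\beta_0 + \sum_{i=1}^{K} \beta_i x_i + \varepsilon\right) = E(\beta_0) + E\left(\sum_{i=1}^{K} \beta_i x_i\right) + E(\varepsilon) = \beta_0 + \sum_{i=1}^{K} E(\beta_i x_i) + 0 = \beta_0 + \sum_{i=1}^{K} x_i E(\beta_i) = \beta_0 + \sum_{i=1}^{K} x_i \beta_i$
$Var(y) = E\left[y - E(y)\right]^2 = E\left[\beta_0 + \sum_{i=1}^{K} \beta_i x_i + \varepsilon - \left(\beta_0 + \sum_{i=1}^{K} \beta_i x_i\right)\right]^2 = E(\varepsilon^2) = \sigma^2$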

Question?
Given (society, i.e., the population): $y \sim N(\beta_0 + \beta_1 x_i, \sigma^2)$
What are the mean and standard error of the coefficients estimated from the sample (i.e., $\hat{\beta}_0$ and $\hat{\beta}_1$)?
$\hat{\beta}_0 \sim N(\beta_0, \sigma_{\hat{\beta}_0}^2)$
$\hat{\beta}_1 \sim N(\beta_1, \sigma_{\hat{\beta}_1}^2)$

Remember from TP
We were looking for estimates of $\beta_0$ and $\beta_1$.
Error sum of squares definition:
$ESS = \sum_{i=1}^{n} (y_i - \hat{\beta}_0 - \hat{\beta}_1 x_i)^2$
We calculated $\hat{\beta}_0$ and $\hat{\beta}_1$ in order to minimize ESS.

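Remember from TP
$ESS = \sum_{i=1}^{n} (y_i - \hat{\beta}_0 - \hat{\beta}_1 x_i)^2$
$\frac{\partial ESS}{\partial \hat{\beta}_0} = -2 \sum_{i=1}^{n} (y_i - \hat{\beta}_0 - \hat{\beta}_1 x_i) = 0$
$\frac{\partial ESS}{\partial \hat{\beta}_1} = -2 \sum_{i=1}^{n} x_i (y_i - \hat{\beta}_0 - \hat{\beta}_1 x_i) = 0$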

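Remember from TP
By solving the preceding equations:
$\hat{\beta}_1 = \frac{\sum_{i=1}^{n} (x_i - \bar{x})(y_i - \bar{y})}{\sum_{i=1}^{n} (x_i - \bar{x})^2}$
$\hat{\beta}_0 = \bar{y} - \hat{\beta}_1 \bar{x}$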

Definitions
$S_{xy} = \sum_{i=1}^{n} (x_i - \bar{x})(y_i - \bar{y})$
$S_{xx} = \sum_{i=1}^{n} (x_i - \bar{x})^2$
Therefore:
$\hat{\beta}_1 = S_{xy} / S_{xx}$
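The closed-form estimates can be sketched in a few lines of Python. This assumes the standard definitions $S_{xx} = \sum (x_i - \bar{x})^2$, $\hat{\beta}_1 = S_{xy}/S_{xx}$ and $\hat{\beta}_0 = \bar{y} - \hat{\beta}_1 \bar{x}$. The function name and the sample data are hypothetical.

```python
import numpy as np

def simple_ols(x, y):
    """Return (b0, b1) for y = b0 + b1*x using b1 = Sxy / Sxx."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    sxy = np.sum((x - x.mean()) * (y - y.mean()))
    sxx = np.sum((x - x.mean()) ** 2)
    b1 = sxy / sxx
    b0 = y.mean() - b1 * x.mean()
    return b0, b1

# Hypothetical sample: cars per household vs. daily trips.
cars = [0, 1, 1, 2, 2, 3]
trips = [2, 4, 5, 7, 6, 9]
b0, b1 = simple_ols(cars, trips)
```

The result should agree with a general-purpose fit such as `np.polyfit(cars, trips, 1)`.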

Demonstration:

We can rewrite $\hat{\beta}_1$ as a linear combination of the $y_i$ as follows:
$\hat{\beta}_1 = \sum_{i=1}^{n} c_i y_i$, with $c_i = \frac{x_i - \bar{x}}{\sum_{j=1}^{n} (x_j - \bar{x})^2}$

$E\left[\left(\sum_i c_i \varepsilon_i\right)^2\right] = E\left[\sum_i c_i^2 \varepsilon_i^2 + 2 \sum_i \sum_{j>i} c_i c_j \varepsilon_i \varepsilon_j\right]$

Page 43; Slide 166:

Demonstrate that $\hat{\beta}_0$ is a linear combination of the $y_i$:

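Conclusion
The least squares estimators $\hat{\beta}$ are normally distributed as follows:
$\hat{\beta}_1 \sim N\left(\beta_1, \frac{\sigma^2}{\sum_{i=1}^{n} (x_i - \bar{x})^2}\right)$
$\hat{\beta}_0 \sim N\left(\beta_0, \sigma^2 \left(\frac{1}{n} + \frac{\bar{x}^2}{\sum_{i=1}^{n} (x_i - \bar{x})^2}\right)\right)$
Best Linear Unbiased Estimator = BLUE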
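The sampling distribution of the slope can be checked with a small simulation. This is a sketch under stated assumptions. The "true" values of $\beta_0$, $\beta_1$ and $\sigma$ are made up, the x values are held fixed, and the theoretical variance used for comparison is the standard result $\sigma^2 / \sum (x_i - \bar{x})^2$.

```python
import numpy as np

rng = np.random.default_rng(0)
beta0, beta1, sigma = 2.0, 1.5, 1.0        # assumed population values
x = np.linspace(0.0, 4.0, 25)              # fixed (deterministic) x values
sxx = np.sum((x - x.mean()) ** 2)

estimates = []
for _ in range(5000):
    # Draw a new sample of y for the same x, then re-estimate the slope.
    y = beta0 + beta1 * x + rng.normal(0.0, sigma, size=x.size)
    b1 = np.sum((x - x.mean()) * (y - y.mean())) / sxx
    estimates.append(b1)
estimates = np.array(estimates)

mean_b1 = estimates.mean()          # expected to be close to beta1 (unbiased)
var_b1 = estimates.var(ddof=1)      # expected to be close to sigma**2 / sxx
theory_var = sigma**2 / sxx
```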

Sample standard error
$\sigma$ is a measure of the variability of y; lower values of $\sigma^2$ show that the observations are closer to the regression line.
For 1 independent variable: $\hat{\sigma}^2 = \frac{ESS}{n-2}$
For k independent variables: $\hat{\sigma}^2 = \frac{ESS}{n-k-1}$
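A short sketch of the sample estimate of $\sigma^2$, assuming the usual form ESS / (n - k - 1). With k = 1 this reduces to ESS / (n - 2). The function name is hypothetical.

```python
import numpy as np

def residual_variance(X, y):
    """Estimate sigma^2 as ESS / (n - k - 1).

    X holds the k independent variables (without the column of ones).
    """
    y = np.asarray(y, dtype=float)
    X = np.asarray(X, dtype=float).reshape(len(y), -1)
    n, k = X.shape
    A = np.column_stack([np.ones(n), X])           # add the intercept column
    beta, *_ = np.linalg.lstsq(A, y, rcond=None)
    ess = np.sum((y - A @ beta) ** 2)              # error sum of squares
    return ess / (n - k - 1)

# Tiny hypothetical sample with one independent variable.
s2 = residual_variance([0, 1, 2, 3], [0, 1, 1, 2])
```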

Transportation Demand Analysis Trip Generation 2 Statistical Tests

Correlation Coefficient
The square of the correlation coefficient R is called the coefficient of determination, $R^2$.
$-1 \le R \le 1$

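Goodness of Fit Measure
Define:
TSS (total sum of squares) $= \sum_{i=1}^{n} (y_i - \bar{y})^2$
RSS (regression sum of squares) $= \sum_{i=1}^{n} (\hat{y}_i - \bar{y})^2$
ESS (error sum of squares) $= \sum_{i=1}^{n} (y_i - \hat{y}_i)^2$
TSS = RSS + ESS
$R^2 = RSS / TSS$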
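The decomposition TSS = RSS + ESS can be verified numerically. The sketch below assumes RSS denotes the regression (explained) sum of squares and ESS the error sum of squares, with $R^2 = RSS/TSS$. The data are hypothetical.

```python
import numpy as np

def sums_of_squares(x, y):
    """Return (TSS, RSS, ESS, R2) for a simple regression of y on x."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    b1, b0 = np.polyfit(x, y, 1)            # slope, intercept
    y_hat = b0 + b1 * x
    tss = np.sum((y - y.mean()) ** 2)       # total
    rss = np.sum((y_hat - y.mean()) ** 2)   # explained by the regression
    ess = np.sum((y - y_hat) ** 2)          # error (residual)
    return tss, rss, ess, rss / tss

tss, rss, ess, r2 = sums_of_squares([0, 1, 2, 3], [0, 1, 1, 2])
```

For a single independent variable, `r2` also equals the squared correlation coefficient between x and y.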

Coefficient Significance
Student's t-test (T-test)
H0: $\beta_i = 0$, with $T_0 = \hat{\beta}_i / SE(\hat{\beta}_i)$
T-table → Tcritical(α, n−k−1)
|T0| < Tcritical → H0 cannot be rejected at the α level
|T0| > Tcritical → H0 is rejected at the α level
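A sketch of the test statistic for the slope of a simple regression. It assumes H0: $\beta_1 = 0$ and the standard error formula $SE(\hat{\beta}_1) = \sqrt{\hat{\sigma}^2 / S_{xx}}$ with $\hat{\sigma}^2 = ESS/(n-2)$. The function name and data are hypothetical, and the critical value still has to be read from a t-table with n − k − 1 degrees of freedom.

```python
import numpy as np

def slope_t_statistic(x, y):
    """Return (b1, se_b1, t0) for H0: beta1 = 0 in a simple regression."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    n = x.size
    sxx = np.sum((x - x.mean()) ** 2)
    b1 = np.sum((x - x.mean()) * (y - y.mean())) / sxx
    b0 = y.mean() - b1 * x.mean()
    ess = np.sum((y - b0 - b1 * x) ** 2)
    s2 = ess / (n - 2)                 # k = 1, so n - k - 1 = n - 2
    se_b1 = np.sqrt(s2 / sxx)
    return b1, se_b1, b1 / se_b1

b1, se_b1, t0 = slope_t_statistic([0, 1, 2, 3], [0, 1, 1, 2])
```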

Statistical Hypothesis
F test: for checking the whole model
H0: $R^2 = 0$ (RSS = 0)
$F_0 = \frac{RSS / k}{ESS / (n-k-1)}$
k: number of independent variables
F0 < Fcritical → H0 cannot be rejected at the α level
F0 > Fcritical → H0 is rejected at the α level
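A minimal sketch of the F statistic, assuming the usual form $F_0 = (RSS/k) / (ESS/(n-k-1))$. The second function is an equivalent version written in terms of $R^2$. Function names are hypothetical, and Fcritical still comes from an F-table with (k, n − k − 1) degrees of freedom.

```python
def f_statistic(rss, ess, n, k):
    """F0 = (RSS / k) / (ESS / (n - k - 1)) for H0: all slope coefficients are zero."""
    return (rss / k) / (ess / (n - k - 1))

def f_from_r2(r2, n, k):
    """Same statistic written in terms of R2 = RSS / TSS."""
    return (r2 / k) / ((1.0 - r2) / (n - k - 1))

# Illustrative call: R2 = 0.34 with 357 observations and 2 independent variables.
f0 = f_from_r2(0.34, n=357, k=2)
```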

Choosing the Best Model
1- Low correlation among the independent variables (IVs)
2- Intuitive signs of the coefficients
3- Student's t-test
4- Smaller value of $\beta_0$
5- Goodness-of-fit value
6- F test
7- Lower number of variables

Transportation Demand Analysis Trip Generation 2 Aggregate vs. Disaggregate Approaches

Modeling Approaches
Disaggregate models
Lower sample size
Aggregate total (zonal average level)
Aggregate rate (household-based average level)
Higher variation
Lower goodness of fit
Makes more sense

Problem definition
$y_j$: Household j daily trips
$x_j$: Household j car ownership
$Y_J$: Household-based zonal daily trips (zone J)
$X_J$: Household-based zonal car ownership (zone J)

General Relationships (I)

Error distributions

General Relationships (II)
$\bar{X} = \frac{1}{T} \sum_{J=1}^{T} X_J$

General Relationships (III)

Variation of $\hat{\beta}_1$

General Comparison
$R^2$ values for aggregate models are much higher than those for disaggregate models.
Coefficients of aggregate models have greater variance than those of disaggregate models.
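The effect of aggregation on $R^2$ can be illustrated with synthetic data. Everything in this sketch is an assumption made for illustration: the number of zones and households, the coefficients, the noise levels, and the simplification of treating car ownership as a continuous variable.

```python
import numpy as np

def r_squared(x, y):
    """R2 of a simple regression equals the squared correlation coefficient."""
    return float(np.corrcoef(x, y)[0, 1] ** 2)

rng = np.random.default_rng(1)
zones, households = 30, 50                       # assumed sizes
zone_mean_cars = rng.uniform(0.5, 2.5, zones)    # zones differ in car ownership

# Household-level (disaggregate) data: x_j = cars, y_j = daily trips.
x = zone_mean_cars[:, None] + rng.normal(0.0, 0.5, (zones, households))
y = 1.0 + 1.5 * x + rng.normal(0.0, 2.0, (zones, households))

r2_disaggregate = r_squared(x.ravel(), y.ravel())
# Aggregate rate: one zonal-average observation per zone.
r2_aggregate = r_squared(x.mean(axis=1), y.mean(axis=1))
```

Averaging within zones removes most of the household-level noise, so the aggregate fit looks better even though the underlying behaviour is the same.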

Transportation Demand Analysis Trip Generation 2 The Dummy Variable

Why Do We Need Dummy Variables?
1. Non-quantifiable variables, e.g., gender, marital status, …
2. Variables with a non-uniform effect (in different intervals), e.g., age, distance, …
Figure panels: uniform effect; non-uniform effect

Structure
Dummy variable = 1 if a condition is satisfied; 0 otherwise
Example:
D1 = 1 if a household has 1 car; 0 otherwise
D2 = 1 if a household has 2 or more cars; 0 otherwise

Example (Trip Generation)
An ordinary regression model:
Trip number = $\beta_0 + \beta_1$ (Car ownership) $+ \beta_2$ (HHSZ)
Figure: trips plotted against cars as parallel lines of slope $\beta_1$, one per household size, with intercepts $\beta_0$ (HHSZ=0), $\beta_0 + \beta_2$ (HHSZ=1) and $\beta_0 + 2\beta_2$ (HHSZ=2).

Example (Trip Generation)
An ordinary regression model:
Trip number = $\beta_0 + \beta_1$ (Car ownership) $+ \beta_2$ (HHSZ)
A dummy-enhanced model:
Trip number = $\beta_0 + \beta_1 X_{2i} + \dots + \beta_5 X_{6i} + \beta_6 Z_{2i} + \beta_7 Z_{3i} + \varepsilon_i$
$X_{1i}$ = 1 if household i has 1 member; 0 otherwise
$X_{2i}$ = 1 if household i has 2 members; 0 otherwise
…
$X_{6i}$ = 1 if household i has 6 members; 0 otherwise
$Z_{1i}$ = 1 if household i has no car; 0 otherwise
$Z_{2i}$ = 1 if household i has 1 car; 0 otherwise
$Z_{3i}$ = 1 if household i has 2+ cars; 0 otherwise
Therefore, for household i:
$X_{1i} + X_{2i} + X_{3i} + X_{4i} + X_{5i} + X_{6i} = 1$
$Z_{1i} + Z_{2i} + Z_{3i} = 1$

Example (Trip Generation)
Dummy-enhanced model:
Trip number = $\beta_0 + \beta_1 X_{2i} + \dots + \beta_5 X_{6i} + \beta_6 Z_{2i} + \beta_7 Z_{3i} + \varepsilon_i$
$X_{1i}, X_{2i}, \dots, X_{6i}$ = 1 if household i has 1, 2, …, 6 members respectively; 0 otherwise
$Z_{1i}, Z_{2i}, Z_{3i}$ = 1 if household i has no car, 1 car, or 2+ cars respectively; 0 otherwise
Examples:
Household containing 1 member and no car: $\hat{Y} = \beta_0$
Household containing 1 member and 1 car: $\hat{Y} = \beta_0 + \beta_6$
Household containing 3 members and 3 cars: $\hat{Y} = \beta_0 + \beta_2 + \beta_7$ (in the model as written, $X_{3i}$ carries $\beta_2$ and $Z_{3i}$ carries $\beta_7$)
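The dummy coding can be sketched as follows. The function names and coefficient values are hypothetical. The base levels (1 member, no car) are excluded, so with the model as written $X_{2i}$ … $X_{6i}$ carry $\beta_1$ … $\beta_5$ and $Z_{2i}$, $Z_{3i}$ carry $\beta_6$, $\beta_7$.

```python
def encode_household(members, cars):
    """Return the dummies [X2..X6, Z2, Z3]; 1 member and no car are the base levels."""
    if not 1 <= members <= 6:
        raise ValueError("the model covers households of 1 to 6 members")
    x = [1 if members == m else 0 for m in range(2, 7)]   # X2i .. X6i
    z = [1 if cars == 1 else 0, 1 if cars >= 2 else 0]    # Z2i, Z3i
    return x + z

def predict_trips(beta, members, cars):
    """beta = [b0, b1, ..., b7]; returns b0 plus the coefficients switched on."""
    dummies = encode_household(members, cars)
    return beta[0] + sum(b * d for b, d in zip(beta[1:], dummies))

# Hypothetical coefficients b0..b7, chosen only to make the example readable.
beta = [1.0, 0.8, 1.5, 2.1, 2.6, 3.0, 1.2, 2.0]
trips_3_members_3_cars = predict_trips(beta, members=3, cars=3)   # b0 + b2 + b7
```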

Notes
No two dummy variables may overlap.
The union of all levels must be the universal set.
At least one level (known as the base level) should be excluded from the model.
A multiplicative dummy (e.g., XZ) may also be used, which is a more complicated case.

Transportation Demand Analysis Trip Generation 2 Transferability and Temporal Stability of a Model

Model Transferability
Aggregate model
  Aggregate total: different zone size, topography, …
  Aggregate rate: households' behavior can be similar
Disaggregate model
  Individuals may behave more similarly
  E.g., new mass transit choice for new towns

Temporal Stability
Using a model after a long period in the future
Should a new dataset be collected for the new time (t2)?
Two issues should be checked (small dataset):
Macro: observation-estimation graph
Micro: statistical assessment of coefficients

Macro (Observation – Estimation Graph)
Use an observation-estimation graph:
$Y_{t2} = \alpha + \beta \hat{Y}_{t1}$
Statistically, α should be 0 and β should be 1.
Figure: $Y_{t2}$ (vertical axis) against $\hat{Y}_{t1}$ (horizontal axis), with intercept α and slope β.
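The macro check can be sketched by regressing the t2 observations on the estimates produced by the t1 model. The data here are synthetic and stable by construction, and the t1 coefficients are made up, so this only shows the mechanics rather than any real result.

```python
import numpy as np

rng = np.random.default_rng(2)

# Hypothetical t1 model (made-up coefficients) applied to t2 households.
cars_t2 = rng.integers(0, 4, size=200)
hhsz_t2 = rng.integers(1, 7, size=200)
y_hat_t1 = -0.4 + 1.4 * cars_t2 + 1.9 * hhsz_t2     # estimates from the t1 model

# Synthetic t2 observations: same behaviour plus noise (a stable case).
y_t2 = y_hat_t1 + rng.normal(0.0, 2.0, size=200)

# Fit Y_t2 = alpha + beta * Y_hat_t1.
beta, alpha = np.polyfit(y_hat_t1, y_t2, 1)         # slope, intercept
```

In a real application the fitted `alpha` and `beta` would then be tested against 0 and 1 using their standard errors.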

Micro (Statistical Assessment of Coefficients)
The differences between the respective βs should be statistically 0.
As the βs are normally distributed, their difference is also normally distributed:
$\beta_{t1} - \beta_{t2} \sim N\left(0, SE^2(\beta_{t1} - \beta_{t2})\right)$

Micro (Example)
Trip generation data for 357 households are available for times t1 and t2.
Trip number = β0 + β1 (Car ownership) + β2 (HHSZ)
Two models have been calibrated as follows:

            t1             t2
β0          -0.45          -0.19
β1 (SE)     1.40 (0.13)    1.46 (0.15)
β2 (SE)     1.92 (0.31)    1.52 (0.27)
R2          0.34           0.36

Micro (Example)
Assessing β1: 1.40 (SE 0.13) at t1 and 1.46 (SE 0.15) at t2
Trip number = β0 + β1 (Car ownership) + β2 (HHSZ)
H0: βt1 − βt2 = 0
T = (βt1 − βt2) / SE(βt1 − βt2)
Remember: Var(X − Y) = Var(X) + Var(Y) − 2Cov(X, Y)
Independent samples: Cov(βt1, βt2) = 0
SE² = (0.13)² + (0.15)² − 2×0 = 0.0394 → SE = 0.198
T = (1.40 − 1.46) / 0.198 = −0.303 → H0 cannot be rejected

Micro (Example)
Assessing β2: 1.92 (SE 0.31) at t1 and 1.52 (SE 0.27) at t2
Trip number = β0 + β1 (Car ownership) + β2 (HHSZ)
H0: βt1 − βt2 = 0
T = (βt1 − βt2) / SE(βt1 − βt2)
Remember: Var(X − Y) = Var(X) + Var(Y) − 2Cov(X, Y)
Independent samples: Cov(βt1, βt2) = 0
SE² = (0.31)² + (0.27)² − 2×0 = 0.169 → SE = 0.411
T = (1.92 − 1.52) / 0.411 = 0.97 → H0 cannot be rejected
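The two tests above can be reproduced in a few lines. The function name is hypothetical, and the inputs are the coefficients and standard errors from the example, with independent samples (zero covariance) assumed as on the slide.

```python
import math

def difference_t(b_t1, se_t1, b_t2, se_t2):
    """T for H0: b_t1 - b_t2 = 0, assuming independent samples (zero covariance)."""
    se_diff = math.sqrt(se_t1**2 + se_t2**2)
    return (b_t1 - b_t2) / se_diff

t_beta1 = difference_t(1.40, 0.13, 1.46, 0.15)   # car ownership
t_beta2 = difference_t(1.92, 0.31, 1.52, 0.27)   # household size
print(round(t_beta1, 2), round(t_beta2, 2))      # -0.3 0.97
```

Both values are small in absolute terms, which matches the conclusion that H0 cannot be rejected for either coefficient.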

Notes
The role of the variables must not change during the studied period, e.g., car as a proxy for income vs. car as an essential instrument in an individual's lifestyle.
Pooling the data and using a corresponding dummy variable is also recommended for calibrating a model over a long period with two datasets.

Transportation Demand Analysis Trip Generation 2 Accessibility

Necessity
Trip generation models are not sensitive to policy making, because they are not sensitive to the attributes of the transportation system (e.g., time and cost).
Note: time and cost depend on both the origin and the destination, which are not known at the trip generation stage.

Definition
$Acc_i = \sum_j c_{ij} A_j$
$Acc_i$: accessibility index for zone i
$c_{ij}$: cost of travel between zones i and j
$A_j$: opportunities at zone j (e.g., population, number of students, number of schools)
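The index is a matrix-vector product, which the sketch below computes for all zones at once. The three-zone numbers and the function name are hypothetical, and the $c_{ij}$ values are used exactly as the formula has them. In practice the $c_{ij}$ term is often taken as a decreasing function of travel cost, which is an assumption beyond what the slide states.

```python
import numpy as np

def accessibility(cost, opportunities):
    """Acc_i = sum_j c_ij * A_j, computed for every zone i at once."""
    cost = np.asarray(cost, dtype=float)                  # shape (zones, zones)
    opportunities = np.asarray(opportunities, dtype=float)
    return cost @ opportunities

# Hypothetical 3-zone example: c_ij terms and A_j = jobs in zone j.
c = [[1.0, 0.5, 0.2],
     [0.5, 1.0, 0.4],
     [0.2, 0.4, 1.0]]
jobs = [100, 200, 50]
acc = accessibility(c, jobs)
```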

Finish