Matt’s Schedule. Headway Variation Estimated Load vs. Passenger Movement.

Slides:



Advertisements
Similar presentations
Direction Direction = 0: Northbound Direction = 1: Southbound.
Advertisements

DESCRIBING DISTRIBUTION NUMERICALLY
Empirical Model Building I: Objectives: By the end of this class you should be able to: find the equation of the “best fit” line for a linear model explain.
D2 Roadway Discussion Sound Transit Board September 22, 2011.
© Copyright 2001, Alan Marshall1 Regression Analysis Time Series Analysis.
Introduction to Regression ©2005 Dr. B. C. Paul. Things Favoring ANOVA Analysis ANOVA tells you whether a factor is controlling a result It requires that.
1 Reliability-Based Timepoint Schedules for Long Headway Transit Routes Peter G. Furth, Northeastern University with Theo H.J. Muller, Delft University.
Overview Motivation Data and Sources Methods Results Summary.
CH 27. * Data were collected on 208 boys and 206 girls. Parents reported the month of the baby’s birth and age (in weeks) at which their child first crawled.
Lecture 5 Bus Rapid Transit, ridership estimation procedures and headway requirements Dr. Muhammad Adnan.
Stat 112: Lecture 15 Notes Finish Chapter 6: –Review on Checking Assumptions (Section ) –Outliers and Influential Points (Section 6.7) Homework.
BA 555 Practical Business Analysis
1 BA 275 Quantitative Business Methods Residual Analysis Multiple Linear Regression Adjusted R-squared Prediction Dummy Variables Agenda.
Class 5: Thurs., Sep. 23 Example of using regression to make predictions and understand the likely errors in the predictions: salaries of teachers and.
Jan Shapes of distributions… “Statistics” for one quantitative variable… Mean and median Percentiles Standard deviations Transforming data… Rescale:
1 The Basics of Regression Regression is a statistical technique that can ultimately be used for forecasting.
Simple Linear Regression Analysis
Cheryl Thole, Jennifer Flynn CUTR/NBRTI, Senior Research Associates Transit in GIS Conference September 14, 2011 St. Petersburg, Florida.
1 BA 555 Practical Business Analysis Review of Statistics Confidence Interval Estimation Hypothesis Testing Linear Regression Analysis Introduction Case.
2011 Long-Term Load Forecast Review ERCOT Calvin Opheim June 17, 2011.
Fall, 2012 EMBA 512 Demand Forecasting Boise State University 1 Demand Forecasting.
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 13 Linear Regression and Correlation.
Advantages of Multivariate Analysis Close resemblance to how the researcher thinks. Close resemblance to how the researcher thinks. Easy visualisation.
How to Analyze Data? Aravinda Guntupalli. SPSS windows process Data window Variable view window Output window Chart editor window.
Joel Koepke, P.E. ERCOT Operations Support Engineer ERCOT Experiences During Summer 2011.
Handling Data and Figures of Merit Data comes in different formats time Histograms Lists But…. Can contain the same information about quality What is meant.
Calculus is the branch of mathematics that we use to describe change.
© 1998, Geoff Kuenning General 2 k Factorial Designs Used to explain the effects of k factors, each with two alternatives or levels 2 2 factorial designs.
STAT E100 Section Week 3 - Regression. Review  Descriptive Statistics versus Hypothesis Testing  Outliers  Sample vs. Population  Residual Plots.
Warm-Up 1.The probability distribution for the number of games played in each World Series for the years 1923–2004 is given below. Find the expected number.
TRB 88th Annual Meeting, Washington DC January, 2009 Huan Li and Robert L. Bertini Transportation Research Board 88th Annual Meeting Washington, DC January.
Statistical Analysis. Statistics u Description –Describes the data –Mean –Median –Mode u Inferential –Allows prediction from the sample to the population.
Materials developed by K. Watkins, J. LaMondia and C. Brakewood Timetabling Components Unit 5: Staff & Fleet Scheduling.
Climographs Climate is what we expect, weather is what we get… Mark Twain What is Mark Twain saying in this quote? – Discuss with your partner, provide.
1 Chapter 4: Describing Distributions 4.1Graphs: good and bad 4.2Displaying distributions with graphs 4.3Describing distributions with numbers.
Objective: Understanding and using linear regression Answer the following questions: (c) If one house is larger in size than another, do you think it affects.
Maths Study Centre CB Open 11am – 5pm Semester Weekdays
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 7 Section 4 – Slide 1 of 11 Chapter 7 Section 4 Assessing Normality.
Measures of Center vs Measures of Spread
June 11, 2008Stat Lecture 10 - Review1 Midterm review Chapters 1-5 Statistics Lecture 10.
STA 286 week 131 Inference for the Regression Coefficient Recall, b 0 and b 1 are the estimates of the slope β 1 and intercept β 0 of population regression.
I was on the second day of a road trip when I decided to keep a record of how far I had traveled from home. The table below shows how many hours I drove.
Warm-up 2.4 Summary Statistics 1.The following is an cumulative percent of the number of ounces of alcohol (one ounce is about 30 milliliters) consumed.
Slide 1 DSCI 5340: Predictive Modeling and Business Forecasting Spring 2013 – Dr. Nick Evangelopoulos Lecture 2: Review of Multiple Regression (Ch. 4-5)
Copyright © Cengage Learning. All rights reserved. 13 Linear Correlation and Regression Analysis.
MEASURES OF VARIATION OR DISPERSION THE SPREAD OF A DATA SET.
6-8 Measures of Central Tendency and Variation Objective Find measures of central tendency and measures of variation for statistical data. Examine the.
Bus Service Reliability Developing new performance measures Unreliability Vs. Bus Bunching Meead Saberi K.
CHAPTER 8 Linear Regression. Residuals Slide  The model won’t be perfect, regardless of the line we draw.  Some points will be above the line.
Model Validation of Transit Ridership at the Corridor and Transit Route Level by Mark Charnews October 19, 2006.
Materials developed by K. Watkins, J. LaMondia and C. Brakewood Frequency Determination Unit 5: Staff & Fleet Scheduling.
Applied Quantitative Analysis and Practices LECTURE#30 By Dr. Osman Sadiq Paracha.
Copyright © 2011 Pearson Education, Inc. Regression Diagnostics Chapter 22.
Transit Signal Priority: The Importance of AVL Data David T. Crout Tri-County Metropolitan Transportation District of Oregon (TriMet) Presented at Transportation.
Making Headways Smart Card Fare Payment and Bus Dwell Time in Los Angeles Daniel Shockley Fehr & Peers Julia Salinas Los Angeles Metropolitan Transportation.
TIME SERIES ‘Time series’ data is a bivariate data, where the independent variable is time. We use scatterplot to display the relationship between the.
DSWG - June 25, Four-CP Response for Transmission- and Distribution- Connected ESIIDs in ERCOT Competitive Area Carl L Raish.
Statistics 8 Linear Regression. Fat Versus Protein: An Example The following is a scatterplot of total fat versus protein for 30 items on the Burger King.
Forecasting. Model with indicator variables The choice of a forecasting technique depends on the components identified in the time series. The techniques.
STANDARD: 8.SP.1 STATISTICS AND PROBABILITY: SCATTER PLOTS.
Predicting Energy Consumption in Buildings using Multiple Linear Regression Introduction Linear regression is used to model energy consumption in buildings.
Belinda Boateng, Kara Johnson, Hassan Riaz
Regression model Y represents a value of the response variable.
Stats Club Marnie Brennan
Transit Signal Priority: The Importance of AVL Data
Data Analysis and Statistical Software I ( ) Quarter: Autumn 02/03
Summary (Week 1) Categorical vs. Quantitative Variables
Summary (Week 1) Categorical vs. Quantitative Variables
Indicator Variables Response: Highway MPG
Correlation and Covariance
Presentation transcript:

Matt’s Schedule

Headway Variation

Estimated Load vs. Passenger Movement

Weather

Interesting to note the below average passenger boardings in the summer and x-mas week Need to calculate the average by quarter or by month, since the summer is a distinct season

I tried to normalize the data, creating a summer and non-summer period to account for the lower ridership over the summer…not sure if the dates I picked for the normalization are the best. In this chart, summer is June, July or August. I could probably be more precise to match the school year.

Boardings vs Ave Temp AM Average, Direction = 1

Dwell vs. Ave Temp AM Average, Direction = 1

Trip Time vs. Ave Temp AM Average, Direction = 1

Boardings vs. Precipitation AM Average, Direction = 1

Boardings vs Ave Temp AM Average, Direction = 1

Trip Time vs. Precipitation AM Average, Direction = 1

Trip Time vs. Ave Temp AM Average, Direction = 1

Dwell vs. Precipitation AM Average, Direction = 1

Dwell vs. Ave Temp AM Average, Direction = 1

Boardings vs. Precipitation Deviation from Mean

Boardings vs. Ave Temp Deviation from Mean

Trip Time vs. Precipitation Deviation from Mean

Trip Time vs. Ave Temp Deviation from Mean

Dwell Time Scatter Plots

Dwell 3-D

Dwell 3-D Axes Reversed

Dwell Regression Dwell <= 1 min, Boardings Only X1 = Boardings X2 = Alightings X3 = Late (> 3 minutes) X4 = Timepoint (dummy) X5 = Precipitation X6 = Ave Temp

Dwell Regression Dwell <= 1 min, Boardings Only X1 = Boardings X2 = Alightings X3 = Late (> 3 minutes) X4 = Timepoint (dummy) X5 = Precipitation X6 = Ave Temp X7 = Boardings 2 X8 = Alightings 2

Dwell Regression Dwell <= 1 min, Alightings Only X1 = Boardings X2 = Alightings X3 = Late (> 3 minutes) X4 = Timepoint (dummy) X5 = Precipitation X6 = Ave Temp

Dwell Regression Dwell <= 1 min, Alightings Only X1 = Boardings X2 = Alightings X3 = Late (> 3 minutes) X4 = Timepoint (dummy) X5 = Precipitation X6 = Ave Temp X7 = Boardings 2 X8 = Alightings 2

Dwell Regression Dwell <= 1 min, Both Boardings & Alightings X1 = Boardings X2 = Alightings X3 = Late (> 3 minutes) X4 = Timepoint (dummy) X5 = Precipitation X6 = Ave Temp

Dwell Regression Dwell <= 1 min, Both Boardings & Alightings X1 = Boardings X2 = Alightings X3 = Late (> 3 minutes) X4 = Timepoint (dummy) X5 = Precipitation X6 = Ave Temp X7 = Boardings 2 X8 = Alightings 2

Trip Time Model Modified Ahmed Version X1 = Distance (in miles) X2 = Scheduled Number of Stops X3 = Direction or Southbound X4 = AM Peak X5 = PM Peak X6 = Actual Number of Stops X7 = Total Boardings X8 = Boardings Squared X9 = Total Alightings X10 = Alightings Squared X11 = Lift X12 = Average Passenger Load X13 = Total Dwell Time X14 = Precipitation X15 = Average Temperature X16 = Summer (dummy variable if month = June thru August) X17 = Friday (dummy)

Trip Time Model Modified Ahmed Version X1 = Distance (in miles) X2 = Scheduled Number of Stops X3 = Direction or Southbound X4 = AM Peak X5 = PM Peak X6 = Actual Number of Stops X7 = Total Boardings X8 = Boardings Squared X9 = Total Alightings X10 = Alightings Squared X11 = Lift X12 = Average Passenger Load X13 = Total Dwell Time X14 = Precipitation X15 = Average Temperature X16 = Summer (dummy variable if month = June thru August) X17 = Friday (dummy)

Trip Time Model Modified Ahmed Version X1 = Distance (in miles) X2 = Scheduled Number of Stops X3 = Direction or Southbound X4 = AM Peak X5 = PM Peak X6 = Actual Number of Stops X7 = Total Boardings X8 = Boardings Squared X9 = Total Alightings X10 = Alightings Squared X11 = Lift X12 = Average Passenger Load X13 = Total Dwell Time X14 = Precipitation X15 = Average Temperature X16 = Summer (dummy variable if month = June thru August) X17 = Friday (dummy)

Trip Time Model Modified Ahmed Version – outliers removed tripmiles > 0 & tripmiles < 25 & total_dwell < 100*60 X1 = Distance (in miles) X2 = Scheduled Number of Stops X3 = Direction or Southbound X4 = AM Peak X5 = PM Peak X6 = Actual Number of Stops X7 = Total Boardings X8 = Boardings Squared X9 = Total Alightings X10 = Alightings Squared X11 = Lift X12 = Average Passenger Load X13 = Total Dwell Time X14 = Precipitation X15 = Average Temperature X16 = Summer (dummy variable if month = June thru August) X17 = Friday (dummy)

Trip Time Model Modified Ahmed Version – outliers removed tripmiles > 0 & tripmiles < 25 & total_dwell < 100*60 X1 = Distance (in miles) X2 = Scheduled Number of Stops X3 = Direction or Southbound X4 = AM Peak X5 = PM Peak X6 = Actual Number of Stops X7 = Total Boardings X8 = Boardings Squared X9 = Total Alightings X10 = Alightings Squared X11 = Lift X12 = Average Passenger Load X13 = Total Dwell Time X14 = Precipitation X15 = Average Temperature X16 = Summer (dummy variable if month = June thru August)

Trip Time Model Modified Ahmed Version – outliers removed tripmiles > 0 & tripmiles < 25 & total_dwell < 100*60 X1 = Distance (in miles) X2 = Scheduled Number of Stops X3 = Direction or Southbound X4 = AM Peak X5 = PM Peak X6 = Actual Number of Stops X7 = Total Boardings X8 = Boardings Squared X9 = Total Alightings X10 = Alightings Squared X11 = Lift X12 = Average Passenger Load X13 = Total Dwell Time X14 = Precipitation X15 = Average Temperature X16 = Summer (dummy variable if month = June thru August)

Trip Time Model Modified Ahmed Version – outliers removed tripmiles > 0 & tripmiles < 25 & total_dwell < 100*60 X1 = Distance (in miles) X2 = Scheduled Number of Stops X3 = Direction or Southbound X4 = AM Peak X5 = PM Peak X6 = Actual Number of Stops X7 = Total Boardings X8 = Boardings Squared X9 = Total Alightings X10 = Alightings Squared X11 = Lift X12 = Average Passenger Load X13 = Total Dwell Time X14 = Precipitation X15 = Average Temperature X16 = Summer (dummy variable if month = June thru August) X17 = (Boardings + Alightings) 2

Histogram of total boardings(blue) and total alightings(red)

Boxplot of total boardings(1) and total alightings(2)

Trip Time Model Modified Ahmed Version – outliers removed tripmiles > 0 & tripmiles 0 & total_offs > 0 X1 = Distance (in miles) X2 = Scheduled Number of Stops X3 = Direction or Southbound X4 = AM Peak X5 = PM Peak X6 = Actual Number of Stops X7 = Total Boardings X8 = Boardings Squared X9 = Total Alightings X10 = Alightings Squared X11 = Lift X12 = Average Passenger Load X13 = Total Dwell Time X14 = Precipitation X15 = Average Temperature X16 = Summer (dummy variable if month = June thru August) X17 = (Boardings + Alightings) 2

Trip Time Model Modified Ahmed Version – outliers removed tripmiles > 0 & tripmiles 0 X1 = Distance (in miles) X2 = Scheduled Number of Stops X3 = Direction or Southbound X4 = AM Peak X5 = PM Peak X6 = Actual Number of Stops X7 = Boardings + Alightings X8 = Lift X9 = Average Passenger Load X10 = Total Dwell Time X11 = Precipitation X12 = Average Temperature X13 = Summer (dummy variable if month = June thru August) X14 = (Boardings + Alightings) 2

Trip Time Model Modified Ahmed Version – outliers removed tripmiles > 0 & tripmiles 0 X1 = Distance (in miles) X2 = Scheduled Number of Stops X3 = Direction or Southbound X4 = AM Peak X5 = PM Peak X6 = Actual Number of Stops X7 = Total Boardings X8 = Boardings Squared X9 = Total Alightings X10 = Alightings Squared X11 = Lift X12 = Average Passenger Load X13 = Total Dwell Time X14 = Precipitation X15 = Average Temperature X16 = Summer (dummy variable if month = June thru August) X17 = (Boardings + Alightings) 2

Trip Time Model Modified Ahmed Version – outliers removed tripmiles > 0 & tripmiles 0 X1 = Distance (in miles) X2 = Scheduled Number of Stops X3 = Direction or Southbound X4 = AM Peak X5 = PM Peak X6 = Actual Number of Stops X7 = Boardings + Alightings X8 = Lift X9 = Average Passenger Load X10 = Total Dwell Time X11 = Precipitation X12 = Average Temperature X13 = Summer (dummy variable if month = June thru August) X14 = (Boardings + Alightings) 2 Ahmed says use this version

Trip Time Model Modified Ahmed Version – outliers removed tripmiles > 0 & tripmiles 0 X1 = Distance (in miles) X2 = Scheduled Number of Stops X3 = Direction or Southbound X4 = AM Peak X5 = PM Peak X6 = Actual Number of Stops X7 = Boardings + Alightings X8 = Lift X9 = Average Passenger Load X10 = Total Dwell Time X11 = Precipitation X12 = Average Temperature X13 = Summer (dummy variable if month = June thru August) X14 = (Boardings + Alightings) 2

Regression Dwell Regression Model I have run several of these..here is an example Dwell = *No. Boardings *No. Alightings R squared =.291 There are interesting differences in the dwells for timepoint stop locations versus regular stops. Travel Time Regression Model I am still experimenting with this. The thought was that we can explain as much variation as possible with the bus data…what we can’t explain would be road conditions/congestion. It would be interesting to compare routes (low and high congestion routes) to test this assumption. I have achieved an R squared of about.19 Most of the variation is explained by passenger movement and dwell.

Notes Headway variance Subplots of headway variance and estimated load Subplots of headway variance and boardings/alightings Table of summary statistics to re-plot in excel »See if what I did worked… »Also experiment w/different ways of displaying the timepoint names (i.e. a legend) Dwell regression model With stop locations W/O stop locations Dwell Circle Running time: arrive time(x) – leave time(x-1) Layover time: hmmm… Dwell time: dwell, less layover (?) Stop circle time: leave_time - arrive_time, less dwell (?) Travel time regression model Plottools function in Matlab, which you call from the command line, is very handy for manipulating figure formats…