Using Google’s Aggregated and Anonymized Trip Data to Estimate Dynamic Origin-Destination Matrices for San Francisco TRB Applications Conference 2017 Bhargava.

Slides:



Advertisements
Similar presentations
Chapter 12 - Forecasting Forecasting is important in the business decision-making process in which a current choice or decision has future implications:
Advertisements

Opportunities & Challenges Using Passively Collected Data In Travel Demand Modeling 15 th TRB Transportation Planning Applications Conference Atlantic.
An Application of Mitigating Flow Bias from Origin/Destination Surveys in a Transit System Jamie Snow (AECOM) David Schmitt (AECOM) May 20, 2015.
Correlation 1. Correlation - degree to which variables are associated or covary. (Changes in the value of one tends to be associated with changes in the.
Use of Truck GPS Data for Travel Model Improvements Talking Freight Seminar April 21, 2010.
Comparison of Cell, GPS, and Bluetooth Derived External Data Results from the 2014 Tyler, Texas Study 15 th TRB National Transportation Planning Conference.
COLLABORATE. INNOVATE. EDUCATE. What Smartphone Bicycle GPS Data Can Tell Us About Current Modeling Efforts Katie Kam, The University of Texas at Austin.
BIG BIKE DATA: HOW GPS ROUTE DATA COLLECTED FROM SMARTPHONES CAN BENEFIT BICYCLE PLANNING MEYER, 2013 UNIVERSITY OF TEXAS AT AUSTIN 1.
Estimation of Demand Prof. Ravikesh Srivastava Lecture-8.
Population Movements from Anonymous Mobile Signaling Data An Alternative or Complement to Large- Scale Episodic Travel Surveys?
5 th Bi-Annual Border to Border Conference Development of an Origin-Destination Matrix for the Paso Del Norte Region (Regional Master Network Phase IV)
Where did you come from? Where did you go? Robust policy relevant evidence from mobile network big data Danaja Maldeniya, Amal Kumarage, Sriganesh Lokanathan,
How to Put “Best Practice” into Traffic Assignment Practice Ken Cervenka Federal Transit Administration TRB National Transportation.
Correlation Correlation is used to measure strength of the relationship between two variables.
Dynamic Origin-Destination Trip Table Estimation for Transportation Planning Ramachandran Balakrishna Caliper Corporation 11 th TRB National Transportation.
STATISTICS 12.0 Correlation and Linear Regression “Correlation and Linear Regression -”Causal Forecasting Method.
Chapter 4 Class 4.
Preliminary Evaluation of Cellular Origin- Destination Data as a Basis for Forecasting Non-Resident Travel 15 th TRB National Transportation Planning Applications.
SAN FRANCISCO COUNTY TRANSPORTATION AUTHORITY Making Activity-Based Travel Demand Models Play Nice With Trip Rates Elizabeth Sall, Daniel Wu, Billy Charlton.
CE Urban Transportation Planning and Management Iowa State University Calibration and Adjustment Techniques, Part 1 Source: Calibration and Adjustment.
STATISTICS 12.0 Correlation and Linear Regression “Correlation and Linear Regression -”Causal Forecasting Method.
Simple Linear Regression The Coefficients of Correlation and Determination Two Quantitative Variables x variable – independent variable or explanatory.
BPA CSUB Prof. Yong Choi. Midwest Distribution 1. Create scatter plot Find out whether there is a linear relationship pattern or not Easy and simple using.
Chapter 14 Introduction to Regression Analysis. Objectives Regression Analysis Uses of Regression Analysis Method of Least Squares Difference between.
Reconciliation of regional travel model and passive device tracking data 14 th TRB Planning Applications Conference Leta F. Huntsinger Rick Donnelly.
Napa County Travel Behavior Study March 10, 2015 Napa County Joint Board of Supervisors and Planning Committee Meeting Presentation 1.
Privacy Vulnerability of Published Anonymous Mobility Traces Chris Y. T. Ma, David K. Y. Yau, Nung Kwan Yip (Purdue University) Nageswara S. V. Rao (Oak.
Applications in Mobile Technology for Travel Data Collection 2012 Border to Border Transportation Conference South Padre Island, Texas November, 13, 2012.
INTRODUCTION TO RESEARCH METHODS IN ECONOMICS Topic 5 Data Collection Strategies These slides are copyright © 2010 by Tavis Barr. This work is licensed.
Big Data and Advanced Models on a Mid-Sized City’s Budget:
Alternate Methodologies for Origin-Destination Data Collection
Lessons learned from Metro Vancouver
GS/PPAL Section N Research Methods and Information Systems
Evaluation of Hard Shoulder vs
Analysis of Relationships between Road Traffic Volumes and Weather: Exploring Spatial Variation Jaakko Rantala and James Culley Need Previous studies for.
Sections Review.
TRB Planning Application Conference, May 2017
Correlation and Simple Linear Regression
Predicting Salinity in the Chesapeake Bay Using Neural Networks
OPERATIONS MANAGEMENT for MBAs Fourth Edition
Belinda Boateng, Kara Johnson, Hassan Riaz
AMPO Annual Conference October 22, 2014
Mixed Costs Chapter 2: Managerial Accounting and Cost Concepts. In this chapter we explain how managers need to rely on different cost classifications.
Estimating the Traffic Flow Impact of Pedestrians With Limited Data
Validating Trip Distribution using GPS Data
Leta F. Huntsinger, PhD, PE Senior Technical Principal, WSP
WIFI Data Collection and the Effectiveness of Various Survey Expansion Techniques- A Case Study on I-95 Corridor in Palm Beach County, FL Presented to.
EVALUATING THE TRANSPORTATION IMPACTS OF MULTIFAMILY HOUSING
Chapter 12: Regression Diagnostics
Correlation A Lecture for the Intro Stat Course
Correlation and Simple Linear Regression
Module 2: Demand Forecasting 2.
Estimating PM2.5 using MODIS and GOES aerosol optical depth retrievals in IDEA Hai Zhang , Raymond M. Hoff1 , Shobha Kondragunta2.
Identifying Worker Characteristics Using LEHD and GIS
Development of New Supply Models in Maryland Using Big Data
Tabulations and Statistics
No notecard for this quiz!!
Transit Path-Building: “To Multipath or Not to Multipath”
Correlation and Simple Linear Regression
Functions and Their Graphs
Chattanooga Transportation Data Collection Review
Multiple Linear Regression Analysis
Estimating the number of components with defects post-release that showed no defects in testing C. Stringfellow A. Andrews C. Wohlin H. Peterson Jeremy.
Simple Linear Regression and Correlation
Ossi Nurmi 15th Global Forum on Tourism Statistics, Cusco, Peru
BEC 30325: MANAGERIAL ECONOMICS
Comparison and Analysis of Big Data for a Regional Freeway Study in Washington State Amanda Deering, DKS Associates.
What do the Experts Do? Insights from Interviews & Literature to Deal with Bias in Big Data Conversations about Counting: Big Data – Implications for.
Correlation and Simple Linear Regression
Correlation and Simple Linear Regression
Presentation transcript:

Using Google’s Aggregated and Anonymized Trip Data to Estimate Dynamic Origin-Destination Matrices for San Francisco TRB Applications Conference 2017 Bhargava Sana, Joe Castiglione, Dan Tischler, Drew Cooper Good Afternoon! I am here to share our recent experience with a new passive data source from Google to estimate OD matrices for San Francisco Bay Area. SAN FRANCISCO COUNTY TRANSPORTATION AUTHORITY May 16, 2017

OD Data Collection Conventional methods License plate surveys Roadside interviews Emerging passive data collection methods Bluetooth detectors Cell phone Call Detail Records (CDR) data GPS, Wi-Fi detectors Historically, trip OD info available only through expensive and time-consuming surveys Recently, use of passive data collection technologies is gaining momentum

Google’s Better Cities Program Aggregated and Anonymized Trip (AAT) Data Minimize congestion, improve safety and reduce infrastructure spending Aggregated and Anonymized Trip (AAT) information from location reports Extract data from moving users Clean data and snap to road network Aggregate OD trip counts Apply differential privacy filters and minimum trips threshold Partnered with Google Better Cities program made available aggregated and anonymized OD flow information from location reports of mobile device users Reports based on data from users who chose to store their location information from Google-enabled devices on Google servers Easy to turn it on in account settings and record trips fairly accurately; No active navigation constraint Describe AAT creation process AAT DATA

Google AAT Dataset Hourly AAT data for six months (Apr-Jun and Sep-Nov 2015) Flow data provided as relative trips as opposed to absolute counts Convert relative flows to trips using HH travel survey? Hourly flow data provided for 6 months in 2015 85 districts defined for 9-county San Francisco Bay Area: combination of Tract and County boundaries Google provided relative flows rather than absolute counts stating that location reports account for only a sample of travelers But we need trips for planning purposes; maybe relative flows could be converted to trips using regional HH travel survey? A quick comparison with CHTS flows shows that the relative magnitude of AAT flows is reasonable CHTS is sparse with data for less than half the total OD district pairs (7,225) but AAT has all ODs

Relative Flow Conversion Model 𝑷𝑻 𝒐𝒅𝒕 = β 𝒕 𝑹𝑭 𝒐𝒅𝒕 +ϵ ; 𝟎≤𝐭≤𝟐𝟑 𝑃𝑇 𝑜𝑑𝑡 : Person trips between o-d for hour-of-day t 𝛽 𝑡 : Coefficient of AAT relative flow for t 𝑅𝐹 𝑜𝑑𝑡 : Avg AAT flow between o-d for hour-of-day t 𝜖 : Error term No geographic constants applied District- and County-level regression models estimated 20% sample used for validation The relationship between relative flows and trips appeared to vary by geography as well as time-of-day However only TOD coefficients used so that the conversion model is independent of OD markets Both district-level and county-level regression through origin (RTO) models were estimated A 20% holdout sample was used for validation in each case

Origin-Destination-Hour Predicted vs Observed Origin-Destination-Hour Scatter plots of predicted vs observed hourly OD flows show that the models perform reasonably well except county model validation sample. Please note, each point is an origin-destination-hour. RMSE and Mean Absolute Error (MAE) show that county model is unsurprisingly better due to higher spatial aggregation

Summary AAT relative flow magnitudes correlated with actual trips AAT geographic coverage significantly higher Simple linear regression model may be used for conversion Could support measuring longitudinal variation Further studies Better/smooth survey data Compare with cell CDR data