Where did you come from? Where did you go? Robust policy relevant evidence from mobile network big data Danaja Maldeniya, Amal Kumarage, Sriganesh Lokanathan,

Slides:



Advertisements
Similar presentations
A Synthetic Environment to Evaluate Alternative Trip Distribution Models Xin Ye Wen Cheng Xudong Jia Civil Engineering Department California State Polytechnic.
Advertisements

VTrack: Energy-Aware Traffic Delay Estimation Using Mobile Phones Lenin Ravindranath, Arvind Thiagarajan, Katrina LaCurts, Sivan Toledo, Jacob Eriksson,
Will ‘big data’ transform official statistics?
Kien A. Hua Division of Computer Science University of Central Florida.
Human Mobility Modeling at Metropolitan Scales Sibren Isaacman, Richard Becker, Ramón Cáceres, Margaret Martonosi, James Rowland, Alexander Varshavsky,
Enhancing Policy Decision Making with Large-Scale Digital Traces Vanessa Frias-Martinez University of Maryland NFAIS, February 2014.
Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides
Analysis and Multi-Level Modeling of Truck Freight Demand Huili Wang, Kitae Jang, Ching-Yao Chan California PATH, University of California at Berkeley.
Spatiotemporal Pattern Mining For Travel Behavior Prediction UIC IGERT Seminar 02/14/2007 Chad Williams.
Archived Data User Services (ADUS). ITS Produce Data The (sensor) data are used for to help take transportation management actions –Traffic control systems.
Lec 15 LU, Part 1: Basics and simple LU models (ch6.1 & 2 (A), ch (C1) Get a general idea of urban planning theories (from rading p (A)
Origin-Destination matrix estimation in Sri Lanka using mobile network big data Danaja Maldeniya, Sriganesh Lokanathan and Amal Kumarage (Phd) 13th International.
Opportunities & Challenges Using Passively Collected Data In Travel Demand Modeling 15 th TRB Transportation Planning Applications Conference Atlantic.
Functional Classification CE 453 Lecture 3. Objectives Summarize general highway design process Identify different roadway classification systems Identify.
1 Vehicular Sensor Networks for Traffic Monitoring In proceedings of 17th International Conference on Computer Communications and Networks (ICCCN 2008)
Aaker, Kumar, Day Seventh Edition Instructor’s Presentation Slides
The Market Research Process
Use of Truck GPS Data for Travel Model Improvements Talking Freight Seminar April 21, 2010.
Utilizing Big Data Methods for a Long-Distance Automobile Origin-Destination Study May 19, 2015 Presented by Polly Ramsey 15 th TRB National Transportation.
Miao(Mia) Gao, Travel Demand Modeler, HDR Engineering Santanu Roy, Transportation Planning Manager, HDR Engineering Ridership Forecasting for Central Corridor.
An introduction May Offermans, Martijn Tennekes, Alex Priem, Shirley Ortega en Nico Heerschap Using Mobile Phone Meta Data For National Statistics.
Vienna, 23 April 2008 UNECE Work Session on SDE Topic (v) Editing on results (post-editing) 1 Topic (v): Editing based on results Discussants: Maria M.
Application of the Histocity Method to Mobility analysis and scenario definition in the city of Florence, Italy M.A. Esposito Università degli Studi di.
Marketing Research Aaker, Kumar, Day and Leone Tenth Edition Instructor’s Presentation Slides.
1 WORLD TOURISM ORGANIZATION (UNWTO) MEASURING TOURISM EXPENDITURE: A UNWTO PROPOSAL SESRIC-UNWTO WORKSHOP ON TOURISM STATISTICS AND THE ELABORATION OF.
ECON 6012 Cost Benefit Analysis Memorial University of Newfoundland
Problem Statement and Motivation Key Achievements and Future Goals Technical Approach Kouros Mohammadian, PhD and Yongping Zhang (PhD Candidate), CME,
6 am 11 am 5 pm Fig. 5: Population density estimates using the aggregated Markov chains. Colour scale represents people per km. Population Activity Estimation.
Evaluating Robustness of Signal Timings for Conditions of Varying Traffic Flows 2013 Mid-Continent Transportation Research Symposium – August 16, 2013.
Freight Bottleneck Study Update to the Intermodal, Freight, and Safety Subcommittee of the Regional Transportation Council September 12, 2002 North Central.
Population Movements from Anonymous Mobile Signaling Data An Alternative or Complement to Large- Scale Episodic Travel Surveys?
Act Now: An Incremental Implementation of an Activity-Based Model System in Puget Sound Presented to: 12th TRB National Transportation Planning Applications.
1 Building the Knowledge Base. 2 New Parameters In crossing international borders, a firm encounters parameters not found in domestic business. Examples.
Travel Data Simulation and Transferability of Household Travel Survey Data Kouros Mohammadian, PhD and Yongping Zhang (PhD Candidate), CME, UIC Prime Grant.
M I T I n t e r n a t i o n a l C e n t e r f o r A i r T r a n s p o r t a t i o n ANALYSIS OF BARRIERS TO THE UTILITY OF GENERAL AVIATION TROY D. DOWNENR.
Travel Demand Forecasting: Trip Distribution CE331 Transportation Engineering.
Assessing Quality for Integration Based Data M. Denk, W. Grossmann Institute for Scientific Computing.
Y. Yuan and M. Raubal1 Investigating the distribution of human activity space from mobile phone usage Yihong Yuan 1,2 and Martin Raubal 1 1 Institute.
Scot Exec Course Nov/Dec 04 Survey design overview Gillian Raab Professor of Applied Statistics Napier University.
1 Predicted-versus-Actual Studies: Why/how to do them and Lessons Learned Ken Cervenka Federal Transit Administration TRB Transportation Planning Applications.
Client Name Here - In Title Master Slide Data Requirements to Support Road Pricing Analyses Johanna Zmud, Ph.D. NuStats Partners, LP Expert Forum on Road.
SVY 207: Lecture 13 Ambiguity Resolution
Managing Complexity with Multi-scale Travel Forecasting Models Jeremy Raw Office of Planning Federal Highway Administration May 11, 2011.
SUB-MODULE 5. MEANS OF VERIFICATION RESULTS BASED LOGICAL FRAMEWORK TRAINING Quality Assurance and Results Department (ORQR.2)
Learning Geographical Preferences for Point-of-Interest Recommendation Author(s): Bin Liu Yanjie Fu, Zijun Yao, Hui Xiong [KDD-2013]
FDOT Transit Office Modeling Initiatives The Transit Office has undertaken a number of initiatives in collaboration with the Systems Planning Office and.
Methodological Considerations for Integrating Dynamic Traffic Assignment with Activity-Based Models Ramachandran Balakrishna Daniel Morgan Srinivasan Sundaram.
November 28, 2006 CCOS On-Road Allocation Factors Page 1 Spatial & Temporal Allocation of On-Road Emissions CCOS Technical Committee November 28, 2006.
Copyright 2010, The World Bank Group. All Rights Reserved. Producer prices, part 1 Introduction Business Statistics and Registers 1.
Big and Open Data: Challenges and Issues
Travel Data and the Smartphone: Building an International Travel Dataset One Android User at a Time GIL TAL MICHAEL NICHOLAS MATTHEW FAVETTI.
A Tour-Based Urban Freight Transportation Model Based on Entropy Maximization Qian Wang, Assistant Professor Department of Civil, Structural and Environmental.
Predicting the Location and Time of Mobile Phone Users by Using Sequential Pattern Mining Techniques Mert Özer, Ilkcan Keles, Ismail Hakki Toroslu, Pinar.
Regional Policy Ex-post evaluation 2007 – 2013 Completed work-packages: 8 – Energy efficiency 9 – Tourism and culture 13 – Geography of expenditure DG.
The Current State-of-the-Practice in Modeling Road Pricing Bruce D. Spear Federal Highway Administration.
Using mobile network big data for land use classification Kaushalya Madhawa, Sriganesh Lokanathan, Danaja Maldeniya, Rohan Samarajiva CPRsouth 2015 Taipei.
Generated Trips and their Implications for Transport Modelling using EMME/2 Marwan AL-Azzawi Senior Transport Planner PDC Consultants, UK Also at Napier.
A COMPARATIVE STUDY Dr. Shahram Tahmasseby Transportation Systems Engineer, The City of Calgary Calgary, Alberta, CANADA.
Transportation Modeling – Opening the Black Box. Agenda 6:00 - 6:05Welcome by Brant Liebmann 6:05 - 6:10 Introductory Context by Mayor Will Toor and Tracy.
DATA FOR EVIDENCE-BASED POLICY MAKING Dr. Tara Vishwanath, World Bank.
Applications in Mobile Technology for Travel Data Collection 2012 Border to Border Transportation Conference South Padre Island, Texas November, 13, 2012.
Urban Mobility Management and Emissions Measurement System Boile Maria 1,2 Afroditi Anagnostopoulou 1 Evangelia Papargyri 1 1 Centre for Research and Technology.
Big data Analytics for Tourism Destination management
Lessons learned from Metro Vancouver
Administrative data, calling patterns and spatial economics: Impact evaluation drawing on multiple data sources Nathaniel Young (EBRD)
ITTS FEAT Tool Methodology Review ITTS Member States Paula Dowell, PhD
Using Google’s Aggregated and Anonymized Trip Data to Estimate Dynamic Origin-Destination Matrices for San Francisco TRB Applications Conference 2017 Bhargava.
Data Pre-processing Lecture Notes for Chapter 2
Comparison and Analysis of Big Data for a Regional Freeway Study in Washington State Amanda Deering, DKS Associates.
What do the Experts Do? Insights from Interviews & Literature to Deal with Bias in Big Data Conversations about Counting: Big Data – Implications for.
Presentation transcript:

Where did you come from? Where did you go? Robust policy relevant evidence from mobile network big data Danaja Maldeniya, Amal Kumarage, Sriganesh Lokanathan, Gabriel Kreindler, Kaushalya Madhawa CPRsouth 2015 Taipei, Taiwan 26 August 2015 This work was carried out with the aid of a grant from the International Development Research Centre, Canada and the Department for International Development UK.

Policy implication Mobile Network Big Data (MNBD) can support urban transport planning as a continuous exercise – Greater spatio-temporal detail than corresponding traditional output – Negligible incremental cost of generating forecasts – Single source for understanding different aspects of mobility Inherent limitations mean MNBD cannot replace traditional process entirely

Transport forecasting in developing countries are based on infrequent surveys These are done on an as needed basis – The Colombo transport master plan (COMTRANS) survey in 2013 in Sri Lanka, funded by JAICA Expensive – COMTRANS 2013 survey cost approximately $400,000 1 Time consuming – By the time the results are ready, they are often already outdated Cannot support continuous monitoring of transport patterns Not very useful to to help evaluate impact of policies 1 Estimated based on interviews 3

Mobility insights from MNBD need to be aligned with the traditional process MNBD based insights provide greater temporal and spatial resolution MNBD cannot replace the diverse data collected by the traditional survey MNBD insights need to be aligned with different stages of traditional process familiar to planners

The data: historical and anonymized Call Detail Records (CDRs) from Sri Lanka Call Detail Record (CDR): – Records of all calls made and received by a person created mainly for the purposes of billing – Similar records exist for all SMS-es sent and received as well as for all Internet sessions – The Cell ID in turn has a lat-long position associated with it CDR data for 13 contiguous months in – Nearly million 10 SIMs – Over 25 billion records 5

Origin – Destination Matrices Key intermediate output of the traditional forecasting process Estimated people/vehicle flows between regions – E.g.: DSD (3 rd level administrative division) level O-D matrix for the Western Province of Sri Lanka Used in traffic studies, identifying key transport corridors, etc.

Multiple methods exist for extracting O-D matrices from MNBD We extracted O-D matrices for the Western Province of Sri Lanka using 3 methods: – Stay based method – Transient Trip method – Frequent Trip method Methods have two stages: – Identify individual movement as a sequence of trips (varies across methods) – Aggregate individual trips across the origin and destination locations of the trips (same for all methods)

Stay based approach

Transient trip approach A trip is identified from the CDRs by a consecutive pair of records such that, – The records indicate a displacement, i.e. the BTS-es utilized for each record is different – The records are separated by a time interval T Interval where, 10 minutes ≤ T Interval ≤ 1 hour Maximizes amount of extracted mobility information by capturing intermediate points in trips Extracted trips likely to correspond to segments of real trips Built on prior work – Wang, P., Hunter, T., Bayen, A. M., Schechtner, K., & González, M. C. (2012). Understanding road usage patterns in urban areas. Scientific Reports, 2, doi: /srep01001 – Iqbal, M. S., Choudhury, C. F., Wang, P., & González, M. C. (2014). Development of origin- destination matrices using mobile phone call data. Transportation Research Part C: Emerging Technologies, 40, 63–74. doi: /j.trc

Frequent trip approach

Each method has strengths/weaknesses Stay basedTransient tripsFrequent trips Amount of mobilityLowHighRegular mobility Sensitivity to noiseLowHighLow Bias towards active users HighModerateLow UseIdentifies congregations of people Suitable for short term mobility analysis Aligns best with outputs from traditional process

Validation with traditional output Compared with best available traditional forecast – Number of trips generated by region at the DSD level, from COMTRANS 2013 Constructed weighted linear models for all three methods MethodInterceptMNBD estimateR2R2 Stay based35,516***76.41***0.819 Transient trip25,460**2.66***0.903 Frequent Trips14, ***0.909

MNBD insights within a traditional transport forecasting process MNBD insights have inherent limitations – Sampling biases: high activity users, mobile phone penetration in different regions – Sparsity of data: less than 25 records per day for 90% of the users – Lack of socioeconomic, demographic, travel motivation Solutions exist to mitigate these issues to some extent – Adjust for penetration and operator market share by scaling flows – Associating demographic parameters from travel surveys with MNBD insights using machine learning techniques to match mobility variables in both – Probabilistic models (E.g : Hidden Markov Models) to estimate locations for people where no records exist, can improve mobility estimates

Policy implication Mobile Network Big Data (MNBD) can support urban transport planning as a continuous exercise. – Greater spatio-temporal detail than corresponding traditional output – Negligible incremental cost of generating forecasts – Single source for understanding different aspects of mobility – Inherent limitations mean cannot replace traditional process entirely

Thank you. 15