What do the Experts Do? Insights from Interviews & Literature to Deal with Bias in Big Data Conversations about Counting: Big Data – Implications for.

Slides:



Advertisements
Similar presentations
Survey design. What is a survey?? Asking questions – questionnaires Finding out things about people Simple things – lots of people What things? What people?
Advertisements

GIS and Transportation Planning
Estimating Highway Pavement Damage Costs Attributed to Truck Traffic Yong Bai, Ph.D., P.E., F.ASCE Associate Professor Dept. of Civil, Environmental, and.
NON MOTORISED TRANSPORT Teaching & Learning Materials – Update 2007 funded within the 6th Framework Programme of the EU as Specific Support.
Company confidential Prepared by HERE Transit Sr. Product Manager, HERE Transit Product Overview David Volpe.
The City of Gdynia City rights in 1926 With Sopot and Gdańsk forms the Tri-City agglomeration It has inhabitants Port city, employment structure:
ICT cloud-based platform and mobility services available, universal and safe for all users General presentation
Department of Civil and Environmental Engineering University of Wisconsin-Madison Wisconsin Traffic Operations and Safety Laboratory Overview of SAFER.
CITTA 5 TH Annual Conference on Planning Research Planning and Ageing Think, Act and Share Age-Friendly Cities CiViTAS-ELAN Project Development, Implementation,
1 Preserving Privacy in GPS Traces via Uncertainty-Aware Path Cloaking by: Baik Hoh, Marco Gruteser, Hui Xiong, Ansaf Alrabady ACM CCS '07 Presentation:
City of Leawood Bicycle Friendly Community The Year in Review.
Mobile Ghent Mobile positioning data and transport: a theoretical, methodological and empirical discussion 24 October 2013 Bert van Wee Delft University.
National Household Travel Survey Statewide Applications Heather Contrino Travel Surveys Team Lead Federal Highway Administration Office of Highway Policy.
Roadway and traffic characteristics for bicycling Author Janice Kirner Providelo Suely da Penha Sanches Presenter 謝博任.
Where did you come from? Where did you go? Robust policy relevant evidence from mobile network big data Danaja Maldeniya, Amal Kumarage, Sriganesh Lokanathan,
Press Conference on Road Safety Network And Launching Fleet Safety Management.
PHAN Physical Activity Networking Practical application of HEAT in Modena.
Sydney, AUSTRALIA | Beijing, CHINA | Hyderabad, INDIA | London, UK Affiliated with the University of Sydney.
Cloud Market Readiness Report Finance, Media, and Legal Sectors March 2014 Trend Consulting 2013.
ROSETTA Real OpportunitieS for Exploitation of Transport Telematics Applications 5th Framework Project Partners TRGMike McDonald, Richard Hall (Project.
Central MeetBike More sustainable transport in Central European cities through improved integrated bicycle promotion and international networking Jaroslav.
CIVITAS Forum, Casablanca 2014 Project Manager Troels Andersen City of Odense, Denmark GETTING THE MOST OUT OF CIVITAS 2020.
Cloud Computing for the Grassroots A presentation for the Phennd Quarterly Meeting October 5, 2010 Michele Masucci, Ph.D., Associate Professor of Geography.
ADA Revised Regulations General Overview Trainer’s Name Trainer’s Title Phone Number /Website Here ADA Trainer Network Module 1c 1.
Introduction to NCHS Rob Weinzimer, Special Assistant for Outreach Centers for Disease Control and Prevention National Center for Health Statistics.
1 Enabling Smart Cities/Campuses to Serve the Internet of People Florence Hudson Senior Vice President & Chief Innovation Officer Internet2 TNC16 June.
THE SOUND OF SILENCE: AN EVALUATION OF CDC’S PODCAST INITIATIVE Quynh-Chau, M., Myers, Bradford A. (2013). The Sound of Silence: an evaluation of CDC's.
Russell & Jamieson chapter Evaluation Steps 15. Evaluation Steps Step 1: Preparing an Evaluation Proposal Step 2: Designing the Study Step 3: Selecting.
A Shift in the Data Security Paradigm
Krista Nordback, Ph.D., P.E., Kristin Tufte, Ph.D.
Marketing Research Aaker, Kumar, Leone and Day Eleventh Edition
Road Safety Behaviour Symposium: New technology, new connectivity
9/10 Technology Education
1st November, 2016 Transport Modelling – Developing a better understanding of Short Lived Events Marcel Pooke – Operational Modelling & Visualisation Manager.
Neighborhood Pedestrian Fatality Risk
RESEARCH METHODS Lecture 20
Integrating administrative data – the 2021 Census and beyond
Seattle Bike Map Update
CIE Overview.
Data Impacts of Transportation Reauthorization: Data Community’s Plans and Strategies Pat Hu Chair, TRB National Transportation Data Requirements and Programs.
© Inge Hill Start Up, Palgrave 2015
Post Enumeration Survey Census
September 2011 Public Open Houses
Presentation to Pacific Statistics Methods Board
Estimating Highway Pavement Damage Costs Attributed to Truck Traffic
The IRTAD Group The IRTAD database The IRTAD annual report IRTAD research reports Outreach activities.
Nettest An implementation of BEREC’s recommendations
ADA Revised Regulations General Overview
Using Google’s Aggregated and Anonymized Trip Data to Estimate Dynamic Origin-Destination Matrices for San Francisco TRB Applications Conference 2017 Bhargava.
September 2011 Public Open Houses
CENSUS EVALUATION & POST ENUMERATION SURVEYS
MIXED METHODS IN RESEARCH STUDIES: LEARNING FROM EXAMPLES
Section 1.5 Bias in Sampling.
Passenger mobility and road traffic statistics
Business and Management Research
UNODC-UNECE Manual on Victimization Surveys: Content
University of Science & Technology, Meghalaya
Qualtrics for data collection
What do Samples Tell Us Variability and Bias.
SPR-B Research Coordination Webinar
MSP Regional Travel Behavior Inventory Program
RESEARCH METHODS Lecture 14
GLOBAL STATUS REPORT ON ROAD SAFETY SUPPORTING A DECADE OF ACTION
A Healthy Community Perspective on Aging Well
Overview Background Methods Findings Future research
RESEARCH METHODS Lecture 20
RESEARCH METHODS Lecture 14
Mobility Management during road construction – a Swedish attempt
Built Environment and Traffic Safety
Comparison and Analysis of Big Data for a Regional Freeway Study in Washington State Amanda Deering, DKS Associates.
Presentation transcript:

What do the Experts Do? Insights from Interviews & Literature to Deal with Bias in Big Data Conversations about Counting: Big Data – Implications for Bicycle and Pedestrian Traffic Analysis Jun 26, 2018 Greg P. Griffin, AICP

Disclaimer The contents of this report reflect the views of the authors, who are responsible for the facts and the accuracy of the information presented herein. This document is disseminated in the interest of information exchange. The report is funded, partially or entirely, by a grant from the U.S. Department of Transportation’s University Transportation Centers Program. However, the U.S. Government assumes no liability for the contents or use thereof.

Approach Synthesis of Literature Interviews with Experts 135 studies 39 contacted 75 on bias in big data 10 participated Sources & Mitigation of Bias in Big Data

Sampling (or demographic) Bias Sources Mobile phone data Census demographics ≠ mobile adoption Mitigation Local survey Data fusion Reference Chen, C., et al. The Promises of Big Data and Small Data for Travel Behavior (Aka Human Mobility) Analysis. Transportation Research Part C: Emerging Technologies, Vol. 68, 2016, pp. 285–299.

Aggregation Bias Sources Privacy protection changes travel counts <threshold of trips usually removed Mitigation Data fusion Maths ;-) Markovian Inverse Wernerization? Reference Mehmood, R., et al. Exploring the Influence of Big Data on City Transport Operations: A Markovian Approach. International Journal of Operations & Production Management, Vol. 37, No. 1, 2017, pp. 75–104.

Social Desirability Bias Sources Social media or ‘athlete’ data Fitness cyclists may post only ‘good’ or ‘fast’ trips Mitigation Explore platforms that record all trips Sample weighting Data fusion Reference Beecham, R., and J. Wood. Exploring Gendered Cycling Behaviours within a Large-Scale Behavioural Data-Set. Transportation Planning and Technology, Vol. 37, No. 1, 2013, pp. 83–97.

Coverage Bias Sources Mobile phone call data records People do not make calls at every destination Mitigation Data providers that aggregate multiple networks? Sample weighting Reference Toole, J. L., S. Colak, B. Sturt, L. P. Alexander, A. Evsukoff, and M. C. González. The Path Most Traveled: Travel Demand Estimation Using Big Data Resources. Transportation Research Part C, Vol. 58, 2015, pp. 162–177.

Perspective People (not cars) carry phones Biased data > no data big data from mobile devices probably under- represents older segments of the population and maybe lower income ... But it also does a better job of representing under- represented road users like bicyclists and pedestrians. People (not cars) carry phones Biased data > no data

Lessons for Methods Real counts remain critical! Biases include aggregation, coverage, sampling, social desirability, and more Mitigation techniques Sample weighting with counts or local surveys Data fusion Maths ;-)

Recommendations & Next Steps Keep transportation experts and public central in determining the right goals and metrics to evaluate transportation safety. Develop new methods to relate big data to the total population needed for transportation safety. Leverage big data to answer intractable questions. Work ahead to transfer emerging knowledge to future problems.

Thanks! Co-authors Meg Mulhall & Chris Simek Interview coding support by TTI researcher Boya Dai, AICP. Project report posted soon “Sources and Mitigation of Bias in Big Data for Transportation Safety” https://www.vtti.vt.edu/utc/safe-d/ This project was funded by the Safety through Disruption (Safe-D) National University Transportation Center, a grant from the U.S. Department of Transportation – Office of the Assistant Secretary for Research and Technology, University Transportation Centers Program. g-griffin@tti.tamu.edu @gregpgriffin