Transportation mode detection using mobile phones and GIS information Leon Stenneth, Ouri Wolfson, Philip Yu, Bo Xu 1University of Illinois, Chicago.

Slides:



Advertisements
Similar presentations
When Efficient Model Averaging Out-Perform Bagging and Boosting Ian Davidson, SUNY Albany Wei Fan, IBM T.J.Watson.
Advertisements

Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
GPS and Multi-Week Data Collection of Activity-Travel Patterns Harry Timmermans Eindhoven University of Technology 4/19/2015.
Supervised Learning Techniques over Twitter Data Kleisarchaki Sofia.
Using Mobile Phones to Determine Transportation Modes Hyeong-il Ko Sasank Reddy et al., ACM Transactions on Sensor Networks, Vol. 6, No. 2,
Using Mobile Phones to Determine Transportation Modes Sasank Reddy, Min Mun, Jeff Burke, D. Estrin, M. Hansen, M. Srivastava TOSN 2010.
Semantic Texton Forests for Image Categorization and Segmentation We would like to thank Amnon Drory for this deck הבהרה : החומר המחייב הוא החומר הנלמד.
Spatiotemporal Pattern Mining For Travel Behavior Prediction UIC IGERT Seminar 02/14/2007 Chad Williams.
NEUROPHONE: BRAIN- MOBILE PHONE INTERFACE USING A WIRELESS EEG HEADSET Andrew T. Campbell, Tanzeem Choudhury, Shaohan Hu, Hong Lu, Matthew K. Mukerjee!,
Mining Motion Sensor Data from Smartphones for Estimating Vehicle Motion Tamer Nadeem, PhD Department of Computer Science NSF Workshop on Large-Scale Traffic.
Accelerometer-based Transportation Mode Detection on Smartphones
Robust Moving Object Detection & Categorization using self- improving classifiers Omar Javed, Saad Ali & Mubarak Shah.
Modal Split Analysis.
1 Application of Metamorphic Testing to Supervised Classifiers Xiaoyuan Xie, Tsong Yueh Chen Swinburne University of Technology Christian Murphy, Gail.
CS 590M Fall 2001: Security Issues in Data Mining Lecture 3: Classification.
Trajectories Simplification Method for Location-Based Social Networking Services Presenter: Yu Zheng on behalf of Yukun Cheng, Kai Jiang, Xing Xie Microsoft.
Robust Real-time Object Detection by Paul Viola and Michael Jones ICCV 2001 Workshop on Statistical and Computation Theories of Vision Presentation by.
Chad A. Williams † Peter C. Nelson Abolfazl (Kouros) Mohammadian University of Illinois at Chicago Department of Computer Science Colloquium July 16th,
Learning Transportation Mode from Raw GPS Data for Geographic Applications on the Web Yu Zheng, Like Liu, Xing Xie Microsoft Research.
Understanding User Mobility Based on GPS Data Yu Zheng Microsoft Research Asia.
Mining Officially Unrecognized Side effects of drugs by combining Web Search and Machine learning Carlo Carino, Yuanyuan Jia, Bruce Lambert, Patricia West.
Ubiquitous Advertising: the Killer Application for the 21st Century Author: John Krumm Presenter: Anh P. Nguyen
Automatic Gender Identification using Cell Phone Calling Behavior Presented by David.
Attention Deficit Hyperactivity Disorder (ADHD) Student Classification Using Genetic Algorithm and Artificial Neural Network S. Yenaeng 1, S. Saelee 2.
Learning from Imbalanced, Only Positive and Unlabeled Data Yetian Chen
Detecting Movement Type by Route Segmentation and Classification Karol Waga, Andrei Tabarcea, Minjie Chen and Pasi Fränti.
Automatically Identifying Localizable Queries Center for E-Business Technology Seoul National University Seoul, Korea Nam, Kwang-hyun Intelligent Database.
Detecting human activities using smartphones and maps Leon Stenneth Adviser: Professor Ouri Wolfson Co-Adviser: Professor Philip Yu University of Illinois,
TEMPLATE DESIGN © Detecting User Activities Using the Accelerometer on Android Smartphones Sauvik Das, Supervisor: Adrian.
Human Gesture Recognition Using Kinect Camera Presented by Carolina Vettorazzo and Diego Santo Orasa Patsadu, Chakarida Nukoolkit and Bunthit Watanapa.
Printing: This poster is 48” wide by 36” high. It’s designed to be printed on a large-format printer. Customizing the Content: The placeholders in this.
1 Action Classification: An Integration of Randomization and Discrimination in A Dense Feature Representation Computer Science Department, Stanford University.
Active Learning An example From Xu et al., “Training SpamAssassin with Active Semi- Supervised Learning”
Crowdsourcing for Spoken Dialogue System Evaluation Ling 575 Spoken Dialog April 30, 2015.
Machine Learning Approach to Report Prioritization with an Application to Travel Time Dissemination Piotr Szczurek Bo Xu Jie Lin Ouri Wolfson.
Slice&Dice: recognizing food preparation activities using embedded accelerometers Cuong Pham & Patrick Olivier Culture Lab School of Computing Science.
Partially Supervised Classification of Text Documents by Bing Liu, Philip Yu, and Xiaoli Li Presented by: Rick Knowles 7 April 2005.
Ensemble Learning Spring 2009 Ben-Gurion University of the Negev.
Best Practices in Transit Rider Survey Data Collection Chris Tatham Sr. Vice President, CEO, ETC Institute 725 W. Frontier Circle Olathe, KS
Uncertainty in Trip Planning in Transportation Systems A. P. Sistla (Joint work with Booth, Wolfson and Cruz)
A Scalable Machine Learning Approach for Semi-Structured Named Entity Recognition Utku Irmak(Yahoo! Labs) Reiner Kraft(Yahoo! Inc.) WWW 2010(Information.
Human pose recognition from depth image MS Research Cambridge.
CONFIDENTIAL1 Hidden Decision Trees to Design Predictive Scores – Application to Fraud Detection Vincent Granville, Ph.D. AnalyticBridge October 27, 2009.
Active learning Haidong Shi, Nanyi Zeng Nov,12,2008.
Creating Subjective and Objective Sentence Classifier from Unannotated Texts Janyce Wiebe and Ellen Riloff Department of Computer Science University of.
Travel Data and the Smartphone: Building an International Travel Dataset One Android User at a Time GIL TAL MICHAEL NICHOLAS MATTHEW FAVETTI.
Using decision trees to build an a framework for multivariate time- series classification 1 Present By Xiayi Kuang.
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
Learning and Inferring Transportation Routines Lin Liao, Don Patterson, Dieter Fox, Henry Kautz Department of Computer Science and Engineering University.
Introduction to Data Mining Clustering & Classification Reference: Tan et al: Introduction to data mining. Some slides are adopted from Tan et al.
Bayesian Active Learning with Evidence-Based Instance Selection LMCE at ECML PKDD th September 2015, Porto Niall Twomey, Tom Diethe, Peter Flach.
Chapter 6 Activity Recognition from Trajectory Data Yin Zhu, Vincent Zheng and Qiang Yang HKUST November 2011.
Trajectory Data Mining Dr. Yu Zheng Lead Researcher, Microsoft Research Chair Professor at Shanghai Jiao Tong University Editor-in-Chief of ACM Trans.
Combining Models Foundations of Algorithms and Machine Learning (CS60020), IIT KGP, 2017: Indrajit Bhattacharya.
T-Share: A Large-Scale Dynamic Taxi Ridesharing Service
Hyunghoon Cho, Bonnie Berger, Jian Peng  Cell Systems 
Transport mode detection in the city of Lyon using mobile phone sensors Jorge Chong Internship for MLDM M1 Jean Monnet University
Source: Procedia Computer Science(2015)70:
Vijay Srinivasan Thomas Phan
Transportation Mode Recognition using Smartphone Sensor Data
Mitchell Kossoris, Catelyn Scholl, Zhi Zheng
Prepared by: Mahmoud Rafeek Al-Farra
The Combination of Supervised and Unsupervised Approach
Lecture 18: Bagging and Boosting
iSRD Spam Review Detection with Imbalanced Data Distributions
Lecture 06: Bagging and Boosting
Machine Learning with Clinical Data
Stable and Practical AS Relationship Inference with ProbLink
Hyunghoon Cho, Bonnie Berger, Jian Peng  Cell Systems 
Advisor: Dr.vahidipour Zahra salimian Shaghayegh jalali Dec 2017
Presentation transcript:

Transportation mode detection using mobile phones and GIS information Leon Stenneth, Ouri Wolfson, Philip Yu, Bo Xu 1University of Illinois, Chicago

Problem Detecting a mobile user’s current mode of transportation based on GPS and GIS. Possible transportation modes considered are: 2University of Illinois, Chicago

Technique A supervised machine learning model New classification features derived by combining GPS with GIS Trained multiple models with these extracted features and labeled data. 3University of Illinois, Chicago

Motivation Value added services to context detection systems More customized advertisements can be sent Providing more accurate travel demand surveys instead of people manually recording trips and transfers Determining a traveler’s carbon footprint. 4University of Illinois, Chicago

Approach In addition to traditional features on speed, acceleration, and heading change. We build classification features using GPS and GIS data 5University of Illinois, Chicago

Features Traditional – Speed, acceleration, and heading change Combining GPS and GIS – Rail line closeness – Average bus closeness – Candidate bus closeness – Bus stop closeness rate 6University of Illinois, Chicago

Rail line closeness ARLC - average rail line closeness Let {p 1, p 2, p 3, p 4 …p n } be a finite the set of GPS reports submitted within a time window. ARLC = ∑ i=1 to n d i rail / n 7University of Illinois, Chicago

Average bus closeness (ABC) Let {p 1, p 2, p 3, p 4 …p n } be a finite the set of GPS reports submitted within a time window. ABC = (∑ i=1 to n d i bus ) / n 8University of Illinois, Chicago

Candidate Bus closeness (CBC) d j.t bus 1≤j≤m - Euclidian distance to each bus bus j D j - total Euclidian distance to bus j over all reports submitted in the time window D j = ∑ t=1 to n d j.t bus 1≤j≤m Given D j for all the m buses, we compute CBC as follows. CBC = min (D j ) 1≤j≤m 9University of Illinois, Chicago

Bus stop closeness rate (BSCR) | PS | is the number of GPS reports who's Euclidian distance to the closest bus stop is less than the threshold BSCR = | PS | / window size 10University of Illinois, Chicago

Machine learning models We compared five different models then choose the most effective – Random Forest (RF) – Decision trees (DT) – Neural networks (MLP) – Naïve Bayes (NB) – Bayesian Network (BN) WEKA machine learning toolkit 11University of Illinois, Chicago

Results Random Forest was the most effective model Precision and recall accuracy of Random forest shown below 12University of Illinois, Chicago

Feature Ranking Below we rank the features to determine the most effective. 13University of Illinois, Chicago

Results Using the top ranked features only Precision and recall accuracy is shown below 14University of Illinois, Chicago

Deployed System We can provide further information (i.e. route, bus id) on the particular bus one is riding. 15University of Illinois, Chicago

Related work with GPS Liao et. al (2004) – consider the user’s history such as where one parked. Zheng et. al (2008) – Robust set of features and a change point segmentation method. Reddy et. al (2010) – Combined accelerometer and GPS to achieve high accuracy. University of Illinois, Chicago16

Conclusion Using GIS data improves transportation mode detection accuracy. This improvement is more noticeable for motorized transportation modes. Only a subset of our initial set of features are needed. Random forest is the most effective model We can provide further information about the bus that a user is riding 17University of Illinois, Chicago

18University of Illinois, Chicago