Traffic State Detection Using Acoustics

Slides:



Advertisements
Similar presentations
KARAOKE FORMATION Pratik Bhanawat (10bec113) Gunjan Gupta Gunjan Gupta (10bec112)
Advertisements

Entropy and Dynamism Criteria for Voice Quality Classification Applications Authors: Peter D. Kukharchik, Igor E. Kheidorov, Hanna M. Lukashevich, Denis.
Improvement of Audio Capture in Handheld Devices through Digital Filtering Problem Microphones in handheld devices are of low quality to reduce cost. This.
THE JIGSAW CONTINUOUS SENSING ENGINE FOR MOBILE PHONE APPLICATIONS Hong Lu,† Jun Yang,! Zhigang Liu,! Nicholas D. Lane,† Tanzeem Choudhury,† Andrew T.
Dual-domain Hierarchical Classification of Phonetic Time Series Hossein Hamooni, Abdullah Mueen University of New Mexico Department of Computer Science.
SoundSense: Scalable Sound Sensing for People-Centric Applications on Mobile Phones -Hong LU, Wei Pan, Nicholas D. Lane, Tanzeem Choudhury and Andrew T.
1 A scheme for racquet sports video analysis with the combination of audio-visual information Visual Communication and Image Processing 2005 Liyuan Xing,
Nov 4, Detection, Classification and Tracking of Targets in Distributed Sensor Networks Presented by: Prabal Dutta Dan Li, Kerry Wong,
Multiple Criteria for Evaluating Land Cover Classification Algorithms Summary of a paper by R.S. DeFries and Jonathan Cheung-Wai Chan April, 2000 Remote.
Toward Semantic Indexing and Retrieval Using Hierarchical Audio Models Wei-Ta Chu, Wen-Huang Cheng, Jane Yung-Jen Hsu and Ja-LingWu Multimedia Systems,
LYU0103 Speech Recognition Techniques for Digital Video Library Supervisor : Prof Michael R. Lyu Students: Gao Zheng Hong Lei Mo.
Page 0 of 8 Time Series Classification – phoneme recognition in reconstructed phase space Sanjay Patil Intelligent Electronics Systems Human and Systems.
Classifying Motion Picture Audio Eirik Gustavsen
Speaker Adaptation for Vowel Classification
Gaussian Mixture-Sound Field Landmark Model for Robot Localization Talker: Prof. Jwu-Sheng Hu Department of Electrical and Control Engineering National.
Advisor: Prof. Tony Jebara
 Day by day increasing population increases traffic and vehicles also which causes problems in transportation and management of traffic.  Mainly.
Representing Acoustic Information
EE513 Audio Signals and Systems Statistical Pattern Classification Kevin D. Donohue Electrical and Computer Engineering University of Kentucky.
LE 460 L Acoustics and Experimental Phonetics L-13
GCT731 Fall 2014 Topics in Music Technology - Music Information Retrieval Overview of MIR Systems Audio and Music Representations (Part 1) 1.
SoundSense: Scalable Sound Sensing for People-Centric Application on Mobile Phones Hon Lu, Wei Pan, Nocholas D. lane, Tanzeem Choudhury and Andrew T. Campbell.
Motivation for ITS Too many vehicles, too little road Infrastructure growth slow due to lack of funds, space and bureaucratic issues Alleviating problem.
“SoundSense: Scalable Sound Sensing for People-Centric Applications on Mobile Phones” Authors: Hong Lu, Wei Pan, Nicholas D. Lane, Tanzeem Choudhury and.
Operated by Los Alamos National Security, LLC for NNSA U N C L A S S I F I E D Slide 1 Sensor network based vehicle classification and license plate identification.
SoundSense by Andrius Andrijauskas. Introduction  Today’s mobile phones come with various embedded sensors such as GPS, WiFi, compass, etc.  Arguably,
9 th Conference on Telecommunications – Conftele 2013 Castelo Branco, Portugal, May 8-10, 2013 Sara Candeias 1 Dirce Celorico 1 Jorge Proença 1 Arlindo.
Detecting Movement Type by Route Segmentation and Classification Karol Waga, Andrei Tabarcea, Minjie Chen and Pasi Fränti.
By Sarita Jondhale1 Pattern Comparison Techniques.
Cisco Public © 2012 Cisco and/or its affiliates. All rights reserved. 1.
REVISED CONTEXTUAL LRT FOR VOICE ACTIVITY DETECTION Javier Ram’ırez, Jos’e C. Segura and J.M. G’orriz Dept. of Signal Theory Networking and Communications.
Prediction of Traffic Density for Congestion Analysis under Indian Traffic Conditions Proceedings of the 12th International IEEE Conference on Intelligent.
MUMT611: Music Information Acquisition, Preservation, and Retrieval Presentation on Timbre Similarity Alexandre Savard March 2006.
Experimental Results ■ Observations:  Overall detection accuracy increases as the length of observation window increases.  An observation window of 100.
Sound-Event Partitioning and Feature Normalization for Robust Sound-Event Detection 2 Department of Electronic and Information Engineering The Hong Kong.
Look who’s talking? Project 3.1 Yannick Thimister Han van Venrooij Bob Verlinden Project DKE Maastricht University.
Singer similarity / identification Francois Thibault MUMT 614B McGill University.
Duraid Y. Mohammed Philip J. Duncan Francis F. Li. School of Computing Science and Engineering, University of Salford UK Audio Content Analysis in The.
Voice Activity Detection based on OptimallyWeighted Combination of Multiple Features Yusuke Kida and Tatsuya Kawahara School of Informatics, Kyoto University,
Arlindo Veiga Dirce Celorico Jorge Proença Sara Candeias Fernando Perdigão Prosodic and Phonetic Features for Speaking Styles Classification and Detection.
IIT Bombay 17 th National Conference on Communications, Jan. 2011, Bangalore, India Sp Pr. 1, P3 1/21 Detection of Burst Onset Landmarks in Speech.
Pocket, Bag, Hand, etc. - Automatically Detecting Phone Context through Discovery Emiliano Miluzzoy, Michela Papandreax, Nicholas D. Laney, Hong Luy, Andrew.
ADAPTIVE BABY MONITORING SYSTEM Team 56 Michael Qiu, Luis Ramirez, Yueyang Lin ECE 445 Senior Design May 3, 2016.
Research Methodology Proposal Prepared by: Norhasmizawati Ibrahim (813750)
Speech Recognition through Neural Networks By Mohammad Usman Afzal Mohammad Waseem.
SEMINAR ON TRAFFIC MANAGEMENT USING IMAGE PROCESSING by Smruti Ranjan Mishra (1AY07IS072) Under the guidance of Prof Mahesh G. Acharya Institute Of Technology.
IMAGE PROCESSING APPLIED TO TRAFFIC QUEUE DETECTION ALGORITHM.
Traffic Simulation L2 – Introduction to simulation Ing. Ondřej Přibyl, Ph.D.
Sparse Coding: A Deep Learning using Unlabeled Data for High - Level Representation Dr.G.M.Nasira R. Vidya R. P. Jaia Priyankka.
[1] National Institute of Science & Technology Technical Seminar Presentation 2004 Suresh Chandra Martha National Institute of Science & Technology Audio.
Audio Fingerprinting Wes Hatch MUMT-614 Mar.13, 2003.
University of Rochester
MART: Music Assisted Running Trainer
My Tiny Ping-Pong Helper
ARTIFICIAL NEURAL NETWORKS
Presentation on Artificial Neural Network Based Pathological Voice Classification Using MFCC Features Presenter: Subash Chandra Pakhrin 072MSI616 MSC in.
How to measure and analyze traffic noise?
BotCatch: A Behavior and Signature Correlated Bot Detection Approach
Dead Reckoning, a location tracking app for Android™ smartphones Nisarg Patel Mentored by Adam Schofield and Michael Caporellie Introduction Results (cont.)
Vijay Srinivasan Thomas Phan
Urban Sound Classification with a Convolution Neural Network
Vehicle Segmentation and Tracking in the Presence of Occlusions
EE513 Audio Signals and Systems
AUDIO SURVEILLANCE SYSTEMS: SUSPICIOUS SOUND RECOGNITION
John H.L. Hansen & Taufiq Al Babba Hasan
SMART TRAFFIC MANAGEMENT SYSTEM
Audio Compression Techniques
Endpoint Detection ( 端點偵測)
Anthor: Andreas Tsiartas, Prasanta Kumar Ghosh,
THE ASSISTIVE SYSTEM SHIFALI KUMAR BISHWO GURUNG JAMES CHOU
Presentation transcript:

Traffic State Detection Using Acoustics Nitakshi Sood, Arshvir Kaur, Bhavdeep Sachdeva, Dinesh Vij, Naveen Aggarwal DIC, UIET, Panjab University, Chandigarh Introduction Initial Findings Results Train set = 161 recordings per class Test set = 69 recordings per class Testing repeated for 6 times The algorithm below describes the binary classification techniques applied. The table shows the classification accuracy (in %) of the traffic classes. Traffic Scenes Considered • BusyStreet: Heavy flow traffic means the congested road or the jammed situation. Vehicle speed between 0-20 kmph • MediumStreet: traffic state in which volume of vehicles is moderate. Vehicle speed between 20-50 kmph • QuietStreet : This is the traffic scene with little traffic. Vehicle speed between more than 50 kmph Objectives : The traffic and number of vehicles on roads are increasing with an unstoppable pace, which in turn leads to the problem of traffic congestion. We propose the use of Acoustics to determine : (a) How efficient and accurate do acoustic signals turn out to be in the detection of Traffic events. (b) If acoustics could be used as a standalone parameter to detect Traffic events. Traffic state Acoustic Analysis – What and Why? Data Collection • Crowd sourced data collected from 7 different Smart Phones using “Smart Voice Recorder by SmartMobv1.71” application. • Phones used: Nexus, Samsung S3/S4/Grand, lenovo,Mi4. • 1000 recordings of length 30 seconds approximately - for each traffic scene, collected in almost 15 sessions. • Recorded at 16 KHz Sample Rate and 16-bit Bit Rate. • Additional data captured such as location of the phone, whether the phone was in hand/pocket/bag, associated AndroSensor file, number of microphones in the phone that was used for recording, air (still or turbulent), and environmental conditions. • Acoustic Event: Acoustic events represents all the events that are present in audio signals; such as engine start, engine stop, horn, music, speech, etc. • Acoustic Scene: These events usually occur within different acoustic scenes such as car, bus, train, busy street, quiet street, etc. What can we get on capturing an Acoustic signal? Advantages of using acoustics: Relatively inexpensive sensors. Less energy consumption. Not affected by visual occlusions/orientation/lighting conditions. Works for non-lane driven traffic. Prominent at slower speeds Prominent at faster speeds Others Honks Tire noise Environmental noise Engine-idling noise Engine noise Exhaust noise Air Turbulence Classifier Congestion vs Non- congestion Medium vs Free flow Overall accuracy GMM 77.8% 61.6% 69.7% SVM 92.7% 94.2% 93.2% Neural Network 90% 93.5% 91.75% Classes Number of recordings Heavy flow 280 Medium flow Free flow Quiet street 160 Two level classification greatly improves the overall classification accuracy. Also the results show that SVM and NN out perform GMM. Conclusion • Extensive data collection: inside pocket, inside bag, windy conditions, rainy season, on different vehicles is required for future. •Two level classification done with Hamming window function and frame size of 8192 with suitable feature set of (MFCC+SC+EE) gives the best results for traffic state detection. Features • Temporal features : Zero Crossing Rate (ZCR) Short Time Energy (STE) Root Mean Square(RMS) Energy Entropy (EE) • Spectral features : Spectral Centroid (SC) Spectral Roll Off Point (SRP) Spectral Flux (SF) Mel Frequency Ceptral Coefficients (MFCC) Process of Traffic State Detection using Acoustics Future Work • Early Detection: Minimum amount of recording needed by the classifier to correctly identify a scene. • Triggering Mechanism: Audio would not be recorded continuously. Some event such as accelerometer would trigger the microphone of smartphone. • Convulational Neural Network Implementation: To implement Deep learning to produce efficient accuracy. Experiments on Frame size and Window Type Our approach Rather than using Infrastructure Based Solutions (installing sensors into vehicles and road ways)  Crowd sourced data from Smart Phones using app. To make it efficient, experiments need to be done on: • Extensive Data Collection • Window Size & Type • Early Detection • Feature Set • Classification Technique • Role of Learning • The audio signal captured was segmented using small windows. • Therefore, primary analysis window (Hamming Window) of various sizes as illustrated were tested in this work. • Classifier used: GMM, SVM, Neural Network Contact details For further information contact us at : - navagg@gmail.com - vijdinesh@gmail.com - bhavdeep.mailme@gmail.com - nitakshi.sood@gmail.com arshviranttal77@gmail.com Acknowledgement We would like to thank the Design Innovation Centre (DIC), UIET for extending support in various forms.