THE JIGSAW CONTINUOUS SENSING ENGINE FOR MOBILE PHONE APPLICATIONS Hong Lu,† Jun Yang,! Zhigang Liu,! Nicholas D. Lane,† Tanzeem Choudhury,† Andrew T.

Slides:



Advertisements
Similar presentations
Applications of one-class classification
Advertisements

Darwin Phones: the Evolution of Sensing and Inference on Mobile Phones Emiliano Miluzzo *, Cory T. Cornelius *, Ashwin Ramaswamy *, Tanzeem Choudhury *,
Outline Activity recognition applications
SirenDetect Alerting Drivers about Emergency Vehicles Jennifer Michelstein Department of Electrical Engineering Adviser: Professor Peter Kindlmann May.
CPSC 502, Lecture 15Slide 1 Introduction to Artificial Intelligence (AI) Computer Science cpsc502, Lecture 15 Nov, 1, 2011 Slide credit: C. Conati, S.
Using Mobile Phones to Determine Transportation Modes Hyeong-il Ko Sasank Reddy et al., ACM Transactions on Sensor Networks, Vol. 6, No. 2,
D u k e S y s t e m s Sensing Meets Mobile Social Networks: The Design, Implementation and Evaluation of the CenceMe Application Emiliano Miluzzo†, Nicholas.
SoundSense: Scalable Sound Sensing for People-Centric Applications on Mobile Phones -Hong LU, Wei Pan, Nicholas D. Lane, Tanzeem Choudhury and Andrew T.
Introduction The aim the project is to analyse non real time EEG (Electroencephalogram) signal using different mathematical models in Matlab to predict.
Activity, Audio, Indoor/Outdoor classification using cell phones Hong Lu, Xiao Zheng Emiliano Miluzzo, Nicholas Lane CS 185 Final Project presentation.
DARWIN PHONES: THE EVOLUTION OF SENSING AND INFERENCE ON MOBILE PHONES PRESENTED BY: BRANDON OCHS Emiliano Miluzzo, Cory T. Cornelius, Ashwin Ramaswamy,
Chapter 4: Linear Models for Classification
NEUROPHONE: BRAIN- MOBILE PHONE INTERFACE USING A WIRELESS EEG HEADSET Andrew T. Campbell, Tanzeem Choudhury, Shaohan Hu, Hong Lu, Matthew K. Mukerjee!,
ACE: Exploiting Correlation for Energy-Efficient and Continuous Context Sensing Suman Nath Microsoft Research MobiSys 2012 Presenter: Jeffrey.
Feature Vector Selection and Use With Hidden Markov Models to Identify Frequency-Modulated Bioacoustic Signals Amidst Noise T. Scott Brandes IEEE Transactions.
Accelerometer-based Transportation Mode Detection on Smartphones
Nov 4, Detection, Classification and Tracking of Targets in Distributed Sensor Networks Presented by: Prabal Dutta Dan Li, Kerry Wong,
Toward Semantic Indexing and Retrieval Using Hierarchical Audio Models Wei-Ta Chu, Wen-Huang Cheng, Jane Yung-Jen Hsu and Ja-LingWu Multimedia Systems,
LYU0103 Speech Recognition Techniques for Digital Video Library Supervisor : Prof Michael R. Lyu Students: Gao Zheng Hong Lei Mo.
TRADING OFF PREDICTION ACCURACY AND POWER CONSUMPTION FOR CONTEXT- AWARE WEARABLE COMPUTING Presented By: Jeff Khoshgozaran.
SENSING MEETS MOBILE SOCIAL NETWORKS: THE DESIGN, IMPLEMENTATION AND EVALUATION OF THE CENCEME APPLICATION Emiliano Miluzzo†, Nicholas D. Lane†, Kristóf.
Speaker Adaptation for Vowel Classification
Language and Speaker Identification using Gaussian Mixture Model Prepare by Jacky Chau The Chinese University of Hong Kong 18th September, 2002.
Online classifier construction algorithm for human activity detection using a tri-axial accelerometer Yen-Ping Chen, Jhun-Ying Yang, Shun-Nan Liou, Gwo-Yun.
Feature Screening Concept: A greedy feature selection method. Rank features and discard those whose ranking criterions are below the threshold. Problem:
Techniques for Efficient Processing in Runahead Execution Engines Onur Mutlu Hyesoon Kim Yale N. Patt.
Jacinto C. Nascimento, Member, IEEE, and Jorge S. Marques
New Challenges in Cloud Datacenter Monitoring and Management
{ NeuroPhone: Brain-Mobile Phone Interface using a Wireless EEG Headset Andrew T. Campbell, Tanzeem Choudhury, Shaohan Hu, Hong Lu, Matthew K. Mukerjee.
Normalization of the Speech Modulation Spectra for Robust Speech Recognition Xiong Xiao, Eng Siong Chng, and Haizhou Li Wen-Yi Chu Department of Computer.
SoundSense: Scalable Sound Sensing for People-Centric Application on Mobile Phones Hon Lu, Wei Pan, Nocholas D. lane, Tanzeem Choudhury and Andrew T. Campbell.
Sensing Meets Mobile Social Networks: The Design, Implementation and Evaluation of the CenceMe Application Emiliano Miluzzo†, Nicholas D. Lane†, Kristóf.
A VOICE ACTIVITY DETECTOR USING THE CHI-SQUARE TEST
“SoundSense: Scalable Sound Sensing for People-Centric Applications on Mobile Phones” Authors: Hong Lu, Wei Pan, Nicholas D. Lane, Tanzeem Choudhury and.
Design, Implementation and Evaluation of CenceMe Application COSC7388 – Advanced Distributed Computing Presentation By Sushil Joshi.
Operated by Los Alamos National Security, LLC for NNSA U N C L A S S I F I E D Slide 1 Sensor network based vehicle classification and license plate identification.
SoundSense by Andrius Andrijauskas. Introduction  Today’s mobile phones come with various embedded sensors such as GPS, WiFi, compass, etc.  Arguably,
Automatically Identifying Localizable Queries Center for E-Business Technology Seoul National University Seoul, Korea Nam, Kwang-hyun Intelligent Database.
Automatic detection of microchiroptera echolocation calls from field recordings using machine learning algorithms Mark D. Skowronski and John G. Harris.
Low-Power Wireless Sensor Networks
Zhiphone: A Mobile Phone that Learns Context and User Preferences Jing Michelle Liu Raphael Hoffmann CSE567 class project, AUT/05.
Learning to Associate: HybridBoosted Multi-Target Tracker for Crowded Scene Present by 陳群元.
Automatically Generating Models for Botnet Detection Presenter: 葉倚任 Authors: Peter Wurzinger, Leyla Bilge, Thorsten Holz, Jan Goebel, Christopher Kruegel,
Similarity Matrix Processing for Music Structure Analysis Yu Shiu, Hong Jeng C.-C. Jay Kuo ACM Multimedia 2006.
Jun-Won Suh Intelligent Electronic Systems Human and Systems Engineering Department of Electrical and Computer Engineering Speaker Verification System.
Authors: Sriram Ganapathy, Samuel Thomas, and Hynek Hermansky Temporal envelope compensation for robust phoneme recognition using modulation spectrum.
Feature Vector Selection and Use With Hidden Markov Models to Identify Frequency-Modulated Bioacoustic Signals Amidst Noise T. Scott Brandes IEEE Transactions.
The Sociometer: A Wearable Device for Understanding Human Networks
Cognitive Radio: Next Generation Communication System
Indoor Location Detection By Arezou Pourmir ECE 539 project Instructor: Professor Yu Hen Hu.
Rule based Context Sensing. Background Context sensing – Sensors in smartphone – Reacts based on operating condition Example – Location based reminder,
Voice Activity Detection based on OptimallyWeighted Combination of Multiple Features Yusuke Kida and Tatsuya Kawahara School of Informatics, Kyoto University,
Counting How Many Words You Read
Sensor testing and validation plans for Phase-1 and Ultimate IPHC_HFT 06/15/ LG1.
Sensing Meets Mobile Social Networks: The Design, Implementation and Evaluation of the CenceMe Application Emiliano Miluzzo†, Nicholas D. Lane†, Kristóf.
Present document contains informations proprietary to France Telecom. Accepting this document means for its recipient he or she recognizes the confidential.
A UDIO B ANDWIDTH D ETECTION IN THE EVS C ODEC University of Sherbrooke, Canada VoiceAge Corporation, Montréal, Canada Fraunhofer IIS, Erlagen, Germany.
Pocket, Bag, Hand, etc. - Automatically Detecting Phone Context through Discovery Emiliano Miluzzoy, Michela Papandreax, Nicholas D. Laney, Hong Luy, Andrew.
ADAPTIVE BABY MONITORING SYSTEM Team 56 Michael Qiu, Luis Ramirez, Yueyang Lin ECE 445 Senior Design May 3, 2016.
Audio Fingerprinting Wes Hatch MUMT-614 Mar.13, 2003.
Emotional Intelligence Vivian Tseng, Matt Palmer, Jonathan Fouk Group #41.
Machine Learning with Spark MLlib
Traffic State Detection Using Acoustics
ARTIFICIAL NEURAL NETWORKS
Vijay Srinivasan Thomas Phan
AUDIO SURVEILLANCE SYSTEMS: SUSPICIOUS SOUND RECOGNITION
Xin Qi, Matthew Keally, Gang Zhou, Yantao Li, Zhen Ren
John H.L. Hansen & Taufiq Al Babba Hasan
NON-NEGATIVE COMPONENT PARTS OF SOUND FOR CLASSIFICATION Yong-Choon Cho, Seungjin Choi, Sung-Yang Bang Wen-Yi Chu Department of Computer Science &
Combination of Feature and Channel Compensation (1/2)
Presentation transcript:

THE JIGSAW CONTINUOUS SENSING ENGINE FOR MOBILE PHONE APPLICATIONS Hong Lu,† Jun Yang,! Zhigang Liu,! Nicholas D. Lane,† Tanzeem Choudhury,† Andrew T. Campbell† †Dartmouth College, !Nokia Research Center, Eric Minner 4/11/2011

Outline  Introduction  Design Considerations  Detailed Design  Implementation  Evaluation  Conclusion

Introduction  Paper presents a implementation and analysis for “long term sensing” on mobile devices.  Attempts to balance performance needs to the resources necessary for continuous sensing.  Emphasis on the accomplishing the following tasks :  Resilient accelerometer data processing  Intelligent microphone processing throttled by the content of the input data.  Adaptive Pipeline processing which reduces the frequency of expensive functions based on other sensed data.

Introduction  Compute higher level inferences and representations of human activities and context.  Goals include :  Function with different phone locations on the body and different orientations.  Allow data to be collected for a complete charge cycle while sustaining normal phone usage.

Design Considerations – Accelerometer Pipeline  Accelerometer sampling has a low energy cost.  The main barrier here is the robustness of inferences.  The effect of the phones location on the body.  The artifacts induced between times of transition, referred to as extraneous activities.

Design Considerations – Accelerometer Pipeline  Proposed techniques to overcome issues :  Use calibration data to model the sensors response from various positions under certain activities.  Use other phones functions to determine if the phone is under a extraneous activity. (i.e. texting, etc)

Design Considerations – Microphone Pipeline  Heavy burden on computational resources due to sampling rate of audio.  To overcome this the application allows three small sound processes in parallel.  regulates how much data enters the pipeline with duty cycle control. Short circuits input to pipelines for common but distinctive sounds.  Also minimizes redundant classification operators when the sound type does not change.

Design Considerations – GPS Pipeline  Most costly sensor in terms of energy consumption.  Key is optimizing the sampling schedule while minimizing the localization error.  Dynamically learn the sampling schedule by using a Markov Decision Process.  Use accelerometer readings to trigger GPS sampling.  Also uses user mobility mode to change the sampling frequency.

Detailed Design – Accelerometer Pipeline  One-Time calibration phase.  4 Stages of Real Time Pipeline  Pre-Processing  Feature Extraction  Activity Classification  Smoothing

Detailed Design – Accelerometer Pipeline  Calibration  Determines offsets and gains to normalize all sensors to the same output unit.  Two options for calibration  User Driven  User places his/her phone in various positions and orientations.  Takes a minute or so.  User Transparent  SW automatically grabs samples when it deems fit and then uses collected data to perform the same analysis as the user driven calibration.  Takes 1 to 2 days to collect enough information.

Detailed Design – Accelerometer Pipeline  Preprocessing  Normalization  Takes raw data normalizes it with respect to gravity.  Admission Control  Rejects extraneous events.  Uses absolute difference between previous and current accel. data.  Projection  Transforms the normalized vectors into vertical and horizontal components.

Detailed Design – Accelerometer Pipeline  Feature Extraction  Computes and tracks :  Time Domain :  Mean  Variance  Mean Crossing Rate  Frequency Domain :  Spectrum Peak  Sub Band energy  Spectral Entropy

Detailed Design – Accelerometer Pipeline  Activity Classification  Uses extracted features to classify into a specific activity

Detailed Design – Accelerometer Pipeline  Smoothing  Provides simple sliding smoother to classification output to reduce outliers.

Detailed Design – Microphone Pipeline  Preprocessing  Regulates resource usage of the microphone pipeline.  Uses longer than traditional frames with no overlap. (64ms samples)  Admission control block throws away packets indicative of silence and reduces duty cycle of sampling/processing.

Detailed Design – Microphone Pipeline  Feature extraction  Optimal set of features is extracted from each frame of audio.  DC component is removed before frequency analysis.

Detailed Design – Microphone Pipeline  Voice classification  Inexpensive yet effective classifier.  Trains decision tree classifier with data set that compromises of almost 1GB of audio.

Detailed Design – Microphone Pipeline  Activity classification  Computationally expensive.  Detects brushing teeth, shower, typing, vacuuming, washing, hands, crowd noise, and street noise.  Similarity detector is used to curb how often the complex GMM Classifier is used.  This compares frames of audio to try and detect if they are similar enough to previous frames.

Detailed Design – Microphone Pipeline  Activity classification (Cont)  Gaussian Mixture Model (GMM)  Likelihood model generated for each activity class  The class with the highest likelihood is compared to a threshold to decide if it is valid or the model returns an “Unknown Sound.”

Detailed Design – Microphone Pipeline  Smoothing  Implemented simple classification smoothing window in favor of more complex models.  Simple and computationally inexpensive.

Detailed Design – GPS Pipeline  Smoothing  Duty cycle triggered by accelerometer readings and throttled by the users mobility pattern.  This allows maximum power savings with minimal latency and maximum performance.  Duty cycle ranges from constant samples to 20 minute intervals.  Performs battery budgeting, slowing down duty cycle as battery life becomes critical.

Implementation  Implemented and evaluated on Nokia N95 and Apple iPhone.  ~2000 lines of base code.  Efficient use of semaphores and async processes allows low CPU cycles in processing portions of code.  Uses a 32Hz sampling rate for the accelerometer and streams 8 kHz, 16-bit, mono audio from the microphone.  Allows different applications to use different parts of the engine at any given time.

Implementation

Evaluation  Accelerometer Performance

Evaluation  Accelerometer Performance  Tested with 4 classifiers with and w/o split and merge process. Decision Tree (DT) Gaussian Model (MG) Support Vector Machine (SVM) Naive Bayes (NB)  Decision tree classifier chosen due to its performance to computational cost ratio.

Evaluation  Microphone Performance

Evaluation  GPS Performance  To evaluate accelerometer inferences are collected along with GPS coordinates on both weekdays and weekends.  Data is used to optimize the sampling time and derive the best performance to power balance.

Conclusion  All 3 pipelines perform very close to significantly more complex implementations.  Very well optimized in both iPhone and N95 device.  Overall very modular design that can be configured and tailored for many practical applications.

Questions

References