Madalina Fiterau Computer Science Department, Mobilize Center

Slides:



Advertisements
Similar presentations
On-line learning and Boosting
Advertisements

Caroline Rougier, Jean Meunier, Alain St-Arnaud, and Jacqueline Rousseau IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 21, NO. 5,
© Tan,Steinbach, Kumar Introduction to Data Mining 4/18/ Other Classification Techniques 1.Nearest Neighbor Classifiers 2.Support Vector Machines.
Sedentary Lifestyle Are We So Lazy That It’s Killing Us?
Feature/Model Selection by Linear Programming SVM, Combined with State-of-Art Classifiers: What Can We Learn About the Data Erinija Pranckeviciene, Ray.
Measurement of Physical Activity EPHE 348. Why is measurement of PA Important? To specify which aspects are important To monitor changes To evaluate interventions.
Importance Sampling. What is Importance Sampling ? A simulation technique Used when we are interested in rare events Examples: Bit Error Rate on a channel,
Ensemble Learning: An Introduction
Mining Several Databases with an Ensemble of Classifiers Seppo Puuronen Vagan Terziyan Alexander Logvinovsky 10th International Conference and Workshop.
1 A Survey of Affect Recognition Methods: Audio, Visual, and Spontaneous Expressions Zhihong Zeng, Maja Pantic, Glenn I. Roisman, Thomas S. Huang Reported.
Boosting Main idea: train classifiers (e.g. decision trees) in a sequence. a new classifier should focus on those cases which were incorrectly classified.
Classification and Prediction: Regression Analysis
Team HauntQuant October 25th, 2013
3 ème Journée Doctorale G&E, Bordeaux, Mars 2015 Wei FENG Geo-Resources and Environment Lab, Bordeaux INP (Bordeaux Institute of Technology), France Supervisor:
Accelerometers Maria Canino Leah Wagner Leah Sanders.
Graph-based consensus clustering for class discovery from gene expression data Zhiwen Yum, Hau-San Wong and Hongqiang Wang Bioinformatics, 2007.
Estimate of Swimming Energy Expenditure Utilizing an Omnidirectional Accelerometer and Swim Performance Measures Jeanne D. Johnston and Joel M. Stager,
Automatic Ballistocardiogram (BCG) Beat Detection Using a Template Matching Approach Adviser: Ji-Jer Huang Presenter: Zhe-Lin Cai Date:2014/12/24 30th.
Human Activity Recognition Using Accelerometer on Smartphones
Department of Computer and Electrical Engineering A Study of Time-based Features and Regularity of Manipulation to Improve the Detection of Eating Activity.
Exploiting Context Analysis for Combining Multiple Entity Resolution Systems -Ramu Bandaru Zhaoqi Chen Dmitri V.kalashnikov Sharad Mehrotra.
Lesson Overview Lesson Overview What Is Science? Lesson Overview 1.1 What Is Science?
Evaluating Risk Adjustment Models Andy Bindman MD Department of Medicine, Epidemiology and Biostatistics.
Organization of statistical research. The role of Biostatisticians Biostatisticians play essential roles in designing studies, analyzing data and.
Objective Physical Activity Data NHANES The Challenges of Data Management, Processing, and Public Release Hannah G. Lawman, Ph.D., Clinical Health.
Iterative similarity based adaptation technique for Cross Domain text classification Under: Prof. Amitabha Mukherjee By: Narendra Roy Roll no: Group:
Molecular Classification of Cancer: Class Discovery and Class Prediction by Gene Expression Monitoring T.R. Golub et al., Science 286, 531 (1999)
Saisakul Chernbumroong, Shuang Cang, Anthony Atkins, Hongnian Yu Expert Systems with Applications 40 (2013) 1662–1674 Elderly activities recognition and.
Consensus Relevance with Topic and Worker Conditional Models Paul N. Bennett, Microsoft Research Joint with Ece Kamar, Microsoft Research Gabriella Kazai,
Computational Biology Group. Class prediction of tumor samples Supervised Clustering Detection of Subgroups in a Class.
Lesson Overview Lesson Overview What Is Science?.
Cell Segmentation in Microscopy Imagery Using a Bag of Local Bayesian Classifiers Zhaozheng Yin RI/CMU, Fall 2009.
The New Primary National Curriculum
Constructing a Predictor to Identify Drug and Adverse Event Pairs
Comparison of Four "Time in Intensity“ Physical Activity Indices as
Name: Sushmita Laila Khan Affiliation: Georgia Southern University
How to forecast solar flares?
Semi-Supervised Clustering
Cluster Analysis II 10/03/2012.
Sample Power No reading, class notes only
Classification with Gene Expression Data
Day-to-day variability in older adults' physical activity:
Reliability and Validity
Associations between Total Daily Step Counts and
Evaluating Techniques for Image Classification
Class Imbalance, Redux Byron C Wallace,1,2 Kevin Small,1
Association between two categorical variables
Alan Qi Thomas P. Minka Rosalind W. Picard Zoubin Ghahramani
Basic machine learning background with Python scikit-learn
Exercise Intensity and Blood Sugar
Chapter Three Research Design.
Fenglong Ma1, Jing Gao1, Qiuling Suo1
Mitchell Kossoris, Catelyn Scholl, Zhi Zheng
Data Mining Practical Machine Learning Tools and Techniques
A weight-incorporated similarity-based clustering ensemble method based on swarm intelligence Yue Ming NJIT#:
Data fusion classification method based on Multi agents system
Hidden Markov Models Part 2: Algorithms
Measuring Sedentary Behavior: Epoch- and Hour-level
Detecting the Learning Value of Items In a Randomized Problem Set
Anindya Maiti, Murtuza Jadliwala, Jibo He Igor Bilogrevic
Chap. 7 Regularization for Deep Learning (7.8~7.12 )
iSRD Spam Review Detection with Imbalanced Data Distributions
Abdur Rahman Department of Statistics
Ensemble learning Reminder - Bagging of Trees Random Forest
A.M. CLARKE-CORNWELL1, P.A. COOK1 and M.H.GRANAT1
A Classification-based Approach to Question Routing in Community Question Answering Tom Chao Zhou 22, Feb, 2010 Department of Computer.
William Fadel, Ph.D. August 1, 2018
MultiModality Registration using Hilbert-Schmidt Estimators
Lesson Overview 1.1 What Is Science?.
Feature computation and classification of grating pitch.
Presentation transcript:

Accelerometer wear and non-wear Classification using an Ensemble of Unsupervised Predictors Madalina Fiterau mfiterau@cs.stanford.edu Computer Science Department, Mobilize Center Jennifer Hicks jenhicks@stanford.edu Scott Delp delp@stanford.edu Bioengineering Department, Mobilize Center Manisha Desai manishad@stanford.edu Thomas Robinson tom.robinson@stanford.edu Jorge Armando Banda jabanda@stanford.edu Farish Haydel kfhaydel@stanford.edu Kristopher I Kapphahn <kikapp@stanford.edu> Manoj Kumar Sharma <manojs@stanford.edu> Hyatt Errol Moore IV <hyatt4@stanford.edu> Stanford University, School of Medicine PHYSICAL ACTIVITY MONITORING ENSEMBLE FOR WEAR DETECTION RESULTS Important for understanding wide array of health problems Obesity, diabetes, heart disease Cancers, osteoporosis, depression Typically performed with wrist and/or hip worn accelerometers Challenges The device is not worn all the time Non-wear looks similar to sleep and sedentary behavior No ground truth on wear time Meaningful analysis depends on unsupervised weartime detection Stanford GOALS dataset[1] 260 subjects, 7-11 year old chidren affected by obesity Triaxial ActiGraph GT3X+ (40Hz), collected 24-hours for 7 days Physical, physiological, behavioral and psychosocial measures States of activity and sleep are not known in this data set Objective: determine the best algorithms to use to discern non-wear/wear particularly for the Stanford GOALS data EXPERT PREDICTIONS ACCURACY ESTIMATION Each algorithm will be used as a base classifier in the ensemble The accuracy of each method will be estimated in an unsupervised way Prediction IS weighted according to experts’ estimated accuracy Different parametrizations of the same algorithm can be included in the ensemble as separate experts Key idea: Estimate agreement over sets of experts, write equations/inequalities linking consistency to accuracy, solve system of equations to obtain accuracy estimates Algorithm Mean (||Estimated - Actual Error||) over subjects Sleep Study Simulated Data Choi 0.0514 0.0107 0.0502 0.0104 NhanesX 0.0094 0.0469 0.0100 NhanesY 0.0527 0.0120 0.0479 0.0125 NahnesZ 0.0494 0.0131 0.0466 NhanesS 0.0498 0.0110 0.0532 0.0136 PadacoX 0.0116 0.0508 0.0126 PadacoY 0.0478 0.0114 0.0492 0.0109 PadacoZ 0.0489 0.0497 0.0128 PVecMag 0.0485 0.0121 0.0545 GMM 0.0512 0.0102 0.0519 METHOD OF MOMENTS FOR ERROR ESTIMATION TOP PERFORMING EXPERTS Agreement between experts i and j E{i} is the error event for expert i A{i,j} is the agreement between i and j a{i,j} = P(E{i}∩E{j}) + P(Ē{i}∩Ē{j}) a{i,j} = 1 – e{i} – e{j} +2e{i,j} Agreement between experts in a set A eA is the probability that all functions in A are wrong aA= P(∩i∈AE{i}) + P(∩i∈AĒ {i}) aA= eA + 1 + ∑I⊆A (-1)|I| eI Objective: minimize dependence between error rates c(e) = ∑A:|A|≥2 (eA - Πi∈Aei)2 subject to eA ≤ mini∈A eA\i Percentage of correctly identified top performing experts Sleep Study Simulated Data Top1 67.74 66.67 Top2 90.32 90 Top3 70.97 70 Top4 100 Top5 WEARTIME ACCURACY Accuracy averaged over subjects Sleep Study Simulated Data Vote 0.3938 0.1548 0.3830 0.1452 Best estimated 0.7464 0.2855 0.7386 0.2871 Best (oracle) WEARTIME DETECTION ALGORITHMS NHANES[2] Based on sequences of 0 counts Applied to each axis separately R ‘accelerometry’ CHOI[3] Uses vector magnitude R ‘Physical Activity’ PADACO[4] Uses smoothing CONCLUSIONS EVALUATION DATA There is no universally best weartime detection algorithm. We can reasonably estimate the accuracy of each individual expert based on agreement using the method of moments. We are able to select the best classifier for each subject. The original data do not have labels Data from a sleep study is used in the evaluation Ground truth is available 31 subjects, 12 hours of monitoring Each algorithm performs better in certain cases We estimate expert accuracy for each subject separately Simulated data based on Stanford GOALS, with added periods of non-wear sampled from known non-wear in Sleep Data References: [1] Robinson et al. Family, Community and Clinic Collaboration to Treat Overweight and Obese Children: Stanford GOALS. Contemporary clinical trials. 2013. [2] http://riskfactor.cancer.gov/tools/nhanes_pam [3] Choi, Leena, et al. "Validation of accelerometer wear and nonwear time classification algorithm." Medicine and science in sports and exercise, 2011 [4] http://web.stanford.edu/~hyatt4/pages/software_padaco