June 27-28, 2006 Vikramaditya Jakkula Monitoring Health by Detecting Drifts and Outliers for a Smart Environment Inhabitant Gaurav Jain, Diane J. Cook,

Slides:



Advertisements
Similar presentations
Analysis of : Operator Scheduling in a Data Stream Manager CS561 – Advanced Database Systems By Eric Bloom.
Advertisements

1 Evaluation Rong Jin. 2 Evaluation  Evaluation is key to building effective and efficient search engines usually carried out in controlled experiments.
VTrack: Accurate, Energy-Aware Road Traffic Delay Estimation Using Mobile Phones Arvind Thiagarajan, Lenin Ravindranath, Katrina LaCurts, Sivan Toledo,
FixtureFinder: Discovering the Existence of Electrical and Water Fixtures Vijay Srinivasan*, John Stankovic, Kamin Whitehouse University of Virginia *(Currently.
A Survey on Tracking Methods for a Wireless Sensor Network Taylor Flagg, Beau Hollis & Francisco J. Garcia-Ascanio.
G. Alonso, D. Kossmann Systems Group
Multiple Criteria for Evaluating Land Cover Classification Algorithms Summary of a paper by R.S. DeFries and Jonathan Cheung-Wai Chan April, 2000 Remote.
Vikramaditya Jakkula. MavPad Argus Sensor Network  around 100 Sensors.  include Motion, Devices, Light, Pressure, Humidity and more.
Vikramaditya R. Jakkula, Diane J. Cook & Aaron S. Crandall Washington State University Presented by Aaron S. Crandall.
Vikramaditya Jakkula Washington State University First International Workshop on Smart Homes for Tele-Health.
Civil and Environmental Engineering Carnegie Mellon University Sensors & Knowledge Discovery (a.k.a. Data Mining) H. Scott Matthews April 14, 2003.
Vikramaditya R. Jakkula & Diane J. Cook Washington State University.
TRADING OFF PREDICTION ACCURACY AND POWER CONSUMPTION FOR CONTEXT- AWARE WEARABLE COMPUTING Presented By: Jeff Khoshgozaran.
UNIVERSITY OF SOUTH CAROLINA Department of Computer Science and Engineering On-line Alert Systems for Production Plants A Conflict Based Approach.
Lesson 13-Intrusion Detection. Overview Define the types of Intrusion Detection Systems (IDS). Set up an IDS. Manage an IDS. Understand intrusion prevention.
Research Directions for the Internet of Things Supervised by: Dr. Nouh Sabry Presented by: Ahmed Mohamed Sayed.
Total Quality Management BUS 3 – 142 Statistics for Variables Week of Mar 14, 2011.
Computer Science and Engineering Department The University of Texas at Arlington MavHome: An Intelligent Home Environment.
Methods and Philosophy of Statistical Process Control
Use Cases System/Solution Air Quality Control (Artificial/Natural) Lighting Control (Artificial/Natural) Audio/Visual Control (Playback /Recording /Noise-Reduction.
Smart Home Technologies CSE 4392 / CSE 5392 Spring 2006 Manfred Huber
EMBEDDED SOFTWARE Team victorious Team Victorious.
Adaptive Signal Processing Class Project Adaptive Interacting Multiple Model Technique for Tracking Maneuvering Targets Viji Paul, Sahay Shishir Brijendra,
Presenter: Shant Mandossian EFFECTIVE TESTING OF HEALTHCARE SIMULATION SOFTWARE.
Analysis of Simulation Results Andy Wang CIS Computer Systems Performance Analysis.
SoundSense: Scalable Sound Sensing for People-Centric Application on Mobile Phones Hon Lu, Wei Pan, Nocholas D. lane, Tanzeem Choudhury and Andrew T. Campbell.
Time Series “The Art of Forecasting”. What Is Forecasting? Process of predicting a future event Underlying basis of all business decisions –Production.
CLASS B.Sc.III PAPER APPLIED STATISTICS. Time Series “The Art of Forecasting”
Extracting Places and Activities from GPS Traces Using Hierarchical Conditional Random Fields Yong-Joong Kim Dept. of Computer Science Yonsei.
Chapter 8 Prediction Algorithms for Smart Environments
Verification & Validation
Department of Computer Science Provenance-based Trustworthiness Assessment in Sensor Networks Elisa Bertino CERIAS and Department of Computer Science,
1. There are different assistant software tools and methods that help in managing the network in different things such as: 1. Special management programs.
Vikramaditya Jakkula Washington State University IEEE Workshop of Data Mining in Medicine 2007 (DMMed '07) In conjunction with IEEE.
Streaming Predictions of User Behavior in Real- Time Ethan DereszynskiEthan Dereszynski (Webtrends) Eric ButlerEric Butler (Cedexis) OSCON 2014.
Demo. Overview Overall the project has two main goals: 1) Develop a method to use sensor data to determine behavior probability. 2) Use the behavior probability.
1 ENTROPY-BASED CONCEPT SHIFT DETECTION PETER VORBURGER, ABRAHAM BERNSTEIN IEEE ICDM 2006 Speaker: Li HueiJyun Advisor: Koh JiaLing Date:2007/11/6 1.
Time Series Data Analysis - I Yaji Sripada. Dept. of Computing Science, University of Aberdeen2 In this lecture you learn What are Time Series? How to.
January Smart Environments: Artificial Intelligence in the Home and Beyond Diane J. Cook
Survey on Improving Dynamic Web Performance Guide:- Dr. G. ShanmungaSundaram (M.Tech, Ph.D), Assistant Professor, Dept of IT, SMVEC. Aswini. S M.Tech CSE.
Vikramaditya Jakkula & Diane J. Cook Artificial Intelligence Lab Washington State University 2 nd International Conference on Technology and Aging (ICTA)
Secure Sensor Data/Information Management and Mining Bhavani Thuraisingham The University of Texas at Dallas October 2005.
Experiences from Assessing Daily Activities to Increase Safety and Comfort of Older Persons Living Alone Paul Panek, Peter Mayer, Özge Subasi and Wolfgang.
Data Mining – Intro. Course Overview Spatial Databases Temporal and Spatio-Temporal Databases Multimedia Databases Data Mining.
Systems Analysis and Design in a Changing World, Fourth Edition
Intelligent Environments1 Conclusions and Future Directions.
REU 2004 Computer Science and Engineering Department The University of Texas at Arlington Research Experiences for Undergraduates in Distributed Rational.
Introduction  Use Allen’s Temporal Relations [3] to identify temporal relations among Activities in Daily Life of the resident.  Allen’s relations form.
11 Mayview Regional Service Area Plan (MRSAP) Tracking: Supporting Individuals in the Community June 18, 2008.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology Mining Logs Files for Data-Driven System Management Advisor.
Performance Comparison of Speaker and Emotion Recognition
Learning Objectives Describe what forecasting is Explain time series & its components Smooth a data series –Moving average –Exponential smoothing Forecast.
Vikramaditya R. Jakkula, G. Michael Youngblood and Diane J. Cook AAAI ’06 Workshop on Computational Aesthetics July 16, 2006.
REU 2007 Computer Science and Engineering Department The University of Texas at Arlington Research Experiences for Undergraduates in Information Processing.
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
Computer Science and Engineering Department The University of Texas at Arlington MavHome: An Intelligent Home Environment.
REU 2009 Computer Science and Engineering Department The University of Texas at Arlington Research Experiences for Undergraduates in Information Processing.
Data Mining for Hierarchical Model Creation G. Michael Youngblood and Diane J. Cook IEEE Transactions on Systems, Man, and Cybernetics, Part C, 37(4): ,
Online Sequential Prediction via Incremental Parsing: The Active LeZi Algorithm Source: IEEE Intelligent Systems, 22(1), pp , 2007 Authors: Karthik.
Physiological Data Analysis of Neuro-Critical Patients Using Markov Models By Shashwat Bhoop sb3758.
Kalman Filter and Data Streaming Presented By :- Ankur Jain Department of Computer Science 7/21/03.
Computer Science and Engineering Department The University of Texas at Arlington MavHome: An Intelligent Home Environment.
CASAS Smart Home Project Center for Advanced Studies in Adaptive Systems Washington State University.
Data Mining – Introduction (contd…) Compiled By: Umair Yaqub Lecturer Govt. Murray College Sialkot.
Personal Home Healthcare System for the Cardiac Patient of Smart City Using Fuzzy Logic Shijia Liu.
Non-invasive cardiac pulse measurement
Data Mining Jim King.
Intelligent Contextual Data Stream Monitoring
Assistive “Wearables” for People with Disabilities.
M. Kezunovic (P.I.) S. S. Luo D. Ristanovic Texas A&M University
Presentation transcript:

June 27-28, 2006 Vikramaditya Jakkula Monitoring Health by Detecting Drifts and Outliers for a Smart Environment Inhabitant Gaurav Jain, Diane J. Cook, Vikramaditya Jakkula

June 27-28, 2006 Vikramaditya Jakkula MavHome UTA Project Unique –Focus on entire home House perceives and acts –Sensors –Controllers for devices –Connections to the mobile user and Internet Unified project incorporating varied AI techniques, cross disciplinary with mobile computing, databases, multimedia, and others

June 27-28, 2006 Vikramaditya Jakkula MavHome Goals The goals of an intelligent environment control system should be to 1. Maximize the safety and security of the inhabitant(s) 2. Maximize the comfort of the inhabitant(s) by automating their environment the fullest and most desirable extent possible 3. Minimize the consumption of natural resources in an effort to reduce costs and maximize environment efficiency.

June 27-28, 2006 Vikramaditya Jakkula Environment MavHome Environment  MavDen  MavKitchen  MavPad

June 27-28, 2006 Vikramaditya Jakkula Environment-Contd

June 27-28, 2006 Vikramaditya Jakkula Overview

June 27-28, 2006 Vikramaditya Jakkula Core Technologies Minimal Sequential Patterns Using “ED” Given an input stream S of event occurrences O, ED: 1. Partitions S into Maximal Episodes, Pmax. 2. Creates Itemsets, I, from the Maximal Episodes. 3. Creates a Candidate Significant Episode, C, for each Itemset I, and computes one or more Significance Values, V, for each Candidate. 4. Identifies Significant Episodes by evaluating the Significance Values of the candidates.

June 27-28, 2006 Vikramaditya Jakkula Core Technologies Decision Making using ProPHeT ProPHeT is the main controlling component of the system It uses data filtered through Episode Discovery (ED) to create a Hierarchical Hidden Markov Model (HHMM). HHMM represents a user model that includes all of the episodes (e.g., entering a room, watching TV, sitting in a chair and listening to music, and so forth) that a person performs in the environment.

June 27-28, 2006 Vikramaditya Jakkula Core Technologies

June 27-28, 2006 Vikramaditya Jakkula Need for Health Monitoring Problem Elderly, disabilities and the chronic ill need health care. Personal preference Increased care cost Inadequate infrastructure Solution Low-cost automated health monitoring system at home Lanspery & Hyde state “For most of us, the word ‘home’ evokes powerful emotions [and is] a refuge”

June 27-28, 2006 Vikramaditya Jakkula Drift Detection Algorithm Diurnal algorithm Uses autocorrelation plots Three Steps –Update history –Detect drifts –Report Generation Input: history h, frequency sets, action list and their criticalities OutFile: report file update h with the frequency sets for each action a loop –find the drift type d in action a’s history –send the drift d for action a to the report manager the report manager generates the final report based on the criticality of each action, the current drift parameters and previous drift parameters. n-k Σ ( x i - μ ) (x k - μ ) i=1 (n-k) * σ 2 r k =

June 27-28, 2006 Vikramaditya Jakkula Update History Maintains six-hourly, daily, weekly history queues. Input is four six-hourly frequency sets. Different window sizes are posible Large window vs. small window

June 27-28, 2006 Vikramaditya Jakkula Detecting Drifts Input: action a, history h, reporter r OutFile: drift type d and its parameters p check if action a has drift type d == no drift if yes then –send the drift type and its parameters to the reporter –return to the calling function check if action a has drift type d == cyclic or increasing if yes then –send the drift type and its parameters to the reporter –return to the calling function send the drift type as chaotic to the reporter

June 27-28, 2006 Vikramaditya Jakkula Test for no-drift No-drift? –constant for a significant period of time, and –may have random noise. Only the top half of the autocorrelation plot is used. Why? Test: –autocorrelation plot values < threshold. Why? –Less than 10% of these values should lie outside the (m – 2s, m + 2s) range. Why?

June 27-28, 2006 Vikramaditya Jakkula Test for Cyclic cyclic trend shows high upward peaks in autocorrelation graph

June 27-28, 2006 Vikramaditya Jakkula Test for Sloping High degree of autocorrelation is between adjacent and near- adjacent observations. High value at lag one Value decreases with increase in lag Slope length is the smallest lag at which the values stops decreasing. Note: Random noise is suppressed by the autocorrelation plot

June 27-28, 2006 Vikramaditya Jakkula Test for Chaotic No test for chaotic Anything not yet classified ends up any chaotic Causes: –large number of irregular changes –heavy non-random noise in data in all the windows –sudden large changes in the distribution –seen for a short period of time when drift type changes Reporting of drifts will be discussed after presentation of the outlier detection algorithms

June 27-28, 2006 Vikramaditya Jakkula Outlier Detection Algorithms Two types of outliers –Extremely high or low value in periodic frequency –Occurrence unexpected action in an ordered sequence of actions. Separate algorithms for each –Autocorrelation-based outlier detection Uses drift detection method Outlier if last data point lies outside (m – 3s, m + 3s), Tested for all window sizes. If found outlier, then drift detection is not done. –Prediction-based outlier detection Why a two methods?

June 27-28, 2006 Vikramaditya Jakkula Prediction-based Outlier Detection Live-monitoring method Uses Active LeZi (ALZ) [2] to find the expected pattern in the data. ALZ uses data compression to predict the next action in a sequence. It determines the probability distribution for each action at any point of time. When an action occurs this probability distribution is used to determine if the action is an outlier or not.

June 27-28, 2006 Vikramaditya Jakkula Prediction-based Outlier Detection To determine if an action x is an outlier we calculate the anomaly measure n(x). Two methods are used to calculate anomaly measure. Why? To determine the importance an outlier we calculate the urgency factor u(x), otherwise 1 ρ ( x) * 100 1, if ρ ( x) * 100 < 1 anomaly measure, n1(x) = 1, if ρ ( x ) * 100 <= ρ ( y) ρ ( y) ρ ( x) * 100 anomaly measure, n2 (x) =, otherwise urgency factor, u (x) = n(x) * c(x) report if u (x) >= 0.1

June 27-28, 2006 Vikramaditya Jakkula Report generation for Autocorrelation-based algorithms Which drift or outlier is important to report? Uses –current classification, –the previous classification, –the criticality of the action, and –other parameters (confidence, length of drifts etc.) Three levels –Level one: Critical drifts and outliers –Level two: Important drifts and outliers –Level three: All drifts and outliers

June 27-28, 2006 Vikramaditya Jakkula Report generation Level one –If action criticality is above medium, and either the classification changed from the previous or cycle period changes. Level Two –All outliers –criticality is above medium –Previous classification changes –cycle period changes –confidence changes by some amount Level three –Classification of each action.

June 27-28, 2006 Vikramaditya Jakkula Experiments HMS was tested using both synthetic and real data (activity and health). Five sets –Synthetic set one –Synthetic set two –Real set one –Real set two –Health set Step1: verify algorithms using synthetic sets Step 2: analyze how the algorithm work on real and health sets

June 27-28, 2006 Vikramaditya Jakkula Nature of data Synthetic set one –To test autocorrelation- based algorithm –Hundred days nine action –10639 data points –Random criticalities Synthetic set two –To test prediction-based algorithm –100 data points –Four actions Action nameDescription PerfectCyclic3_On Cyclic; period - three days; No noise. DailyConstant_OnA daily constant; No noise. PerfectIncreasing_OnIncreasing; No noise. NoisyIncresing_OnIncreasing with noise. NoisyCyclic7_On Noisy cyclic with period of a week Cyclic3ToNoisyCylic7 _On cyclic (period 3) to cyclic (period 7) on the day 41 NoisyIncrToNoisyDesc _On noisy increasing to noisy decreasing on the day 41 NoisyDecrToCyclic3_ On noisy decreasing drift to a cyclic with period 3 Changing_On A drift constantly changing between constant, increasing and decreasing with outliers in the data. Synthetic set one

June 27-28, 2006 Vikramaditya Jakkula Nature of data Real data –Activity data from MavPad –Seven weeks Real set one – tested on prediction-based algorithm –Electrical outlets usage, light usage and overhead fan usage. –2163 data points; 79 actions Real set two – tested on autocorrelation-based algorithm –Real set one data plus motion sensor data – data points; 157 actions Health Data – tested on autocorrelation-based algorithm –Systolic, diastolic and hear rate are taken as action –2 months; one value each per day; –each action is associated with its value instead of frequency –Most missing values were added manually

June 27-28, 2006 Vikramaditya Jakkula Experiments using Autocorrelation-based method For Health Data –sensitive to sudden large changes –could detect drifts due to long term trends even with small amounts of noise.

June 27-28, 2006 Vikramaditya Jakkula Figure: Line graph for graph confidence & diastolic vs. number of days for health set. Figure : Line graph for graph confidence & heart rate vs. number of days for health set.

June 27-28, 2006 Vikramaditya Jakkula Reminder Assistance System Automation assistance is beneficial when activities are difficult to perform. Such reminder service would benefit individuals suffering from dementia. Reminders Triggered in two situations: when user queries for next routine activity Critical anomaly is detected.

June 27-28, 2006 Vikramaditya Jakkula Conclusion HMS help us gain information about different types of drifts and outliers that are part of the inhabitant’s lifestyle. Detect anomalies in inhabitants health. Gives information about sudden changes observed in inhabitants health. Successful demonstration of MavHome software Architecture can monitor and provide automated assistance for inhabitants.

June 27-28, 2006 Vikramaditya Jakkula Future Work We are currently collecting health-specific data in the MavHome sites. We will be testing in the living environments of recruited residents at the C.C. Young Retirement Community in Dallas, Texas. Lifestyle Trends and patterns of inhabitants would be analyzed over period of time.

June 27-28, 2006 Vikramaditya Jakkula Thank You