Modeling Count Data over Time Using Dynamic Bayesian Networks Jonathan Hutchins Advisors: Professor Ihler and Professor Smyth.

Slides:



Advertisements
Similar presentations
EXPLORIS Montserrat Volcano Observatory Aspinall and Associates Risk Management Solutions An Evidence Science approach to volcano hazard forecasting.
Advertisements

CS188: Computational Models of Human Behavior
Autonomic Scaling of Cloud Computing Resources
CSE 473/573 Computer Vision and Image Processing (CVIP) Ifeoma Nwogu Lecture 27 – Overview of probability concepts 1.
Probabilistic models Jouni Tuomisto THL. Outline Deterministic models with probabilistic parameters Hierarchical Bayesian models Bayesian belief nets.
Dynamic Bayesian Networks (DBNs)
Hidden Markov Models Reading: Russell and Norvig, Chapter 15, Sections
Introduction to Belief Propagation and its Generalizations. Max Welling Donald Bren School of Information and Computer and Science University of California.
Introduction of Probabilistic Reasoning and Bayesian Networks
Anomaly Detection in the WIPER System using A Markov Modulated Poisson Distribution Ping Yan Tim Schoenharl Alec Pawling Greg Madey.
Oklahoma State University Generative Graphical Models for Maneuvering Object Tracking and Dynamics Analysis Xin Fan and Guoliang Fan Visual Computing and.
From: Probabilistic Methods for Bioinformatics - With an Introduction to Bayesian Networks By: Rich Neapolitan.
1 Vertically Integrated Seismic Analysis Stuart Russell Computer Science Division, UC Berkeley Nimar Arora, Erik Sudderth, Nick Hay.
SA-1 Body Scheme Learning Through Self-Perception Jürgen Sturm, Christian Plagemann, Wolfram Burgard.
PatReco: Hidden Markov Models Alexandros Potamianos Dept of ECE, Tech. Univ. of Crete Fall
1 Graphical Models in Data Assimilation Problems Alexander Ihler UC Irvine Collaborators: Sergey Kirshner Andrew Robertson Padhraic Smyth.
HMM-BASED PATTERN DETECTION. Outline  Markov Process  Hidden Markov Models Elements Basic Problems Evaluation Optimization Training Implementation 2-D.
Dynamic Tuning of the IEEE Protocol to Achieve a Theoretical Throughput Limit Frederico Calì, Marco Conti, and Enrico Gregori IEEE/ACM TRANSACTIONS.
Regulatory Network (Part II) 11/05/07. Methods Linear –PCA (Raychaudhuri et al. 2000) –NIR (Gardner et al. 2003) Nonlinear –Bayesian network (Friedman.
1 Distributed Online Simultaneous Fault Detection for Multiple Sensors Ram Rajagopal, Xuanlong Nguyen, Sinem Ergen, Pravin Varaiya EECS, University of.
Probability Grid: A Location Estimation Scheme for Wireless Sensor Networks Presented by cychen Date : 3/7 In Secon (Sensor and Ad Hoc Communications and.
Dynamic Bayesian Networks CSE 473. © Daniel S. Weld Topics Agency Problem Spaces Search Knowledge Representation Reinforcement Learning InferencePlanningLearning.
Probabilistic Analysis of a Large-Scale Urban Traffic Sensor Data Set Jon Hutchins, Alexander Ihler, and Padhraic Smyth Department of Computer Science.
Algorithms for Data Mining and Querying with Graphs Investigators: Padhraic Smyth, Sharad Mehrotra University of California, Irvine Students: Joshua O’
Prediction and Change Detection Mark Steyvers Scott Brown Mike Yi University of California, Irvine This work is supported by a grant from the US Air Force.
Robert M. Saltzman © DS 851: 4 Main Components 1.Applications The more you see, the better 2.Probability & Statistics Computer does most of the work.
Bayesian Filtering for Robot Localization
Activity Detection in Videos Riu Baring CIS 8590 Perception of Intelligent System Temple University Fall 2007.
Anomaly detection with Bayesian networks Website: John Sandiford.
Bayesian Learning By Porchelvi Vijayakumar. Cognitive Science Current Problem: How do children learn and how do they get it right?
Soft Sensor for Faulty Measurements Detection and Reconstruction in Urban Traffic Department of Adaptive systems, Institute of Information Theory and Automation,
Bayesian networks Classification, segmentation, time series prediction and more. Website: Twitter:
K. J. O’Hara AMRS: Behavior Recognition and Opponent Modeling Oct Behavior Recognition and Opponent Modeling in Autonomous Multi-Robot Systems.
第十讲 概率图模型导论 Chapter 10 Introduction to Probabilistic Graphical Models
UIUC CS 498: Section EA Lecture #21 Reasoning in Artificial Intelligence Professor: Eyal Amir Fall Semester 2011 (Some slides from Kevin Murphy (UBC))
Collaborative Sampling in Wireless Sensor Networks Minglei Huang Yu Hen Hu 2010 IEEE Global Telecommunications Conference 1.
ADAPTIVE EVENT DETECTION USING TIME-VARYING POISSON PROCESSES Kdd06 University of California, Irvine.
Processing Sequential Sensor Data The “John Krumm perspective” Thomas Plötz November 29 th, 2011.
Computing & Information Sciences Kansas State University Data Sciences Summer Institute Multimodal Information Access and Synthesis Learning and Reasoning.
CS Statistical Machine learning Lecture 24
The famous “sprinkler” example (J. Pearl, Probabilistic Reasoning in Intelligent Systems, 1988)
Learning to Detect Events with Markov-Modulated Poisson Processes Ihler, Hutchins and Smyth (2007)
State Estimation and Kalman Filtering
Paper Reading Dalong Du Nov.27, Papers Leon Gu and Takeo Kanade. A Generative Shape Regularization Model for Robust Face Alignment. ECCV08. Yan.
Analyzing wireless sensor network data under suppression and failure in transmission Alan E. Gelfand Institute of Statistics and Decision Sciences Duke.
Inferring High-Level Behavior from Low-Level Sensors Donald J. Patterson, Lin Liao, Dieter Fox, and Henry Kautz.
Unsupervised Mining of Statistical Temporal Structures in Video Liu ze yuan May 15,2011.
Markov Chain Monte Carlo for LDA C. Andrieu, N. D. Freitas, and A. Doucet, An Introduction to MCMC for Machine Learning, R. M. Neal, Probabilistic.
Shankar Subramaniam University of California at San Diego Data to Biology.
Uncertain Observation Times Shaunak Chatterjee & Stuart Russell Computer Science Division University of California, Berkeley.
1 Chapter 8: Model Inference and Averaging Presented by Hui Fang.
Review of statistical modeling and probability theory Alan Moses ML4bio.
Statistical Methods. 2 Concepts and Notations Sample unit – the basic landscape unit at which we wish to establish the presence/absence of the species.
Dynamic Programming & Hidden Markov Models. Alan Yuille Dept. Statistics UCLA.
Gibbs Sampling and Hidden Markov Models in the Event Detection Problem By Marc Sobel.
Hidden Markov Model Parameter Estimation BMI/CS 576 Colin Dewey Fall 2015.
Instructor: Eyal Amir Grad TAs: Wen Pu, Yonatan Bisk Undergrad TAs: Sam Johnson, Nikhil Johri CS 440 / ECE 448 Introduction to Artificial Intelligence.
How Dirty is your Data : The Duality between detecting Events and Faults J. Gupchup A. Terzis R. Burns A. Szalay Department of Computer Science Johns Hopkins.
Overview G. Jogesh Babu. R Programming environment Introduction to R programming language R is an integrated suite of software facilities for data manipulation,
Statistica /Statistics Statistics is a discipline that has as its goal the study of quantity and quality of a particular phenomenon in conditions of.
Integrative Genomics I BME 230. Probabilistic Networks Incorporate uncertainty explicitly Capture sparseness of wiring Incorporate multiple kinds of data.
COmbining Probable TRAjectories — COPTRA
WP2 INERTIA Distributed Multi-Agent Based Framework
ICS 280 Learning in Graphical Models
Appendix A: Probability Theory
Course: Autonomous Machine Learning
University of California at San Diego
Estimating Networks With Jumps
Markov Random Fields Presented by: Vladan Radosavljevic.
Learning Probabilistic Graphical Models Overview Learning Problems.
Presentation transcript:

Modeling Count Data over Time Using Dynamic Bayesian Networks Jonathan Hutchins Advisors: Professor Ihler and Professor Smyth

Optical People Counter at a Building Entrance Loop Sensors on Southern California Freeways Sensor Measurements Reflect Dynamic Human Activity

Outline Introduction, problem description Probabilistic model Single sensor results Multiple sensor modeling Future Work

Modeling Count Data In a Poisson distribution: mean = variance = λ p(count|λ) count

mean people count variance Simulated Data 15 weeks, 336 time slots

mean people count variance Building Data

mean people count variance Freeway Data

One Week of Freeway Observations

One Week of Freeway Data

Detecting Unusual Events: Baseline Method Ideal model car count Baseline model car count Unsupervised learning faces a “chicken and egg” dilemma

Persistent Events Notion of Persistence missing from Baseline model

Quantifying Event Popularity Ideal model Baseline model

My contribution Adaptive event detection with time-varying Poisson processes A. Ihler, J. Hutchins, and P. Smyth Proceedings of the 12th ACM SIGKDD Conference (KDD-06), August Baseline method, Data sets, Ran experiments Validation Learning to detect events with Markov-modulated Poisson processes A. Ihler, J. Hutchins, and P. Smyth ACM Transactions on Knowledge Discovery from Data, Dec 2007 Extended the model to include a second event type (low activity) Poisson Assumption Testing Modeling Count Data From Multiple Sensors: A Building Occupancy Model J. Hutchins, A. Ihler, and P. Smyth IEEE CAMSAP 2007,Computational Advances in Multi-Sensor Adaptive Processing, December 2007.

"Graphical models are a marriage between probability theory and graph theory. They provide a natural tool for dealing with two problems that occur throughout applied mathematics and engineering -- uncertainty and complexity” Michael Jordan 1998 Graphical Models

Nodes  variables Directed Graphical Models observed Observed Count hidden EventRate Parameter

Directed Graphical Models Nodes  variables Edges  direct dependencies A B C

Graphical Models: Modularity Observed Count t Observed Count t-2 Observed Count t-1 Observed Count t+2 Observed Count t+1

Graphical Models: Modularity hidden observed Poisson Rate λ(t) Normal Count t-1 Observed Count t Observed Count t-1 Observed Count t+1 Day, Time t-1 Normal Count t-1 Day, Time t Normal Count t-1 Day, Time t+1

Graphical Models: Modularity hidden observed Poisson Rate λ(t) Normal Count t-1 Observed Count t Observed Count t-1 Observed Count t+1 Day, Time t-1 Normal Count t-1 Day, Time t Normal Count t-1 Day, Time t+1

Graphical Models: Modularity Event t Event t-1 Event t+1 hidden observed Poisson Rate λ(t) Normal Count t-1 Observed Count t Observed Count t-1 Observed Count t+1 Day, Time t-1 Normal Count t-1 Day, Time t Normal Count t-1 Day, Time t+1

Graphical Models: Modularity Event t Event t-1 Event t+1 hidden observed Poisson Rate λ(t) Normal Count t-1 Observed Count t Observed Count t-1 Observed Count t+1 Day, Time t-1 Normal Count t-1 Day, Time t Normal Count t-1 Day, Time t+1 Event State Transition Matrix

Event t Event t-1 Event t+1 Event State Transition Matrix Observed Count t Observed Count t-1 Observed Count t+1 Event Count t Event Count t-1 Event Count t+1 hidden observed Poisson Rate λ(t) Normal Count t-1 Day, Time t-1 Normal Count t-1 Day, Time t Normal Count t-1 Day, Time t+1

Event t Event t-1 Event t+1 Event State Transition Matrix Observed Count t Observed Count t-1 Observed Count t+1 Event Count t Event Count t-1 Event Count t+1 hidden observed Poisson Rate λ(t) Normal Count t-1 Day, Time t-1 Normal Count t-1 Day, Time t Normal Count t-1 Day, Time t+1 β α η ηη

Event t Event t-1 Event t+1 Event State Transition Matrix Observed Count t Observed Count t-1 Observed Count t+1 Event Count t Event Count t-1 Event Count t+1 hidden observed Poisson Rate λ(t) Normal Count t-1 Day, Time t-1 Normal Count t-1 Day, Time t Normal Count t-1 Day, Time t+1 Markov Modulated Poisson Process (MMPP) model e.g., see Heffes and Lucantoni (1994), Scott (1998)

Approximate Inference

Gibbs Sampling * ** * * ** * ** * * * * * * * *

* x y ** * * ** *

Block Sampling

Gibbs Sampling Event t Event t-1 Event t+1 Event State Transition Matrix Observed Count t Observed Count t-1 Observed Count t+1 Event Count t Event Count t-1 Event Count t+1 Poisson Rate λ(t) Normal Count t-1 Day, Time t-1 Normal Count t-1 Day, Time t Normal Count t-1 Day, Time t+1

Gibbs Sampling Event t Event t-1 Event t+1 Event State Transition Matrix Observed Count t Observed Count t-1 Observed Count t+1 Event Count t Event Count t-1 Event Count t+1 Poisson Rate λ(t) Normal Count t-1 Day, Time t-1 Normal Count t-1 Day, Time t Normal Count t-1 Day, Time t+1 Poisson Rate λ(t) Poisson Rate λ(t) Event State Transition Matrix Event State Transition Matrix For the ternary valued event variable with chain length of 64,000 Brute force complexity ~

Gibbs Sampling Event t Event t-1 Event t+1 A AA Poisson Rate λ(t) Day, Time t-1 Observed Count t-1 Normal Count t-1 Event Count t-1 Poisson Rate λ(t) Day, Time t-1 Observed Count t-1 Normal Count t-1 Event Count t-1 Poisson Rate λ(t) Day, Time t-1 Observed Count t-1 Normal Count t-1 Event Count t-1

Chicken/Egg Delima car count

Event Popularity car count

Notion of Persistence missing from Baseline model Persistent Event

Detecting Real Events: Baseball Games Total Number Of Predicted Events Graphical Model Detection of the 76 known events Baseline Model Detection of the 76 known events %86.8% %81.6% %72.4% %60.5% Remember: the model training is completely unsupervised, no ground truth is given to the model

Multi-sensor Occupancy Model Modeling Count Data From Multiple Sensors: A Building Occupancy Model J. Hutchins, A. Ihler, and P. Smyth IEEE CAMSAP 2007,Computational Advances in Multi-Sensor Adaptive Processing, December 2007

Where are the People? Building LevelCity Level

Optical People Counter at a Building Entrance Loop Sensors on Southern California Freeways Sensor Measurements Reflect Dynamic Human Activity

Application: Multi-sensor Occupancy Model CalIt2 Building, UC Irvine campus

Building Occupancy, Raw Measurements Occ t = Occ t-1 + inCounts t-1,t – outCounts t-1,t

Building Occupancy: Raw Measurements Noisy sensors make raw measurements of little value Over-counting Under-counting

Adding Noise Model Event t Event t-1 Event State Transition Matrix Event Count t Event Count t-1 Poisson Rate λ(t) Normal Count t-1 Day, Time t-1 Normal Count t-1 Day, Time t Observed Count t Observed Count t-1 True Count t-1 True Count t

Probabilistic Occupancy Model In(Entrance) Sensors Out(Exit) Sensors Occupancy In(Entrance) Sensors Out(Exit) Sensors Constraint Time Occupancy Time tTime t+1

24 hour constraint 47           Constraint Occupancy      Building Occupancy Geometric Distribution, p=0.5

Gibbs Sampling | Forward-Backward | Complexity Learning and Inference In(Entrance) Sensors Out(Exit) Sensors Occupancy In(Entrance) Sensors Out(Exit) Sensors Occupancy

Typical Days Thursday Friday Saturday Building Occupancy

Missing Data Building Occupancy time

Corrupted Data Building Occupancy Thursday Friday

Future Work Freeway Traffic On and Off ramps 2300 sensors 6 months of measurements

Sensor Failure Extension

Spatial Correlation

Four Off-Ramps

Publications Modeling Count Data From Multiple Sensors: A Building Occupancy Model J. Hutchins, A. Ihler, and P. Smyth IEEE CAMSAP 2007,Computational Advances in Multi-Sensor Adaptive Processing, December Learning to detect events with Markov-modulated Poisson processes A. Ihler, J. Hutchins, and P. Smyth ACM Transactions on Knowledge Discovery from Data, Dec 2007 Adaptive event detection with time-varying Poisson processes A. Ihler, J. Hutchins, and P. Smyth Proceedings of the 12th ACM SIGKDD Conference (KDD-06), August Prediction and ranking algorithms for event-based network data J. O Madadhain, J. Hutchins, P. Smyth ACM SIGKDD Explorations: Special Issue on Link Mining, 7(2), 23-30, December 2005