Towards a Model-Based Data Collection Framework for Environmental Monitoring Networks Research Proposal Jayant Gupchup Department of Computer Science,

Slides:



Advertisements
Similar presentations
Component Analysis (Review)
Advertisements

Principal Component Analysis Based on L1-Norm Maximization Nojun Kwak IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008.
Xiaolong Zheng, Zhichao Cao, Jiliang Wang, Yuan He, and Yunhao Liu SenSys 2014 ZiSense Towards Interference Resilient Duty Cycling in Wireless Sensor Networks.
Life Under Your Feet Johns Hopkins University Computer Science Earth and Planetary Sciences Physics and Astronomy
Jayant Gupchup Graduate student, Johns Hopkins University Representative Slides.
A Generic Framework for Handling Uncertain Data with Local Correlations Xiang Lian and Lei Chen Department of Computer Science and Engineering The Hong.
Computer Vision Spring ,-685 Instructor: S. Narasimhan Wean 5403 T-R 3:00pm – 4:20pm Lecture #20.
Robust Moving Object Detection & Categorization using self- improving classifiers Omar Javed, Saad Ali & Mubarak Shah.
An introduction to Principal Component Analysis (PCA)
On Computing Compression Trees for Data Collection in Wireless Sensor Networks Jian Li, Amol Deshpande and Samir Khuller Department of Computer Science,
Probabilistic Aggregation in Distributed Networks Ling Huang, Ben Zhao, Anthony Joseph and John Kubiatowicz {hling, ravenben, adj,
Principal Component Analysis
Distributed Regression: an Efficient Framework for Modeling Sensor Network Data Carlos Guestrin Peter Bodik Romain Thibaux Mark Paskin Samuel Madden.
Self-Correlating Predictive Information Tracking for Large-Scale Production Systems Zhao, Tan, Gong, Gu, Wambolt Presented by: Andrew Hahn.
Microsoft E-Science Data Storage Model for Environmental Monitoring Wireless Sensor Networks Jayant Gupchup †, R.
Context Compression: using Principal Component Analysis for Efficient Wireless Communications Christos Anagnostopoulos & Stathes Hadjiefthymiades Pervasive.
1 Data Quality and Query Cost in Wireless Sensor Networks David Yates, Erich Nahum, Jim Kurose, and Prashant Shenoy IEEE PerCom 2008.
Multi-Scale Analysis for Network Traffic Prediction and Anomaly Detection Ling Huang Joint work with Anthony Joseph and Nina Taft January, 2005.
Approximate data collection in sensor networks the appeal of probabilistic models David Chu Amol Deshpande Joe Hellerstein Wei Hong ICDE 2006 Atlanta,
A New Household Security Robot System Based on Wireless Sensor Network Reporter :Wei-Qin Du.
Model-Driven Data Acquisition in Sensor Networks - Amol Deshpande et al., VLDB ‘04 Jisu Oh March 20, 2006 CS 580S Paper Presentation.
Privacy Preservation for Data Streams Feifei Li, Boston University Joint work with: Jimeng Sun (CMU), Spiros Papadimitriou, George A. Mihaila and Ioana.
Model-driven Data Acquisition in Sensor Networks Amol Deshpande 1,4 Carlos Guestrin 4,2 Sam Madden 4,3 Joe Hellerstein 1,4 Wei Hong 4 1 UC Berkeley 2 Carnegie.
Energy-efficient Self-adapting Online Linear Forecasting for Wireless Sensor Network Applications Jai-Jin Lim and Kang G. Shin Real-Time Computing Laboratory,
Department of Computer Science University of Massachusetts, Amherst PRESTO: Feedback-driven Data Management in Sensor Network Ming Li, Deepak Ganesan,
Classification and Prediction: Regression Analysis
Face Recognition Using Neural Networks Presented By: Hadis Mohseni Leila Taghavi Atefeh Mirsafian.
SensEye: A Multi-Tier Camera Sensor Network by Purushottam Kulkarni, Deepak Ganesan, Prashant Shenoy, and Qifeng Lu Presenters: Yen-Chia Chen and Ivan.
Active Learning for Probabilistic Models Lee Wee Sun Department of Computer Science National University of Singapore LARC-IMS Workshop.
Decentralised Coordination of Mobile Sensors School of Electronics and Computer Science University of Southampton Ruben Stranders,
Extracting Places and Activities from GPS Traces Using Hierarchical Conditional Random Fields Yong-Joong Kim Dept. of Computer Science Yonsei.
Empirical Modeling Dongsup Kim Department of Biosystems, KAIST Fall, 2004.
Summarized by Soo-Jin Kim
Gwangju Institute of Science and Technology Intelligent Design and Graphics Laboratory Multi-scale tensor voting for feature extraction from unstructured.
IDIES Temporal Integrity Challenges in Long-term Environmental Monitoring Sensor Networks. Jayant Gupchup † Alex.
Life Under Your Feet. Sensor Network Design Philosophies Use low cost components No access to line power –Deployed in remote locations Radio is the.
Distributed Anomaly Detection in Wireless Sensor Networks Ksutharshan Rajasegarar, Christopher Leckie, Marimutha Palaniswami, James C. Bezdek IEEE ICCS2006(Institutions.
Time Series Data Analysis - I Yaji Sripada. Dept. of Computing Science, University of Aberdeen2 In this lecture you learn What are Time Series? How to.
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: Deterministic vs. Random Maximum A Posteriori Maximum Likelihood Minimum.
Benk Erika Kelemen Zsolt
Dynamic Multi-resolution Data Dissemination in Storage-centric Wireless Sensor Networks Hongbo Luo; Guoliang Xing; Minming Li; Xiaohua Jia Department of.
Microsoft E-Science Data Storage Model for Environmental Monitoring Wireless Sensor Networks Jayant Gupchup †, R.
Compressive Sensing for Multimedia Communications in Wireless Sensor Networks By: Wael BarakatRabih Saliba EE381K-14 MDDSP Literary Survey Presentation.
Using Polynomial Approximation as Compression and Aggregation Technique in Wireless Sensor Networks Bouabdellah KECHAR Oran University.
Gap-filling and Fault-detection for the life under your feet dataset.
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: ML and Simple Regression Bias of the ML Estimate Variance of the ML Estimate.
A Passive Approach to Sensor Network Localization Rahul Biswas and Sebastian Thrun International Conference on Intelligent Robots and Systems 2004 Presented.
Jayant Gupchup Phoenix, EWSN 2010 Phoenix: An Epidemic Approach to Time Reconstruction Jayant Gupchup †, Douglas Carlson †, Răzvan Musăloiu-E. †,*, Alex.
CSE 185 Introduction to Computer Vision Face Recognition.
ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition Objectives: Reestimation Equations Continuous Distributions.
Secure In-Network Aggregation for Wireless Sensor Networks
Yanlei Diao, University of Massachusetts Amherst Future Directions in Sensor Data Management Yanlei Diao University of Massachusetts, Amherst.
Ching-Wa Yip Johns Hopkins University.  Alex Szalay (JHU)  Rosemary Wyse (JHU)  László Dobos (ELTE)  Tamás Budavári (JHU)  Istvan Csabai (ELTE)
Fast Query-Optimized Kernel Machine Classification Via Incremental Approximate Nearest Support Vectors by Dennis DeCoste and Dominic Mazzoni International.
Model Based Event Detection in Sensor Networks Jayant Gupchup, Andreas Terzis, Randal Burns, Alex Szalay.
Principal Component Analysis and Linear Discriminant Analysis for Feature Reduction Jieping Ye Department of Computer Science and Engineering Arizona State.
Experiences and Challenges in Campaign Style Deployments using Wireless Sensor Networks Jayant Gupchup †, Scott Pitz *, Douglas Carlson †, Chih-Han Chang.
Locations. Soil Temperature Dataset Observations Data is – Correlated in time and space – Evolving over time (seasons) – Gappy (Due to failures) – Faulty.
Tree and Forest Classification and Regression Tree Bagging of trees Boosting trees Random Forest.
Martina Uray Heinz Mayer Joanneum Research Graz Institute of Digital Image Processing Horst Bischof Graz University of Technology Institute for Computer.
How Dirty is your Data : The Duality between detecting Events and Faults J. Gupchup A. Terzis R. Burns A. Szalay Department of Computer Science Johns Hopkins.
Kalman Filter and Data Streaming Presented By :- Ankur Jain Department of Computer Science 7/21/03.
Bias Management in Time Changing Data Streams We assume data is generated randomly according to a stationary distribution. Data comes in the form of streams.
Towards Optimal Sleep Scheduling in Sensor Networks for Rare-Event Detection Qing Cao, Tarek Abdelzaher, Tian He, John Stankovic Department of Computer.
Enabling Real Time Alerting through streaming pattern discovery Chengyang Zhang Computer Science Department University of North Texas 11/21/2016 CRI Group.
Principal Component Analysis
Fast Approximate Query Answering over Sensor Data with Deterministic Error Guarantees Chunbin Lin Joint with Etienne Boursier, Jacque Brito, Yannis Katsis,
Vijay Srinivasan Thomas Phan
Feature Selection Methods
Principal Component Analysis
Presentation transcript:

Towards a Model-Based Data Collection Framework for Environmental Monitoring Networks Research Proposal Jayant Gupchup Department of Computer Science, Johns Hopkins University †

75 m

Background – II (motes) Computing, Storage Communication (radio) Sensors SensorPower Barometric Pressure10 μW Humidity/Temperature80 μW Soil Moisture19.6 mW “Sending one packet costs same energy as thousands of CPU cycles” – Matt Welsh, Harvard ComponentPower Radio (CC2420) RX38 mW Radio TX35 mW Microcontroller (TI MSP Mhz)3 mW 3.6 V 19.0 Ah

All data are not equal

Task list  Define “Informative Periods”  Algorithm : Find Informative (or interesting) Periods  Algorithm : Sampling Planner based on the interesting periods  Evaluation

Initial Direction & Main Results  Principal Component Analysis (PCA) based approach  Classification-based approach towards detecting events.

PCA based approach: Motivation Observations: Well behaved days show typical signature (bell-shaped pattern) Rainy days (or periods) deviate from this signature Strong trend component from one day to the next Diurnal, trend features seen in most environmental modalities PCA is good at capturing variation in collection of similar curves

PCA – Toy Example First Principal Component Variable #1 Variable #2 Finds directions of Maximum Variance Reduces Dimensionality (truncate to first “p” directions)

Eigenmodes for Air Temperature Directions of Maximum Variance

Discriminating event, well-behaved days [5] PrecisionRecall 51.28%80% [5] : J. Gupchup, R. Burns, A. Terzis, and A. Szalay, Model-Based Event Detection in Wireless Sensor Networks, Proceedings of Workshop on Data Sharing and Interoperability on the World-Wide Sensor Web (DSI), ACM/IEEE, 2007 Well-behaved days: “Fits model well”Event day: “Large residuals”

Offline to Online  Offline  Basis locked from midnight to midnight  Access to complete 24 hour signal  Online  Access to signal up to the current hour “d”  Basis locked from hour “d” to hour “d”  Vectors cyclically shifted by “d”  Eigenvalues remain the same

Online Prediction Residuals

Summary  PCA model effective in finding informative periods  Need to know  Shift value, “d”  “sundial” [6]  But … why not use Barometric Pressure too? [6] : Jayant Gupchup, Razvan Musăloiu-E, Alex Szalay, Andreas Terzis. Sundial: Using Sunlight to Reconstruct Global Timestamps, To appear in the proceedings of the 6th European Conference on Wireless Sensor Networks (EWSN 2009)

Classification-Based Approach  2-class problem {Rainy, Sunny}  Most classifiers provide probabilities  Sample based on those probabilities

Future Work - I  Task 1: Model Improvement  Study effect (or correlation) of  Event-magnitude  Inter-Arrival Time  Explore Incremental and Robust PCA [7], [8]  Explore Label based Classifiers  Combine Air Temp, Barometric Pressure and Light Modalities (joint work with Zhiliang Ma, Dept. of Applied Math and statistics)  Task 2 : Sampling Planner  Prediction error and/or Probability of Event (PoE)  Neighbor opinion(s)  Acquisition cost of each sensor [7] : Reliable Eigenspectra for New Generation Surveys, Tamas Budavari, Vivienne Wild, Alexander S. Szalay, Laszlo Dobos, Ching-Wa Yip, MNRAS. Accepted for publication [8] : A Robust Classification of Galaxy Spectra: Dealing with Noisy and Incomplete Data, A.J. Connolly, A.S. Szalay, Astronomical Journal

Future Work - II  Task 3 : Evaluation  Define Cost and Benefit functions  Compare proposed approach with existing systems  Task 4 : Application and Extensions  Identify class of applications where the framework can be used

Questions ???

Overview: Proposed Framework Model Sampling Scheduler Update Model Mote Storage Prob (Event) Prediction Error

Properties of our PCA model  Transformation: Y = X*V  Projected variables are uncorrelated  Compression/Multi-resolution  Achieve a massive compression  From previous slide, compression ratio = 4/96 = 24X  Online Basis  Basis for any “d” to “d” hour using cyclic shifting  Re-projection error Bounds  Sum of “left out” eigenvalues

Preliminary Results  Rain prediction  Use Barometric Pressure  Simple linear classifiers perform well  Classification Accuracy towards 76%

Eigenvector 5

Online Prediction

Literature Survey  Barbie-Query (BBQ, [1])  Approximate query answering (Range, value queries)  Sensing cost differential … Energy Saving opportunities!  Predictions outside confidence interval, collect samples  Shortcomings  NOT collecting long-term environmental data  Do not consider the role played by events  PRESTO [2]  Reduce Storage costs => Reduce Communication costs  Seasonal-AutoRegressive Integrated Moving Average (S-ARIMA) [3] model for predictions  Model known to node and Basestation  When predictions within confidence bounds, do not store collected samples  Basestation can reconstruct missing samples.  Shortcomings  No adaptive sampling on interesting events [1] : Model-Driven Data Acquisition in Sensor Networks; Amol Deshpande, et al. VLDB 2004 [2] : PRESTO: Feedback-driven Data Management in Sensor Networks; Ming Li, Deepak Ganesan, and Prashant Shenoy; USENIX 2006 [3]: P.J. Brockwell, R.A. Davis. Introduction to time series and forecasting

Related Work  Near-Optimal Sensor Placement [4]  Find most informative locations to place sensors  At the same time … Keep the network connected  Solution: Information-theoretic (entropy) & Steiner tree approximation  Differences  Focus is finding informative locations in an offline fashion  Solution addresses spatial variability  Sampling rate does not change once locations are fixed [4] : A. Krause, C. Guestrin, A. Gupta, J. Kleinberg. "Near-optimal Sensor Placements: Maximizing Information while Minimizing Communication Cost". In Proc. of Information Processing in Sensor Networks (IPSN) 2006