Near-optimal Sensor Placements: Maximizing Information while Minimizing Communication Cost Andreas Krause, Carlos Guestrin, Anupam Gupta, Jon Kleinberg.

Slides:



Advertisements
Similar presentations
Nonmyopic Active Learning of Gaussian Processes An Exploration – Exploitation Approach Andreas Krause, Carlos Guestrin Carnegie Mellon University TexPoint.
Advertisements

Submodularity for Distributed Sensing Problems Zeyn Saigol IR Lab, School of Computer Science University of Birmingham 6 th July 2010.
Approximations for Min Connected Sensor Cover Ding-Zhu Du University of Texas at Dallas.
Principal Component Analysis Based on L1-Norm Maximization Nojun Kwak IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008.
Multicast in Wireless Mesh Network Xuan (William) Zhang Xun Shi.
Cost-effective Outbreak Detection in Networks Jure Leskovec, Andreas Krause, Carlos Guestrin, Christos Faloutsos, Jeanne VanBriesen, Natalie Glance.
Randomized Sensing in Adversarial Environments Andreas Krause Joint work with Daniel Golovin and Alex Roper International Joint Conference on Artificial.
Online Distributed Sensor Selection Daniel Golovin, Matthew Faulkner, Andreas Krause theory and practice collide 1.
S. J. Shyu Chap. 1 Introduction 1 The Design and Analysis of Algorithms Chapter 1 Introduction S. J. Shyu.
Carnegie Mellon Selecting Observations against Adversarial Objectives Andreas Krause Brendan McMahan Carlos Guestrin Anupam Gupta TexPoint fonts used in.
David Chu--UC Berkeley Amol Deshpande--University of Maryland Joseph M. Hellerstein--UC Berkeley Intel Research Berkeley Wei Hong--Arched Rock Corp. Approximate.
Near-Optimal Sensor Placements in Gaussian Processes Carlos Guestrin Andreas KrauseAjit Singh Carnegie Mellon University.
PCPs and Inapproximability Introduction. My T. Thai 2 Why Approximation Algorithms  Problems that we cannot find an optimal solution.
1 Stochastic Event Capture Using Mobile Sensors Subject to a Quality Metric Nabhendra Bisnik, Alhussein A. Abouzeid, and Volkan Isler Rensselaer Polytechnic.
Approximation Algorithms for Non-Uniform Buy-at-Bulk Network Design Problems Mohammad R. Salavatipour Department of Computing Science University of Alberta.
Efficient Informative Sensing using Multiple Robots
Department of Computer Science, University of Maryland, College Park, USA TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.:
Approximating Sensor Network Queries Using In-Network Summaries Alexandra Meliou Carlos Guestrin Joseph Hellerstein.
Jie Gao Joint work with Amitabh Basu*, Joseph Mitchell, Girishkumar Stony Brook Distributed Localization using Noisy Distance and Angle Information.
Distributed Regression: an Efficient Framework for Modeling Sensor Network Data Carlos Guestrin Peter Bodik Romain Thibaux Mark Paskin Samuel Madden.
Near-optimal Nonmyopic Value of Information in Graphical Models Andreas Krause, Carlos Guestrin Computer Science Department Carnegie Mellon University.
More routing protocols Alec Woo June 18 th, 2002.
Sensor placement applications Monitoring of spatial phenomena Temperature Precipitation... Active learning, Experiment design Precipitation data from Pacific.
Non-myopic Informative Path Planning in Spatio-Temporal Models Alexandra Meliou Andreas Krause Carlos Guestrin Joe Hellerstein.
Cache Placement in Sensor Networks Under Update Cost Constraint Bin Tang, Samir Das and Himanshu Gupta Department of Computer Science Stony Brook University.
Near-optimal Observation Selection using Submodular Functions Andreas Krause joint work with Carlos Guestrin (CMU)
Approximate data collection in sensor networks the appeal of probabilistic models David Chu Amol Deshpande Joe Hellerstein Wei Hong ICDE 2006 Atlanta,
Quality-Aware Segment Transmission Scheduling in Peer-to-Peer Streaming Systems Cheng-Hsin Hsu Senior Research Scientist Deutsche Telekom R&D Lab USA Los.
1 Efficient planning of informative paths for multiple robots Amarjeet Singh *, Andreas Krause +, Carlos Guestrin +, William J. Kaiser *, Maxim Batalin.
Rendezvous Planning in Mobility- assisted Wireless Sensor Networks Guoliang Xing; Tian Wang; Zhihui Xie; Weijia Jia Department of Computer Science City.
Nonmyopic Active Learning of Gaussian Processes An Exploration – Exploitation Approach Andreas Krause, Carlos Guestrin Carnegie Mellon University TexPoint.
Online Data Gathering for Maximizing Network Lifetime in Sensor Networks IEEE transactions on Mobile Computing Weifa Liang, YuZhen Liu.
1 Efficient Placement and Dispatch of Sensors in a Wireless Sensor Network Prof. Yu-Chee Tseng Department of Computer Science National Chiao-Tung University.
Mario Čagalj supervised by prof. Jean-Pierre Hubaux (EPFL-DSC-ICA) and prof. Christian Enz (EPFL-DE-LEG, CSEM) Wireless Sensor Networks:
Model-driven Data Acquisition in Sensor Networks Amol Deshpande 1,4 Carlos Guestrin 4,2 Sam Madden 4,3 Joe Hellerstein 1,4 Wei Hong 4 1 UC Berkeley 2 Carnegie.
Connected Dominating Sets in Wireless Networks My T. Thai Dept of Comp & Info Sci & Engineering University of Florida June 20, 2006.
1 Algorithms for Bandwidth Efficient Multicast Routing in Multi-channel Multi-radio Wireless Mesh Networks Hoang Lan Nguyen and Uyen Trang Nguyen Presenter:
Modeling Data-Centric Routing in Wireless Sensor Networks Bhaskar Krishnamachari, Deborah Estrin, Stephan Wicker.
Routing and Network Design: Algorithmic Issues Kamesh Munagala Duke University.
ICALP'05Stochastic Steiner without a Root1 Stochastic Steiner Trees without a Root Martin Pál Joint work with Anupam Gupta.
Approximation Algorithms for Stochastic Combinatorial Optimization Part I: Multistage problems Anupam Gupta Carnegie Mellon University.
On the Construction of Data Aggregation Tree with Minimum Energy Cost in Wireless Sensor Networks: NP-Completeness and Approximation Algorithms National.
Network Design for Information Networks Chaitanya Swamy Caltech and U. Waterloo Ara HayrapetyanÉva Tardos Cornell University.
Efficient Gathering of Correlated Data in Sensor Networks
Network Aware Resource Allocation in Distributed Clouds.
Influence Maximization in Dynamic Social Networks Honglei Zhuang, Yihan Sun, Jie Tang, Jialin Zhang, Xiaoming Sun.
1 Network Coding and its Applications in Communication Networks Alex Sprintson Computer Engineering Group Department of Electrical and Computer Engineering.
Minimum Average Routing Path Clustering Problem in Multi-hop 2-D Underwater Sensor Networks Presented By Donghyun Kim Data Communication and Data Management.
Approximation Algorithms
Improved Approximation Algorithms for the Quality of Service Steiner Tree Problem M. Karpinski Bonn University I. Măndoiu UC San Diego A. Olshevsky GaTech.
Image segmentation Prof. Noah Snavely CS1114
EE 685 presentation Utility-Optimal Random-Access Control By Jang-Won Lee, Mung Chiang and A. Robert Calderbank.
On Non-Disjoint Dominating Sets for the Lifetime of Wireless Sensor Networks Akshaye Dhawan.
Models in I.E. Lectures Introduction to Optimization Models: Shortest Paths.
Dr. Sudharman K. Jayaweera and Amila Kariyapperuma ECE Department University of New Mexico Ankur Sharma Department of ECE Indian Institute of Technology,
1 Mar 05Stochastic Steiner without a Root1 Stochastic Steiner Tree without a Root Martin Pál Joint work with Anupam Gupta.
Reliable Multicast Routing for Software-Defined Networks.
By: Gang Zhou Computer Science Department University of Virginia 1 Medians and Beyond: New Aggregation Techniques for Sensor Networks CS851 Seminar Presentation.
A* optimality proof, cycle checking CPSC 322 – Search 5 Textbook § 3.6 and January 21, 2011 Taught by Mike Chiang.
Optimal Relay Placement for Indoor Sensor Networks Cuiyao Xue †, Yanmin Zhu †, Lei Ni †, Minglu Li †, Bo Li ‡ † Shanghai Jiao Tong University ‡ HK University.
1 Power Efficient Monitoring Management in Sensor Networks A.Zelikovsky Georgia State joint work with P. BermanPennstate G. Calinescu Illinois IT C. Shah.
Tree and Forest Classification and Regression Tree Bagging of trees Boosting trees Random Forest.
Ch 1. Introduction Pattern Recognition and Machine Learning, C. M. Bishop, Updated by J.-H. Eom (2 nd round revision) Summarized by K.-I.
Slide 1 Toward Optimal Sniffer-Channel Assignment for Reliable Monitoring in Multi-Channel Wireless Networks Donghoon Shin, Saurabh Bagchi and Chih-Chun.
Confidential & Proprietary – All Rights Reserved Internal Distribution, October Quality of Service in Multimedia Distribution G. Calinescu (Illinois.
Monitoring rivers and lakes [IJCAI ‘07]
Near-optimal Observation Selection using Submodular Functions
Traveling Salesman Problems Motivated by Robot Navigation
Cost-effective Outbreak Detection in Networks
Near-Optimal Sensor Placements in Gaussian Processes
Presentation transcript:

Near-optimal Sensor Placements: Maximizing Information while Minimizing Communication Cost Andreas Krause, Carlos Guestrin, Anupam Gupta, Jon Kleinberg

Monitoring of spatial phenomena Building automation (Lighting, heat control) Weather, traffic prediction, drinking water quality... Fundamental problem: Where should we place the sensors? Temperature data from sensor network Light data from sensor network Precipitation data from Pacific NW

Trade-off: Information vs. communication cost efficient communication! extra node The “closer” the sensors:The “farther” the sensors: worse information quality!  better information quality! worse communication!  We want to optimally trade-off information quality and communication cost!

Predicting spatial phenomena from sensors Can only measure where we have sensors Multiple sensors can be used to predict phenomenon at uninstrumented locations A regression problem: Predict phenomenon based on location Temp here? X 1 =21 C X 3 =26 C X 2 =22 C 23 C

Predicted temperature throughout the space x y Temp. (C) Regression models for spatial phenomena Real deployment of temperature sensors measurements from 52 sensors (black dots) x y many sensors around ! trust estimate here few sensors around ! don’t trust estimate Good sensor placements: Trust estimate everywhere! Data collected at Intel Research Berkeley

Probabilistic models for spatial phenomena x y sensor locations x y Temp. (C) regression model yx variance estimate uncertainty in prediction many sensors around ! trust estimate here few sensors around ! don’t trust estimate Modeling uncertainty is fundamental! We use a rich probabilistic model Gaussian process, a non-parametric model [O'Hagan ’78] Learned from pilot data or expert knowledge Learning model is well-understood ! focus talk on optimizing sensor locations

Pick locations A with highest information quality lowest “uncertainty” after placing sensors measured in terms of entropy of the posterior distribution x y sensor placement A (a set of locations) uncertainty in prediction after placing sensors y x uncertainty information quality I(A) (a number) placement A I(A) = 10 placement B I(B) = 4 Information quality

The placement problem Let V be finite set of locations to choose from For subset of locations A µ V, let I(A) be information quality and C(A) be communication cost of placement A Want to optimize min C(A) subject to I(A) ¸ Q Q>0 is information quota How do we measure communication cost?

Communication Cost Message loss requires retransmission This depletes the sensor’s battery quickly Communication cost for two sensors means expected number of transmissions (ETX) Communication cost for placement is sum of all ETXs along routing tree ETX 1.2 ETX 1.4 ETX 2.1 ETX 1.6 ETX 1.9 Total cost = 8.2 Many other criteria possible in our approach (e.g. number of sensors, path length of a robot, …) Modeling and predicting link quality hard! We use probabilistic models (Gaussian Processes for classification) ! Come to our demo on Thursday!

We propose: The pSPIEL Algorithm pSPIEL: Efficient, randomized algorithm (padded Sensor Placements at Informative and cost-Effective Locations) In expectation, both information quality and communication cost are close to optimum Built system using real sensor nodes for sensor placement using pSPIEL Evaluated on real-world placement problems

Minimizing communication cost while maximizing information quality V – set of possible locations For each pair, cost is ETX Select placement A µ V, such that: tree connecting A is cheapest min A C(A) C(A)= locations are informative: I(A) ¸ Q I(A) = I( ETX = 3 ETX = 10 ETX = 1.3 ETX 12 ETX […)[…) +++ … [ A1A1 A4A4 A8A8 [ First: simplified case, where each sensor provides independent information: I(A) = I(A 1 ) + I(A 2 ) + I(A 3 ) + I(A 4 ) + …

Quota Minimum Steiner Tree (Q-MST) Problem Problem: Each node A i has a reward I (A i ) Find the cheapest tree that collects at least Q reward: but very well studied [Blum, Garg, …] NP-hard…  Constant factor 2 approximation algorithm available! =10 =12 =8 I(A) = I(A 1 ) + I(A 2 ) + I(A 3 ) + I(A 4 ) + …I(A 1 )I(A 4 )I(A 2 )I(A 3 ) ++ + … ¸ Q Perhaps could use to solve our problem!!!

I(B) A1A1 A2A2 B2B2 B1B1 I(A) Are we done? Q-MST algorithm works if I(A) is modular, i.e., if A and B disjoint, I(A [ B)=I(A)+I(B) Makes no sense for sensor placement! Close by sensors are not independent For sensor placement, I is submodular I(A [ B) · I(A)+I(B) [Guestrin, K., Singh ICML 05] “Sensing regions” overlap, I(A [ B) < I(A) + I(B)

Must solve a new problem Want to optimize min C(A) subject to I(A) ¸ Q if sensors provide independent information I(A) = I(A 1 ) + I(A 2 ) + I(A 3 ) + … a modular problem solve with Q-MST but info not independent  sensors provide submodular information I(A 1 [ A 2 ) · I(A 1 ) + I(A 2 ) a new open problem! submodular steiner tree strictly harder than Q-MST generalizes existing problems e.g., group steiner Insight: our sensor problem has additional structure!

Locality If A, B are placements closeby, then I(A [ B) < I(A) + I(B) If A, B are placements, at least r apart, then I(A [ B) ¼ I(A) + I(B) Sensors that are far apart are approximately independent We showed locality is empirically valid! A1A1 I(B) B1B1 B2B2 r A2A2 I(A)

Our approach: pSPIEL approximate by a modular problem: for nodes A sum of rewards A ¼ I(A) submodular steiner tree with locality I(A 1 [ A 2 ) · I(A 1 ) + I(A 2 ) solve modular approximation with Q-MST obtain solution of original problem (prove it’s good) use off-the-shelf Q-MST solver

C1C1 C2C2 C3C3 C4C4 pSPIEL: an overview ¸ r diameter ·  r Build small, well- separated clusters over possible locations [Gupta et al ‘03] discard rest (doesn’t hurt) Information additive between clusters! locality!!! Don’t care about comm. within cluster (small) Use Q-MST to decide which nodes to use from each cluster and how to connect them

Our approach: pSPIEL approximate by a modular problem (MAG): for nodes A sum of rewards A ¼ I(A) submodular steiner tree with locality I(A 1 [ A 2 ) · I(A 1 ) + I(A 2 ) solve modular approximation with Q-MST obtain solution of original problem (prove it’s good) use off-the-shelf Q-MST solver

C1C1 C2C2 C4C4 C3C3 G 1,1 G 2,1 G 4,1 G 3,1 G 1,2 G 1,3 G 2,2 G 2,3 G 4,2 G 4,3 G 4,4 G 3,2 G 3,3 G 3,4 pSPIEL: Step 3 modular approximation graph Order nodes in “order of informativeness” Build a modular approximation graph (MAG) edge weights and node rewards ! solution in MAG ¼ solution of original problem Cost: C( G 2,1 [ G 2,2 [ G 3,1 [ G 4,1 [ G 4,2 ) ¼ w 4,1–4,2 w 3,1–4,1 w 2,1–3,1 w 2,1–2,2 +++ Info: I( G 2,1 [ G 2,2 [ G 3,1 [ G 4,1 [ G 4,2 ) ¼ most importantly, additive rewards: R( G 4,2 ) R( G 4,1 ) R( G 3,1 ) R( G 2,2 ) R( G 2,1 ) if we were to solve Q-MST in MAG: To learn how rewards are computed, come to our demo! C1C1 C2C2 C4C4 C3C3 G 2,1 G 1,2 G 2,3

use off-the-shelf Q-MST solver Our approach: pSPIEL approximate by a modular problem (MAG): for nodes A sum of rewards A ¼ I(A) submodular steiner tree I(A 1 [ A 2 ) · I(A 1 ) + I(A 2 ) solve modular approximation with Q-MST obtain solution of original problem (prove it’s good)

C1C1 C2C2 C3C3 C4C4 C1C1 C2C2 C3C3 C4C4 pSPIEL: Using Q-MST tree in MAG ! solution in original graph Q-MST on MAG ! solution to original problem!

Our approach: pSPIEL approximate by a modular problem (MAG): for nodes A sum of rewards A ¼ I(A) submodular steiner tree I(A 1 [ A 2 ) · I(A 1 ) + I(A 2 ) solve modular approximation with Q-MST obtain solution of original problem (prove it’s good) use off-the-shelf Q-MST solver

Theorem: pSPIEL finds a placement A with info. quality I(A) ¸  (  ) OPT quality, comm. cost C(A) · O (r log | V|) OPT cost r depends on locality property Guarantees for sensor placement log factor approx. comm. cost const. factor approx. info.

Summary of our approach 1. Use small, short-term “bootstrap” deployment to collect some data (or use expert knowledge) 2. Learn/Compute models for information quality and communication cost 3. Optimize tradeoff between information quality and communication cost using pSPIEL 4. Deploy sensors 5. If desired, collect more data and continue with step 2

We implemented this… Implemented using Tmote Sky motes Collect measurement and link information and send to base station We can now deploy nodes, learn models and come up with placements! See our demo on Thursday!!

Proof of concept study Learned model from short deployment of 46 sensors at the Intelligent Workplace Time learned GPs for light field & link qualities deployed 2 sets of sensors: pSPIEL and manually selected locations evaluated both deployments on 46 locations accuracy CMU’s Intelligent Workplace

Proof of concept study Manual (M20) pSPIEL (pS19) pSPIEL (pS12) Root mean squares error (Lux) better accuracy on 46 locations better Communication cost (ETX) M20 pS19 pS12 pSPIEL improve solution over intuitive manual placement: 50% better prediction and 20% less comm. cost, or 20% better prediction and 40% less comm. cost Poor placements can hurt a lot! Good solution can be unintuitive

Comparison with heuristics Temperature data from sensor network 16 placement locations More expensive (ETX) Roughly number of sensors Higher information quality Optimal solution

Comparison with heuristics Temperature data from sensor network More expensive (ETX) Roughly number of sensors Higher information quality Optimal solution Greedy- Connect Temperature data from sensor network 16 placement locations Greedy-Connect: Maximizes information quality, then connects nodes

Comparison with heuristics Temperature data from sensor network More expensive (ETX) Roughly number of sensors Higher information quality Optimal solution Greedy- Connect Cost-benefit Greedy Temperature data from sensor network 16 placement locations Greedy-Connect: Maximizes information quality, then connects nodes Cost-benefit greedy: Grows clusters optimizing benefit-cost ratio info. / comm.

Comparison with heuristics Temperature data from sensor network More expensive (ETX) Roughly number of sensors Higher information quality Optimal solution pSPIEL Greedy- Connect Cost-benefit Greedy Temperature data from sensor network 16 placement locations pSPIEL is significantly closer to optimal solution similar information quality at 40% less comm. cost! Greedy-Connect: Maximizes information quality, then connects nodes Cost-benefit greedy: Grows clusters optimizing benefit-cost ratio info. / comm.

More expensive (ETX) Higher information quality pSPIEL Greedy- Connect Cost-benefit Greedy Comparison with heuristics Precipitation data 167 locations Temperature data 100 locations 80 More expensive (ETX) Sweet spot of pSPIEL pSPIEL outperforms heuristics Sweet spot captures important region: just enough sensors to capture spatial phenomena Greedy-Connect: Maximizes information quality, then connects nodes Cost-benefit greedy: Grows clusters optimizing benefit-cost ratio info. / comm.

Conclusions Unified approach for deploying wireless sensor networks – uncertainty is fundamental Data-driven models for phenomena and link qualities pSPIEL: Efficient, randomized algorithm optimizes tradeoff: info. quality and comm. cost guaranteed to be close to optimum Built a complete system on Tmote Sky motes, deployed sensors, evaluated placements pSPIEL significantly outperforms alternative methods