1 Querying Sensor Networks Sam Madden UC Berkeley.

Slides:



Advertisements
Similar presentations
Directed Diffusion for Wireless Sensor Networking
Advertisements

IN-NETWORK VS CENTRALIZED PROCESSING FOR LIGHT DETECTION SYSTEM USING WIRELESS SENSOR NETWORKS Presentation by, Desai, Bhairav Solanki, Arpan.
한국기술교육대학교 컴퓨터 공학 김홍연 TinyDB : An Acquisitional Query Processing System for Sensor Networks. - Samuel R. Madden, Michael J. Franklin, Joseph M. Hellerstein,
1 Sensor Network Databases Ref: Wireless sensor networks---An information processing approach Feng Zhao and Leonidas Guibas (chapter 6)
Programming Vast Networks of Tiny Devices David Culler University of California, Berkeley Intel Research Berkeley
1 Implementation and Research Issues in Query Processing for Wireless Sensor Networks Wei Hong Intel Research, Berkeley Sam Madden.
DNA Research Group 1 CountTorrent: Ubiquitous Access to Query Aggregates in Dynamic and Mobile Sensor Networks Abhinav Kamra, Vishal Misra and Dan Rubenstein.
1 Supporting Aggregate Queries Over Ad-Hoc Wireless Sensor Networks Samuel Madden UC Berkeley With Robert Szewczyk, Michael Franklin, and David Culler.
Scaling Down Robert Grimm New York University. Scaling Down in One Slide  Target devices (roughly)  Small form factor  Battery operated  Wireless.
The Cougar Approach to In-Network Query Processing in Sensor Networks By Yong Yao and Johannes Gehrke Cornell University Presented by Penelope Brooks.
Reconfigurable Sensor Networks Chris Elliott Honours in Digital Systems Charles Greif and Nandita Bhattacharjee.
Tributaries and Deltas: Efficient and Robust Aggregation in Sensor Network Streams Amit Manjhi, Suman Nath, Phillip B. Gibbons Carnegie Mellon University.
Aggregation in Sensor Networks NEST Weekly Meeting Sam Madden Rob Szewczyk 10/4/01.
Sensor Database: Querying Sensor Networks Yinghua Wu, Haiyong Xie.
A Survey of Wireless Sensor Network Data Collection Schemes by Brett Wilson.
Approximate data collection in sensor networks the appeal of probabilistic models David Chu Amol Deshpande Joe Hellerstein Wei Hong ICDE 2006 Atlanta,
Probabilistic Data Aggregation Ling Huang, Ben Zhao, Anthony Joseph Sahara Retreat January, 2004.
Taming the Underlying Challenges of Reliable Multihop Routing in Sensor Networks.
Aggregate Query Processing in Ad-Hoc Sensor Networks Yong Yao Database lunch, Apr. 15th.
Sensor Networks: Implications for Database Systems and Vice-Versa Michael Franklin January UCB Sensor Day.
1 Acquisitional Query Processing in TinyDB Sam Madden UC Berkeley NEST Winter Retreat 2003.
The Design of an Acquisitional Query Processor For Sensor Networks Samuel Madden, Michael J. Franklin, Joseph M. Hellerstein, and Wei Hong Presentation.
Model-driven Data Acquisition in Sensor Networks Amol Deshpande 1,4 Carlos Guestrin 4,2 Sam Madden 4,3 Joe Hellerstein 1,4 Wei Hong 4 1 UC Berkeley 2 Carnegie.
CS 580S Sensor Networks and Systems Professor Kyoung Don Kang Lecture 7 February 13, 2006.
TAG: A TINY AGGREGATION SERVICE FOR AD-HOC SENSOR NETWORKS Presented by Akash Kapoor SAMUEL MADDEN, MICHAEL J. FRANKLIN, JOSEPH HELLERSTEIN, AND WEI HONG.
T AG : A TINY AGGREGATION SERVICE FOR AD - HOC SENSOR NETWORKS Samuel Madden, Michael J. Franklin, Joseph Hellerstein, and Wei Hong Presented by – Mahanth.
15-744: Computer Networking L-13 Sensor Networks.
1 Implementation and Research Issues in Query Processing for Wireless Sensor Networks Wei Hong Intel Research, Berkeley Sam Madden.
TAG: a Tiny Aggregation Service for Ad-Hoc Sensor Networks Paper By : Samuel Madden, Michael J. Franklin, Joseph Hellerstein, and Wei Hong Instructor :
CS2510 Fault Tolerance and Privacy in Wireless Sensor Networks partially based on presentation by Sameh Gobriel.
INF5100 Autumn 2007 © Ellen Munthe-Kaas and Jarle Søberg 1 Data Management in Sensor Networks Ellen Munthe-Kaas Jarle Søberg.
The Design of an Acquisitional Query Processor For Sensor Networks Samuel Madden, Michael J. Franklin, Joseph M. Hellerstein, and Wei Hong.
TinyOS By Morgan Leider CS 411 with Mike Rowe with Mike Rowe.
Sensor Network Databases1 Overview: Chapter 6  Sensor Network Databases  Sensor networks are conceptually a distributed DB  Store collected data  Indexes.
March 6th, 2008Andrew Ofstad ECE 256, Spring 2008 TAG: a Tiny Aggregation Service for Ad-Hoc Sensor Networks Samuel Madden, Michael J. Franklin, Joseph.
15-744: Computer Networking L-13 Sensor Networks.
1 Pradeep Kumar Gunda (Thanks to Jigar Doshi and Shivnath Babu for some slides) TAG: a Tiny Aggregation Service for Ad-Hoc Sensor Networks Samuel Madden,
TAG: a Tiny Aggregation Service for Ad-Hoc Sensor Networks Authors: Samuel Madden, Michael Franklin, Joseph Hellerstein Presented by: Vikas Motwani CSE.
1 TAG: A Tiny Aggregation Service for Ad-Hoc Sensor Networks Samuel Madden UC Berkeley with Michael Franklin, Joseph Hellerstein, and Wei Hong December.
1 Querying Sensor Networks Sam Madden UC Berkeley December 13 th, New England Database Seminar.
INT 598 Data Management for Sensor Networks Silvia Nittel Spatial Information Science & Engineering University of Maine Fall 2006.
The Design of an Acquisitional Query Processor for Sensor Networks CS851 Presentation 2005 Presented by: Gang Zhou University of Virginia.
1 Fjording The Stream An Architecture for Queries over Streaming Sensor Data Samuel Madden, Michael Franklin UC Berkeley.
한국기술교육대학교 컴퓨터 공학 김홍연 Habitat Monitoring with Sensor Networks DKE.
Query Processing for Sensor Networks Yong Yao and Johannes Gehrke (Presentation: Anne Denton March 8, 2003)
Supporting Aggregate Queries Over Ad-Hoc Wireless Sensor Networks.
REED: Robust, Efficient Filtering and Event Detection in Sensor Networks Daniel Abadi, Samuel Madden, Wolfgang Lindner MIT United States VLDB 2005.
1 REED: Robust, Efficient Filtering and Event Detection in Sensor Networks Daniel Abadi, Samuel Madden, Wolfgang Lindner MIT United States VLDB 2005.
ResTAG: Resilient Event Detection with TinyDB Angelika Herbold -Western Washington University Thierry Lamarre -ENSEIRB Systems Software Laboratory, OGI.
1 Report of Advanced Data Base Topics Project Instructor : Dr. rahgozar euhanna ghadimi, Ali abbasi, kave pashaii Data Storage selection in sensor networks.
Aggregation and Secure Aggregation. Learning Objectives Understand why we need aggregation in WSNs Understand aggregation protocols in WSNs Understand.
W. Hong & S. Madden – Implementation and Research Issues in Query Processing for Wireless Sensor Networks, ICDE 2004.
In-Network Query Processing on Heterogeneous Hardware Martin Lukac*†, Harkirat Singh*, Mark Yarvis*, Nithya Ramanathan*† *Intel.
INF5100 Autumn 2008 © Ellen Munthe-Kaas and Jarle Søberg 1 Data Management in Sensor Networks Ellen Munthe-Kaas Jarle Søberg.
Aggregation and Secure Aggregation. [Aggre_1] Section 12 Why do we need Aggregation? Sensor networks – Event-based Systems Example Query: –What is the.
Sep Multiple Query Optimization for Wireless Sensor Networks Shili Xiang Hock Beng Lim Kian-Lee Tan (ICDE 2007) Presented by Shan Bai.
1 TAG: A Tiny Aggregation Service for Ad-Hoc Sensor Networks Samuel Madden UC Berkeley with Michael Franklin, Joseph Hellerstein, and Wei Hong December.
The Design of an Acquisitional Query Processor For Sensor Networks Samuel Madden, Michael J. Franklin, Joseph M. Hellerstein, and Wei Hong Presentation.
MAC Protocols for Sensor Networks
TAG: a Tiny AGgregation service for ad-hoc sensor networks Authors: Samuel Madden, Michael J. Franklin, Joseph M. Hellerstein, Wei Hong Presenter: Mingwei.
Demetrios Zeinalipour-Yazti (Univ. of Cyprus)
Querying Sensor Networks
Distributed database approach,
The Design of an Acquisitional Query Processor For Sensor Networks
Querying Sensor Networks
Distributing Queries Over Low Power Sensor Networks
Querying Sensor Networks
Data-Centric Networking
REED : Robust, Efficient Filtering and Event Detection
Aggregation.
Presentation transcript:

1 Querying Sensor Networks Sam Madden UC Berkeley

2 Sensor Networks Small computers with: –Radios –Sensing hardware –Batteries Remote deployments –Long lived –10s, 100s, or 1000s Battery Pack Smart Sensor, aka “Mote”

3 Motes Mica Mote 4Mhz, 8 bit Atmel RISC uProc 40 kbit Radio 4 K RAM, 128 K Program Flash, 512 K Data Flash AA battery pack Based on TinyOS* *Hill, Szewczyk, Woo, Culler, & Pister. “Systems Architecture Directions for Networked Sensors.” ASPLOS

4 Sensor Net Sample Apps Traditional monitoring apparatus. Earthquake monitoring in shake- test sites. Vehicle detection: sensors along a road, collect data about passing vehicles. Habitat Monitoring: Storm petrels on Great Duck Island, microclimates on James Reserve.

5 Programming Sensor Nets Is Hard –Months of lifetime required from small batteries »3-5 days naively; can’t recharge often »Interleave sleep with processing –Lossy, low-bandwidth, short range communication »Nodes coming and going »~20% 5m »Multi-hop –Remote, zero administration deployments –Highly distributed environment –Limited Development Tools »Embedded, LEDs for Debugging! Need high level abstractions! instructions per bit transmitted! High-Level Abstraction Is Needed!

6 A Solution: Declarative Queries Users specify the data they want –Simple, SQL-like queries –Using predicates, not specific addresses –Same spirit as Cougar – Our system: TinyDB Challenge is to provide: –Expressive & easy-to-use interface –High-level operators »Well-defined interactions »“Transparent Optimizations” that many programmers would miss Sensor-net specific techniques –Power efficient execution framework Question: do sensor networks change query processing? Yes!

7 Overview TinyDB: Queries for Sensor Nets Processing Aggregate Queries (TAG) Taxonomy & Experiments Acquisitional Query Processing Other Research Future Directions

8 Overview TinyDB: Queries for Sensor Nets Processing Aggregate Queries (TAG) Taxonomy & Experiments Acquisitional Query Processing Other Research Future Directions

9 TinyDB Demo

10 TinyOS Schema Query Processor Multihop Network TinyDB Architecture Schema: “Catalog” of commands & attributes Filter light > 400 get (‘temp’) Agg avg(temp) Queries SELECT AVG(temp) WHERE light > 400 Results T:1, AVG: 225 T:2, AVG: 250 TablesSamples got(‘temp’) Name: temp Time to sample: 50 uS Cost to sample: 90 uJ Calibration Table: 3 Units: Deg. F Error: ± 5 Deg F Get f : getTempFunc() … getTempFunc(…)TinyDB ~10,000 Lines Embedded C Code ~5,000 Lines (PC-Side) Java ~3200 Bytes RAM (w/ 768 byte heap) ~58 kB compiled code (3x larger than 2 nd largest TinyOS Program)

11 Declarative Queries for Sensor Networks Examples: SELECT nodeid, nestNo, light FROM sensors WHERE light > 400 EPOCH DURATION 1s 1 EpochNodeidnestNoLight Sensors “Find the sensors in bright nests.”

12 Aggregation Queries EpochregionCNT(…)AVG(…) 0North3360 0South3520 1North3370 1South3520 “Count the number occupied nests in each loud region of the island.” SELECT region, CNT(occupied) AVG(sound) FROM sensors GROUP BY region HAVING AVG(sound) > 200 EPOCH DURATION 10s 3 Regions w/ AVG(sound) > 200 SELECT AVG(sound) FROM sensors EPOCH DURATION 10s 2

13 Overview TinyDB: Queries for Sensor Nets Processing Aggregate Queries (TAG) Taxonomy & Experiments Acquisitional Query Processing Other Research Future Directions

14 Tiny Aggregation (TAG) In-network processing of aggregates –Common data analysis operation »Aka gather operation or reduction in || programming –Communication reducing »Operator dependent benefit –Across nodes during same epoch Exploit query semantics to improve efficiency! Madden, Franklin, Hellerstein, Hong. Tiny AGgregation (TAG), OSDI 2002.

15 Query Propagation Via Tree- Based Routing Tree-based routing –Used in: »Query delivery »Data collection –Topology selection is important; e.g. »Krishnamachari, DEBS 2002, Intanagonwiwat, ICDCS 2002, Heidemann, SOSP 2001 »LEACH/SPIN, Heinzelman et al. MOBICOM 99 »SIGMOD 2003 –Continuous process »Mitigates failures A B C D F E Q:SELECT … Q Q Q Q Q Q Q Q Q QQ Q R:{…}

16 Basic Aggregation In each epoch: –Each node samples local sensors once –Generates partial state record (PSR) »local readings »readings from children –Outputs PSR during assigned comm. interval At end of epoch, PSR for whole network output at root New result on each successive epoch Extras: –Predicate-based partitioning via GROUP BY

17 Illustration: Aggregation Sensor # Interval # Interval 4 SELECT COUNT(*) FROM sensors Epoch

18 Illustration: Aggregation Sensor # Interval 3 SELECT COUNT(*) FROM sensors Interval #

19 Illustration: Aggregation Sensor # Interval 2 SELECT COUNT(*) FROM sensors Interval #

20 Illustration: Aggregation Sensor # SELECT COUNT(*) FROM sensors Interval 1 Interval #

21 Illustration: Aggregation Sensor # SELECT COUNT(*) FROM sensors Interval 4 Interval #

22 Interval Assignment: An Approach SELECT COUNT(*)… 4 intervals / epoch Interval # = Level 4 3 Level = 1 2 Epoch Comm Interval Z Z Z Z Z Z Z Z Z Z Z Z Z Z Z Z Z Z Z Z Z Z Z Z Z Z Z Z Z Z Z Z Z Z Z Z Z Z Z Z Z Z Z Z Z Z Z Z Z Z Z Z Z Z Z Z Z L T L T L T T L T LL Pipelining: Increase throughput by delaying result arrival until a later epoch Madden, Szewczyk, Franklin, Culler. Supporting Aggregate Queries Over Ad-Hoc Wireless Sensor Networks. WMCSA CSMA for collision avoidance Time intervals for power conservation Many variations( e.g. Yao & Gehrke, CIDR 2003 ) Time Sync (e.g. Elson & Estrin OSDI 2002)

23 Aggregation Framework As in extensible databases, we support any aggregation function conforming to: Agg n ={f init, f merge, f evaluate } F init {a 0 }  F merge {, }  F evaluate { }  aggregate value Example: Average AVG init {v}  AVG merge {, }  AVG evaluate { }  S/C Partial State Record (PSR) Restriction: Merge associative, commutative

24 Types of Aggregates SQL supports MIN, MAX, SUM, COUNT, AVERAGE Any function over a set can be computed via TAG In network benefit for many operations –E.g. Standard deviation, top/bottom N, spatial union/intersection, histograms, etc. –Compactness of PSR

25 Overview TinyDB: Queries for Sensor Nets Processing Aggregate Queries (TAG) Taxonomy & Experiments Acquisitional Query Processing Other Research Future Directions

26 Simulation Environment Evaluated TAG via simulation Coarse grained event based simulator –Sensors arranged on a grid –Two communication models »Lossless: All neighbors hear all messages »Lossy: Messages lost with probability that increases with distance Communication (message counts) as performance metric

27 Taxonomy of Aggregates TAG insight: classify aggregates according to various functional properties –Yields a general set of optimizations that can automatically be applied Properties Partial State Monotonicity Exemplary vs. Summary Duplicate Sensitivity Drives an API!

28 Partial State Growth of PSR vs. number of aggregated values (n) –Algebraic: |PSR| = 1 (e.g. MIN) –Distributive: |PSR| = c (e.g. AVG) –Holistic: |PSR| = n (e.g. MEDIAN) –Unique: |PSR| = d (e.g. COUNT DISTINCT) »d = # of distinct values –Content Sensitive: |PSR| < n (e.g. HISTOGRAM) PropertyExamplesAffects Partial StateMEDIAN : unbounded, MAX : 1 record Effectiveness of TAG “Data Cube”, Gray et. al

29 Benefit of In-Network Processing Simulation Results 2500 Nodes 50x50 Grid Depth = ~10 Neighbors = ~20 Uniform Dist. Aggregate & depth dependent benefit!HolisticUnique Distributive Algebraic

30 Monotonicity & Exemplary vs. Summary PropertyExamplesAffects Partial State MEDIAN : unbounded, MAX : 1 record Effectiveness of TAG Monotonicity COUNT : monotonic AVG : non-monotonic Hypothesis Testing, Snooping Exemplary vs. Summary MAX : exemplary COUNT: summary Applicability of Sampling, Effect of Loss

31 Channel Sharing (“Snooping”) Insight: Shared channel can reduce communication Suppress messages that won’t affect aggregate –E.g., MAX –Applies to all exemplary, monotonic aggregates Only snoop in listen/transmit slots –Future work: explore snooping/listening tradeoffs

32 Hypothesis Testing Insight: Guess from root can be used for suppression –E.g. ‘MIN < 50’ –Works for monotonic & exemplary aggregates »Also summary, if imprecision allowed How is hypothesis computed? –Blind or statistically informed guess –Observation over network subset

33 Experiment: Snooping vs. Hypothesis Testing Uniform Value Distribution Dense Packing Ideal Communication Pruning in Network Pruning at Leaves

34 Duplicate Sensitivity PropertyExamplesAffects Partial State MEDIAN : unbounded, MAX : 1 record Effectiveness of TAG Monotonicity COUNT : monotonic AVG : non-monotonic Hypothesis Testing, Snooping Exemplary vs. Summary MAX : exemplary COUNT: summary Applicability of Sampling, Effect of Loss Duplicate Sensitivity MIN : dup. insensitive, AVG : dup. sensitive Routing Redundancy

35 Use Multiple Parents Use graph structure –Increase delivery probability with no communication overhead For duplicate insensitive aggregates, or Aggs expressible as sum of parts –Send (part of) aggregate to all parents »In just one message, via multicast –Assuming independence, decreases variance SELECT COUNT(*) A BC R A BC c R P(link xmit successful) = p P(success from A->R) = p 2 E(cnt) = c * p 2 Var(cnt) = c 2 * p 2 * (1 – p 2 )  V # of parents = n E(cnt) = n * (c/n * p 2 ) Var(cnt) = n * (c/n) 2 * p 2 * (1 – p 2 ) = V/n A BC c/n R n = 2

36 Multiple Parents Results Better than previous analysis expected! Losses aren’t independent! Insight: spreads data over many links Critical Link! No Splitting With Splitting

37 Taxonomy Related Insights Communication Reducing –In-network Aggregation (Partial State) –Hypothesis Testing (Exemplary & Monotonic) –Snooping (Exemplary & Monotonic) –Sampling Quality Increasing –Multiple Parents (Duplicate Insensitive) –Child Cache

38 TAG Contributions Simple but powerful data collection language –Vehicle tracking: SELECT ONEMAX(mag,nodeid) EPOCH DURATION 50ms Distributed algorithm for in-network aggregation –Communication Reducing –Power Aware »Integration of sleeping, computation –Predicate-based grouping Taxonomy driven API –Enables transparent application of techniques to »Improve quality (parent splitting) »Reduce communication (snooping, hypo. testing)

39 Overview TinyDB: Queries for Sensor Nets Processing Aggregate Queries (TAG) Taxonomy & Experiments Acquisitional Query Processing Other Research Future Directions

40 Acquisitional Query Processing (ACQP) Closed world assumption does not hold –Could generate an infinite number of samples An acqusitional query processor controls –when, –where, –and with what frequency data is collected! Versus traditional systems where data is provided a priori Madden, Franklin, Hellerstein, and Hong. The Design of An Acqusitional Query Processor. SIGMOD, 2003 (to appear).

41 ACQP: What’s Different? How should the query be processed? –Sampling as a first class operation –Event – join duality How does the user control acquisition? –Rates or lifetimes –Event-based triggers Which nodes have relevant data? –Index-like data structures Which samples should be transmitted? –Prioritization, summary, and rate control

42 E(sampling mag) >> E(sampling light) 1500 uJ vs. 90 uJ Operator Ordering: Interleave Sampling + Selection SELECT light, mag FROM sensors WHERE pred1(mag) AND pred2(light) EPOCH DURATION 1s  (pred1)  (pred2) mag light  (pred1)  (pred2) mag light  (pred1)  (pred2) mag light Traditional DBMS ACQP At 1 sample / sec, total power savings could be as much as 3.5mW  Comparable to processor! Correct ordering (unless pred1 is very selective and pred2 is not): Cheap Costly

43 Exemplary Aggregate Pushdown SELECT WINMAX(light,8s,8s) FROM sensors WHERE mag > x EPOCH DURATION 1s Novel, general pushdown technique Mag sampling is the most expensive operation!  WINMAX  (mag>x) mag light Traditional DBMS light mag  (mag>x)  WINMAX  (light > MAX) ACQP

44 Lifetime Queries Lifetime vs. sample rate SELECT … EPOCH DURATION 10 s SELECT … LIFETIME 30 days Extra: Allow a MAX SAMPLE PERIOD –Discard some samples –Sampling cheaper than transmitting

45 (Single Node) Lifetime Prediction

46 Overview TinyDB: Queries for Sensor Nets Processing Aggregate Queries (TAG) Taxonomy & Experiments Acquisitional Query Processing Other Research Future Directions

47 Sensor Network Challenge Problems Temporal aggregates Sophisticated, sensor network specific aggregates –Isobar Finding –Vehicle Tracking –Lossy compression »Wavelets Hellerstein, Hong, Madden, and Stanek. Beyond Average. IPSN 2003 (to appear) “Isobar Finding”

48 Additional Research Sensors, TinyDB, TinyOS –This Talk: »TAG (OSDI 2002) »ACQP (SIGMOD 2003) »WMCSA 2002 »IPSN 2003 –TOSSIM. Levis, Lee, Woo, Madden, & Culler. (In submission) –TinyOS contributions: memory allocator, catalog, network reprogramming, OS support, releases, TinyDB

49 Other Research (Cont) Stream Query Processing –CACQ (SIGMOD 2002) »Madden, Shah, Hellerstein, & Raman –Fjords (ICDE 2002) »Madden & Franklin –Java Experiences Paper (SIGMOD Record, December 2001) »Shah, Madden, Franklin, and Hellerstein –Telegraph Project, FFF & ACM1 Demos »Telegraph Team

50 TinyDB Deployments Initial efforts: –Network monitoring –Vehicle tracking Ongoing deployments: –Environmental monitoring –Generic Sensor Kit –Building Monitoring –Golden Gate Bridge

51 Overview TinyDB: Queries for Sensor Nets Processing Aggregate Queries (TAG) Taxonomy & Experiments Acquisitional Query Processing Other Research Future Directions

52 TinyDB Future Directions Expressing lossiness –No longer a closed world! Additional Operations –Joins –Signal Processing Integration with Streaming DBMS –In-network vs. external operations Heterogeneous Nodes and Operators Real Deployments

53 Contributions & Summary Declarative Queries via TinyDB –Simple, data-centric programming abstraction –Known to work for monitoring, tracking, mapping Sensor network contributions –Network as a single queryable entity –Power-aware, in-network query processing –Taxonomy: Extensible aggregate optimizations Query processing contributions –Acquisitional Query Processing –Framework for new issues in acquisitional systems, e.g.: »Sampling as an operator »Languages, indices, approximations to control when, where, and what data is acquired + processed by the system Consideration of database, network, and device issues

54 Questions?