DIMENSIONS: Why do we need a new Data Handling architecture for sensor networks? Deepak Ganesan, Deborah Estrin (UCLA), John Heidemann (USC/ISI). Presenter: Vijay Sundaram

Presentation transcript:

1 DIMENSIONS: Why do we need a new Data Handling architecture for sensor networks? Deepak Ganesan, Deborah Estrin (UCLA), John Heidemann (USC/ISI) Presenter: Vijay Sundaram

2 Deployment: Microclimate monitoring at James Reserve Park (UC Riverside)
Example requests to the weather sensor network:
  How well does the data fit the model of temperature variation with altitude?
  Send a robotic agent to the edge between the low- and high-precipitation regions.
  Get detailed data from the node with maximum precipitation from Sept to Dec 2003.
  Hmm… I wonder why packet loss is so high. Get a connectivity map of the network for all transmit power settings.

3 Goals
Flexible spatio-temporal querying
  Provide the ability to mine for interesting patterns and features in data.
  Drill down on details.
Distributed long-term networked data storage
  Preserve the ability for long-term data mining while catering to node storage constraints.
Performance
  Reasonable accuracy for a wide range of queries.
  Low communication (energy) overhead.

4 How can we achieve goals?
Exploit redundancy in data
  Potentially huge gains from lossy compression that exploits spatio-temporal correlation.
Exploit rarity of interesting features
  Preserve only interesting features.
Exploit scale of sensor network
  Large distributed storage, although local storage is limited.
Exploit low cost of approximate query processing
  Allow approximate query processing that obtains sufficiently accurate responses.

5 Can existing systems satisfy design goals?
[Figure: Data Correlation vs. Decentralization. Axes: exploited data correlation (none, temporal, spatial) against degree of decentralization (centralized, hierarchical, fully distributed). Systems plotted: P2P (DHT, Gnutella), Web Caches, Centralized Data Collection, Geo-Spatial Data Mining, Streaming Media (MPEG-2), Wireless Sensor Networks.]

6 DIMENSIONS Design: Key Ideas
Construct a hierarchy of lossy compressed summaries of data using wavelet compression.
Queries "drill down" from the root of the hierarchy to focus the search on small portions of the network.
Progressively age lossy data along the spatio-temporal hierarchy to enable long-term storage.
[Figure: hierarchy with Level 0, Level 1 and Level 2, labelled "progressively lossy" and "progressively age".]

7 Roadmap
Why wavelets?
Example precipitation hierarchy
Spatial and temporal processing internals
Initial results: precipitation dataset

8 Enabling Technique: Wavelets
A very popular signal processing approach that provides good time and frequency localization (used in JPEG2000 and geo-spatial data mining).
Preserves spatio-temporal features (edges, discontinuities) while providing a good approximation of long-term trends in data.
Efficient distributed implementation is possible.
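To make the localization point concrete, here is a minimal sketch of a one-dimensional orthonormal Haar transform. The function names (haar_step, haar_decompose, haar_reconstruct) and the sample signal are illustrative assumptions, not taken from the slides. A signal with a single sharp edge ends up with one non-zero detail coefficient, while the root coefficient carries the long-term trend.

```python
import math

S = 1 / math.sqrt(2)  # orthonormal Haar scaling factor

def haar_step(signal):
    """One Haar level: pairwise scaled sums (approximation) and differences (detail)."""
    approx = [(signal[i] + signal[i + 1]) * S for i in range(0, len(signal), 2)]
    detail = [(signal[i] - signal[i + 1]) * S for i in range(0, len(signal), 2)]
    return approx, detail

def haar_decompose(signal):
    """Full decomposition; len(signal) must be a power of two.
    Returns (root coefficient, list of detail bands, finest band first)."""
    approx, details = list(signal), []
    while len(approx) > 1:
        approx, d = haar_step(approx)
        details.append(d)
    return approx[0], details

def haar_reconstruct(root, details):
    """Invert haar_decompose by walking from the coarsest band to the finest."""
    approx = [root]
    for band in reversed(details):
        nxt = []
        for a, d in zip(approx, band):
            nxt.append((a + d) * S)
            nxt.append((a - d) * S)
        approx = nxt
    return approx

# Hypothetical time series with one discontinuity (an "edge").
series = [2, 2, 2, 2, 9, 9, 9, 9]
root, details = haar_decompose(series)
nonzero = sum(1 for band in details for c in band if abs(c) > 1e-9)
print("non-zero detail coefficients:", nonzero)
```

Because the transform is orthonormal, dropping small detail coefficients changes the reconstruction by a bounded amount, which is what makes thresholding a natural lossy step.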

9 Sample Architecture: Precipitation Hierarchy
Local processing: construct a lossy time-series summary (zero communication cost).
Spatial data processing: hierarchical lossy compression
  Organize the network into a hierarchy. At each higher level, reduce the number of participating nodes by a factor of 4.
  At each step of the hierarchy, summarize data from 4 quadrants and propagate wavelet coefficients.
Example query: What is the maximum precipitation between Sept-Dec 2002? Direct the query to the quadrant that best matches the query.
[Figure labels: decreasing temporal resolution; decreasing spatial resolution.]

10 Spatial Decomposition
Recursively split the network into non-overlapping square grids.
At each level of the hierarchy:
  Elect a cluster-head.
  The cluster-head combines and summarizes data from 4 quadrants.
  The cluster-head propagates compressed data to the next level of the hierarchy.
Routing protocol: GPSR variant (DCS - Ratnasamy et al.)
[Figure: hierarchy construction.]
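The recursive quadrant split can be sketched with simple index arithmetic. This is an illustration only: it assumes nodes sit on a square grid whose side is a power of two, and the helper names (cluster_of, children, clusters_per_level) are hypothetical. Cluster-head election and GPSR routing are not modelled.

```python
def cluster_of(x, y, level):
    """Cell that node (x, y) belongs to at a hierarchy level (level 0 = the node itself)."""
    return (x >> level, y >> level)

def children(cell):
    """The four next-lower-level cells (quadrants) that a cell summarizes."""
    cx, cy = cell
    return [(2 * cx + dx, 2 * cy + dy) for dy in (0, 1) for dx in (0, 1)]

def clusters_per_level(side):
    """Number of cells at each level for a side x side grid (side is a power of two)."""
    counts, level = [], 0
    while (side >> level) >= 1:
        counts.append((side >> level) ** 2)
        level += 1
    return counts

# On a hypothetical 8x8 deployment the cell count shrinks by 4x per level.
print(clusters_per_level(8))   # [64, 16, 4, 1]
print(cluster_of(5, 6, 1))     # (2, 3)
```

Each level's cell is the parent of exactly four cells one level down, which is where the factor-of-4 reduction in participating nodes comes from.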

11 Wavelet Compression Internals
Pipeline: Input Data (x, y, time) → Wavelet Subband Decomposition → Thresholding + Quantization + Drop Subbands → Lossless Encoder → Compressed Output (x, y, time)
Filter
  Haar filter
  Daubechies 9/7 filter
Cost Metric
  Communication budget
  Error bound
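The stages after subband decomposition can be sketched as follows. This is a hedged illustration, not the authors' implementation: the coefficient values, threshold and step size are made up, and run-length encoding stands in for the lossless encoder purely to keep the example short.

```python
def threshold(coeffs, t):
    """Zero out coefficients whose magnitude is below t."""
    return [c if abs(c) >= t else 0.0 for c in coeffs]

def quantize(coeffs, step):
    """Uniform quantization: map each coefficient to an integer bin index."""
    return [round(c / step) for c in coeffs]

def dequantize(symbols, step):
    """Approximate coefficient values recovered from bin indices."""
    return [q * step for q in symbols]

def run_length_encode(symbols):
    """Lossless stand-in encoder: collapse repeats into (symbol, count) pairs."""
    runs = []
    for s in symbols:
        if runs and runs[-1][0] == s:
            runs[-1][1] += 1
        else:
            runs.append([s, 1])
    return [tuple(r) for r in runs]

# Hypothetical wavelet coefficients: two large ones, the rest near zero.
coeffs = [15.6, 0.2, -0.1, 0.05, -9.9, 0.3, 0.0, 0.1]
symbols = quantize(threshold(coeffs, t=1.0), step=0.5)
print(run_length_encode(symbols))  # [(31, 1), (0, 3), (-20, 1), (0, 3)]
```

Raising the threshold or the step size shrinks the output at the cost of accuracy, which is how a communication budget or an error bound can drive the parameter choice.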

12 Initial Results with Precipitation Dataset: Communication Overhead
15x12 grid (50 km edge) of precipitation data from the Pacific Northwest † (date range missing from the transcript).
Gridded before processing.
Handpicked choice of threshold, quantization intervals, and subbands to drop. Huffman encoder at output.
Very large compression ratio up the hierarchy.
Table columns (values not preserved in the transcript): Level | Raw data size (Kb) - R | Mean data sent to next level (Kb) - M | Compression Ratio = R/M
† M. Widmann and C. Bretherton. 50 km resolution daily precipitation for the Pacific Northwest,

13 Find maximum annual precipitation for each year.
Exact answer for 89% of queries.
Within 90% of the answer for >95% of queries.
Queries require less than 3% of the network.
Good performance on average with very low lookup overhead.
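A drill-down maximum query can be sketched over a pyramid of per-quadrant summaries. This sketch assumes a square power-of-two grid and stores the exact maximum of each 2x2 block, so the drill-down here always finds the true maximum. With lossy wavelet summaries, as in the slides, the choice of quadrant can be wrong, which is why only a fraction of queries return the exact answer. The names build_max_pyramid and drill_down_max and the sample grid are hypothetical.

```python
def build_max_pyramid(grid):
    """levels[0] is the raw grid; each higher level halves the side and keeps
    the maximum of every 2x2 block (one summary per four quadrants)."""
    levels = [grid]
    while len(levels[-1]) > 1:
        prev, half = levels[-1], len(levels[-1]) // 2
        levels.append([
            [max(prev[2 * r][2 * c], prev[2 * r][2 * c + 1],
                 prev[2 * r + 1][2 * c], prev[2 * r + 1][2 * c + 1])
             for c in range(half)]
            for r in range(half)
        ])
    return levels

def drill_down_max(levels):
    """Start at the root and repeatedly descend into the best-matching quadrant.
    Returns (row, col, value, number of summaries examined)."""
    r = c = 0
    examined = 1  # the root summary
    for level in range(len(levels) - 2, -1, -1):
        candidates = [(levels[level][2 * r + dr][2 * c + dc], 2 * r + dr, 2 * c + dc)
                      for dr in (0, 1) for dc in (0, 1)]
        examined += 4
        _, r, c = max(candidates)
    return r, c, levels[0][r][c], examined

# Hypothetical 4x4 precipitation grid.
grid = [[1, 2, 3, 4],
        [5, 6, 7, 8],
        [9, 30, 11, 12],
        [13, 14, 15, 16]]
print(drill_down_max(build_max_pyramid(grid)))  # (2, 1, 30, 9)
```

The number of summaries examined grows with the depth of the hierarchy rather than with the number of nodes, which is the source of the low lookup overhead.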

14 Locate boundary in annual precipitation between Low and High Precipitation Areas
Error metric: number of nodes more than 1 pixel away from the drill-down boundary.
Accuracy: within 25% error for 93% of the queries (or within 13% error for 75% of the queries).
Less than 5% of the network queried.

15 Open Issues
Load Balancing and Robustness
  Hierarchical model vs. peer model: a lot of work in P2P systems…
Irregular Node Placement
  Use wavelet extensions for irregular node placement (computationally more expensive).
  Gridify the dataset with interpolation.
Providing Query Guarantees
  Can we bound the error in the response obtained for a drill-down query at a particular level of the hierarchy?
Implementation on iPAQ/mote network

16 Summary
DIMENSIONS provides a holistic data handling architecture for sensor networks that can:
  Support a wide range of sensor-network usage and query models (using drill-down querying of wavelet summaries).
  Provide a gracefully degrading lossy storage model (by progressively ageing summaries).
  Offer the ability to tune energy expended for query performance (tunable lossy compression).

17 Different optimization metrics
[Figure: systems compared: Internet-based Peer-to-Peer Systems, Geo-Spatial Data Mining, Web Caches, Streaming Media (MPEG-2), Wireless Sensor Networks. Metrics and constraints shown: Energy, Latency, Bandwidth, Approximate Results OK, Spatio-temporal Query Performance, Lookup Cost, Resource Constraints.]

18 Other Examples: Packet Loss
A different example of a dataset that exhibits spatial correlation:
  Throughput from one transmitter to proximate receivers is correlated.
  Throughput from multiple proximate transmitters to one receiver is correlated.
Typically, what we want to query is the deviation from normal and the average throughput.

19 Packet-Loss Dataset: Get Throughput vs. Distance Map
Involves an expensive transfer of a 12x14 map from each node.
Good approximate results can be obtained from querying compressed data.

20 Long-term Storage: Concepts
Data is progressively aged, both locally and along the hierarchy.
Summaries that cover larger areas and longer time periods are retained for much longer than raw time series.
[Figure labels: slower ageing; wavelet coefficients.]
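One way to picture progressive ageing is a retention period that grows with the hierarchy level. The sketch below is an assumption-laden illustration: the slides do not give a retention policy, so the geometric schedule (base period of 7 time units, factor of 4 per level) and the names retention and age_store are hypothetical.

```python
def retention(level, base=7, factor=4):
    """Hypothetical retention period (in time units) for summaries at a level.
    Coarser summaries (higher levels) are kept longer."""
    return base * factor ** level

def age_store(store, now, base=7, factor=4):
    """store holds (level, timestamp, payload) entries; keep only those that are
    still within the retention period for their level."""
    return [(lvl, ts, payload) for (lvl, ts, payload) in store
            if now - ts <= retention(lvl, base, factor)]

# Hypothetical contents of one node's store at time 100.
store = [(0, 0, "raw series"), (1, 0, "level-1 summary"),
         (2, 0, "level-2 summary"), (0, 95, "recent raw series")]
print(age_store(store, now=100))
# [(2, 0, 'level-2 summary'), (0, 95, 'recent raw series')]
```

Old raw data and fine-grained summaries are discarded first, so a long-term query can still be answered, at lower resolution, from the coarse summaries that remain.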

21 Load Balancing and Robustness: Concepts
Hierarchical Model
  Naturally fits wavelet processing.
  Strict hierarchies are vulnerable to node failures. Failures near the root of the hierarchy can be expensive to repair.
Decentralized Peer Model
  Summaries are communicated to multiple nodes probabilistically.
  Better robustness, but incurs greater communication overhead.