Download presentation
Presentation is loading. Please wait.
Published byElian Gay Modified over 9 years ago
1
Computer Engineering and Networks Laboratory Visualizing Large Sensor Network Data Sets in Space and Time with Vizzly Matthias Keller, Jan Beutel, Olga Saukh, Lothar Thiele SenseApp 2012, 22/10/2012
2
Swiss Federal Institute of Technology 2 Matthias Keller, SenseApp 2012, 22/10/2012 Low-power wireless sensor networks for permafrost monitoring Mobile urban air quality sensing GPS loggers Coin-sized temperature loggers
3
Swiss Federal Institute of Technology 3 Matthias Keller, SenseApp 2012, 22/10/2012 More deployments, new sensors 1 st deployment 2 nd deployment 3 rd deployment 4 th deployment New sensor: GPS Four extra trams 1 st OpenSense tram ~2,500 sensing channels
4
Swiss Federal Institute of Technology 4 Matthias Keller, SenseApp 2012, 22/10/2012 Visualizing All Signal Dynamics Interpolation of once loaded dataDesired functionality Short-term behavior is not visible!
5
Swiss Federal Institute of Technology 5 Matthias Keller, SenseApp 2012, 22/10/2012 Research Questions and Challenges Very large, multi-year data sets How can we make all levels of detail accessible? What are suited data structures for efficient data access? How can we use precious resources, e.g., RAM, efficiently? Unrestricted sensing modalities Event-based or periodic sampling (Mean) sampling rates can vary between seconds and days System integration Visualization service should fit into existing landscape
6
Swiss Federal Institute of Technology 6 Matthias Keller, SenseApp 2012, 22/10/2012 Overview Related Work Vizzly System architecture Location-preserving temporal aggregation Hierarchical cache Output generation Case Study 1 2 3
7
Swiss Federal Institute of Technology 7 Matthias Keller, SenseApp 2012, 22/10/2012 Related Work Data interfaces of other projects SensorScope, GlacsWeb, da-sense, Geigermap,... Data streaming platforms, APIs Pachube/Cosm, Google Fusion Tables, Microsoft SensorMap, … Optimized database/data processing systems RasDaMan, tsdb, sMAP, … Evaluated solutions did not support (at least one): the anticipated data volume the visualization of multi-year time series in one view the visualization of all signal dynamics
8
Swiss Federal Institute of Technology 8 Matthias Keller, SenseApp 2012, 22/10/2012 Overview Related Work Vizzly System architecture Location-preserving temporal aggregation Hierarchical cache Output generation Case Study 1 2 3
9
Swiss Federal Institute of Technology 9 Matthias Keller, SenseApp 2012, 22/10/2012 Vizzly Overview Goal: Fast browsing of multi-year data at all levels of detail Cache layer + web service + front-end library Provides time series and map data widgets Includes a management interface and health sensors Freely available as open source software
10
Swiss Federal Institute of Technology 10 Matthias Keller, SenseApp 2012, 22/10/2012 System Architecture Recording of structured data with time and (optional) location information Processes, stores and publishes incoming data Aggregated data is stored in data structures that maintain temporal and spatial locality Users only need a web browser for interactively browsing through large data sets
11
Swiss Federal Institute of Technology 11 Matthias Keller, SenseApp 2012, 22/10/2012 Client-Server Communication User interface is only loaded once Vizzly servers returns (time, value) and (location, value) tuples, respectively, when requests parameters change Easy integration into arbitrary web pages How is the returned CSV data generated?
12
Swiss Federal Institute of Technology 12 Matthias Keller, SenseApp 2012, 22/10/2012 Back-end Design Challenges Users can select data on temporal and spatial criteria Arbitrary combinations of temporal and spatial aggregation levels cannot be cached efficiently Location-preserving temporal aggregation scheme Ideally, all data should also be ready in the Vizzly cache Memory cache is the fastest, but also the smallest Continuous updating of a hierarchical cache Whenever possible, unaggregated data should be returned Size of unaggregated data is not known beforehand Sampling rate estimation
13
Swiss Federal Institute of Technology 13 Matthias Keller, SenseApp 2012, 22/10/2012 Location-preserving Temporal Aggregation Idea: Perform temporal aggregation before caching, but postpone spatial aggregation to request processing Down-sampling of time information to target resolution: 1 2 Aggregation function is applied to samples of equal down-sampled time and equal location information: 3 Resulting tuple is put into cache
14
Swiss Federal Institute of Technology 14 Matthias Keller, SenseApp 2012, 22/10/2012 Spatial aggregation is only done when serving a concrete request Resulting tuples are sent to client Spatial Aggregation of Pre-Aggregated Data Data of reduced temporal resolution is loaded from cache 4 5 Location information is reduced to matching target resolution: 7 Aggregation function is applied to samples of equal (reduced) time and location information: 6
15
Swiss Federal Institute of Technology 15 Matthias Keller, SenseApp 2012, 22/10/2012 Back-end Design Challenges Users can select data on temporal and spatial criteria Arbitrary combinations of temporal and spatial aggregation levels cannot be cached efficiently Location-preserving temporal aggregation scheme Ideally, all data should also be ready in the Vizzly cache Memory cache is the fastest, but also the smallest Continuous updating of a hierarchical cache Whenever possible, unaggregated data should be returned Size of unaggregated data is not known beforehand Sampling rate estimation
16
Swiss Federal Institute of Technology 16 Matthias Keller, SenseApp 2012, 22/10/2012 Hierarchical Cache Memory DBMS, e.g., MySQL Original data store fastest slowest Order of accesses until signal data is found
17
Swiss Federal Institute of Technology 17 Matthias Keller, SenseApp 2012, 22/10/2012 Idea: While time information can be linearized easily, doing so avoids searching for stored data and facilitates further down-sampling in the temporal domain Memory Cache Data without location information: Data with location information: Multiple entries for one timestamp
18
Swiss Federal Institute of Technology 18 Matthias Keller, SenseApp 2012, 22/10/2012 Vizzly “learns” interesting signals from requests received Learned signals are continuously polled for new data Strategies for hierarchical caches Balancing by signal popularity Distribution by aggregation level … Cache Updating
19
Swiss Federal Institute of Technology 19 Matthias Keller, SenseApp 2012, 22/10/2012 Back-end Design Challenges Users can select data on temporal and spatial criteria Arbitrary combinations of temporal and spatial aggregation levels cannot be cached efficiently Location-preserving temporal aggregation scheme Ideally, all data should also be ready in the Vizzly cache Memory cache is the fastest, but also the smallest Continuous updating of a hierarchical cache Whenever possible, unaggregated data should be returned Size of unaggregated data is not known beforehand Sampling rate estimation
20
Swiss Federal Institute of Technology 20 Matthias Keller, SenseApp 2012, 22/10/2012 Output generation for single view containing (n+m) signals: 1. For each signal, decide if unaggregated data points can be displayed (estimated mean sampling rate x time period) 2. Apply spatial filtering/aggregation when data with location information is loaded Request Processing in the Back-end
21
Swiss Federal Institute of Technology 21 Matthias Keller, SenseApp 2012, 22/10/2012 Overview Related Work Vizzly System architecture Location-preserving temporal aggregation Hierarchical cache Output generation Case Study 1 2 3
22
Swiss Federal Institute of Technology 22 Matthias Keller, SenseApp 2012, 22/10/2012 Data Fetch Performance GSN: Unaggregated input 2,100 sensing channels, ~550 million data points MySQL DB: 1 st aggregation level, 4 minutes resolution 300 million aggregated data points, ~15 GB data Memory: 2 nd aggregation level, 16 minutes resolution 100 million aggregated data points, ~800 MB data 99 th percentiles: Live agg.: 4 msec Memory: 5 msec MySQL: 690 msec GSN: 6.9 sec
23
Swiss Federal Institute of Technology 23 Matthias Keller, SenseApp 2012, 22/10/2012 Conclusions New middleware for the interactive browsing of large sensor network data sets Provided time series and map data widgets can be easily integrated into existing web pages Data fetch are dramatically reduced by caching Vizzly has proven its usefulness in more than 1 year of operation in a production environment Vizzly is available as open-source software Demo: http://data.permasense.ch/ Project home: https://code.google.com/p/vizzly/
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.