Design Considerations for High Fan-in Systems: The HiFi Approach Presented by Shawn Jeffery CIDR‘05 1/7/05 Michael J. Franklin, Shawn R. Jeffery, Sailesh.

Slides:



Advertisements
Similar presentations
anywhere and everywhere. omnipresent A sensor network is an infrastructure comprised of sensing (measuring), computing, and communication elements.
Advertisements

V-1 Part V: Collaborative Signal Processing Akbar Sayeed.
Online Filtering, Smoothing & Probabilistic Modeling of Streaming Data In short, Applying probabilistic models to Streams Bhargav Kanagal & Amol Deshpande.
Continuous Analytics Over Discontinuous Streams Sailesh Krishnamurthy, Michael Franklin, Jeff Davis, Daniel Farina, Pasha Golovko, Alan Li, Neil Thombre.
Adding scalability to legacy PHP web applications Overview Mario A. Valdez-Ramirez.
C van Ingen, D Agarwal, M Goode, J Gupchup, J Hunt, R Leonardson, M Rodriguez, N Li Berkeley Water Center John Hopkins University Lawrence Berkeley Laboratory.
Semantic Web Based Architecture for Managing Hardware Heterogeneity in Wireless Sensor Network Authors: Sinisa Nikolić, MSc Valentin Penca, MSc Milan Segedinac,
11 Distributed Middleware for Container Transport: Lessons Learned (Klaas Thoelen, Sam Michiels, Wouter Joosen) 7th MiNEMA Workshop August 21, Lappeenranta,
1 SAFIRE Project DHS Update – July 15, 2009 Introductions  Update since last teleconference Demo Video - Fire Incident Command Board (FICB) SAFIRE Streams.
Adaptive Cleaning for RFID Data Streams Shawn Jeffery Minos Garofalakis Michael Franklin UC Berkeley Intel Research Berkeley UC Berkeley Presented by:
Design and Implementation of a Middleware for Sentient Spaces Bijit Hore, Hojjat Jafarpour, Ramesh Jain, Shengyue Ji, Daniel Massaguer Sharad Mehrotra,
ARCS Data Analysis Software An overview of the ARCS software management plan Michael Aivazis California Institute of Technology ARCS Baseline Review March.
The Cougar Approach to In-Network Query Processing in Sensor Networks By Yong Yao and Johannes Gehrke Cornell University Presented by Penelope Brooks.
1 Geographic Information Infrastructures for Ubiquitous Computing Spring 2007 Ki-Joune Li.
Queries over Sensor Networks Sam Madden UC Berkeley Database Seminar October 5, 2001.
Components and Architecture CS 543 – Data Warehousing.
A Survey of Wireless Sensor Network Data Collection Schemes by Brett Wilson.
Approximate data collection in sensor networks the appeal of probabilistic models David Chu Amol Deshpande Joe Hellerstein Wei Hong ICDE 2006 Atlanta,
UNIVERSITY OF SOUTHERN CALIFORNIA Embedded Networks Laboratory 1 Wireless Sensor Networks Ramesh Govindan Lab Home Page:
Chapter 6 SECURE WIRELESS PERSONAL NETWORKS: HOME EXTENDED TO ANYWHERE.
Data Management Challenges and Opportunities in the Digital Home* ICME Amsterdam July 2005 Mike Franklin UC Berkeley *in collaboration with Intel Research.
Declarative Support for Sensor Data Cleaning Shawn Jeffery Gustavo Alonso Michael Franklin Wei Hong Jennifer Widom UC Berkeley ETH Zurich UC Berkeley Arch.
Sensor Networks: Implications for Database Systems and Vice-Versa Michael Franklin January UCB Sensor Day.
Abstractions for Shared Sensor Networks DMSN September 2006 Michael J. Franklin.
Streaming Data, Continuous Queries, and Adaptive Dataflow Michael Franklin UC Berkeley NRC June 2002.
HiFi: Network-centric Query Processing in the Physical World SAP Research Forum February 2005 Mike Franklin UC Berkeley.
The Structure of (Computer) Scientific Revolutions Dow Jones Enterprise Ventures May 2006 Michael Franklin UC Berkeley & Amalgamated Insight.
Model-driven Data Acquisition in Sensor Networks Amol Deshpande 1,4 Carlos Guestrin 4,2 Sam Madden 4,3 Joe Hellerstein 1,4 Wei Hong 4 1 UC Berkeley 2 Carnegie.
Adaptive Cleaning for RFID Data Streams VLDB /12/06 Shawn Jeffery Minos Garofalakis Michael Franklin UC Berkeley Intel Research Berkeley UC Berkeley.
Data-Intensive Systems Michael Franklin UC Berkeley
Distributed Data Analysis & Dissemination System (D-DADS) Prepared by Stefan Falke Rudolf Husar Bret Schichtel June 2000.
A Research Agenda for Accelerating Adoption of Emerging Technologies in Complex Edge-to-Enterprise Systems Jay Ramanathan Rajiv Ramnath Co-Directors,
Training Workshop Windows Azure Platform. Presentation Outline (hidden slide): Technical Level: 200 Intended Audience: Developers Objectives (what do.
Virtual Machine Hosting for Networked Clusters: Building the Foundations for “Autonomic” Orchestration Based on paper by Laura Grit, David Irwin, Aydan.
Agent-based Device Management in RFID Middleware Author : Zehao Liu, Fagui Liu, Kai Lin Reporter :郭瓊雯.
Information-Based Building Energy Management SEEDM Breakout Session #4.
Moving the RFID Value Chain Value Proposition Cost and Complexity What is it? (passive RFID) Where is it? (active RFID) How is it? (Sensors) Adapt to it.
PIER & PHI Overview of Challenges & Opportunities Ryan Huebsch † Joe Hellerstein † °, Boon Thau Loo †, Sam Mardanbeigi †, Scott Shenker †‡, Ion Stoica.
COMP 410 Update. The Problems Story Time! Describe the Hurricane Problem Do this with pictures, lots of people, a hurricane, trucks, medicine all disconnected.
Network Computing Laboratory HiFi Systems: Network-Centric Query Processing for the Physical World Michael J. Franklin, Shawn R. Jeffrey, et al UC Berkeley.
OOI CI LCA REVIEW August 2010 Ocean Observatories Initiative OOI Cyberinfrastructure Architecture Overview Michael Meisinger Life Cycle Architecture Review.
IntroductionRelated work 2 Contents Publish/Subscribe middleware Conclusion and Future works.
Overview of Sensor Networks David Culler Deborah Estrin Mani Srivastava.
Event Processing A Perspective From Oracle Dieter Gawlick, Shailendra Mishra Oracle Corporation March,
The Future of Data Management or The Structure of (Computer) Scientific Revolutions EECS BEARS Conference February 2007 Michael Franklin UC Berkeley &
FOREWORD By: Howard Shrobe MIT CS & AI Laboratory
Rule-Based Programming for VORBs Bertram Ludaescher Arcot Rajasekar Data and Knowledge Systems San Diego Supercomputer Center U.C. San Diego.
1 Computing Challenges for the Square Kilometre Array Mathai Joseph & Harrick Vin Tata Research Development & Design Centre Pune, India CHEP Mumbai 16.
A Lone Wolf No More: Supporting Network Intrusion Detection with Real-Time Intelligence Shane Singh | COMPSCI 726.
Yanlei Diao, University of Massachusetts Amherst Future Directions in Sensor Data Management Yanlei Diao University of Massachusetts, Amherst.
1 CS851 Data Services in Advanced System Applications Sang H. Son
Creating a Data Warehouse Data Acquisition: Extract, Transform, Load Extraction Process of identifying and retrieving a set of data from the operational.
Internet2 AdvCollab Apps 1 Access Grid Vision To create virtual spaces where distributed people can work together. Challenges:
Distributed Data Analysis & Dissemination System (D-DADS ) Special Interest Group on Data Integration June 2000.
Internet of Things. IoT Novel paradigm – Rapidly gaining ground in the wireless scenario Basic idea – Pervasive presence around us a variety of things.
Adaptive Cleaning for RFID Data Streams. RFID: Radio Frequency IDentification.
W. Hong & S. Madden – Implementation and Research Issues in Query Processing for Wireless Sensor Networks, ICDE 2004.
Patrick Ortiz Global SQL Solution Architect Dell Inc. BIN209.
Cyberinfrastructure Overview of Demos Townsville, AU 28 – 31 March 2006 CREON/GLEON.
Efficient Opportunistic Sensing using Mobile Collaborative Platform MOSDEN.
Source : 2014 IEEE Ninth International Conference on Intelligent Sensors, Sensor Networks and Information Processing (ISSNIP) Auther : Nacer Khalil, Mohamed.
The Design of an Acquisitional Query Processor For Sensor Networks Samuel Madden, Michael J. Franklin, Joseph M. Hellerstein, and Wei Hong Presentation.
Wireless sensor and actor networks: research challenges Ian. F. Akyildiz, Ismail H. Kasimoglu
Distributed database approach,
TRUST:Team for Research in Ubiquitous Secure Technologies
The Design of an Acquisitional Query Processor For Sensor Networks
DrillSim July 2005.
Adaptive Cleaning for RFID Data Streams
An Analysis of Stream Processing Languages
Nolan Leake Co-Founder, Cumulus Networks Paul Speciale
Presentation transcript:

Design Considerations for High Fan-in Systems: The HiFi Approach Presented by Shawn Jeffery CIDR‘05 1/7/05 Michael J. Franklin, Shawn R. Jeffery, Sailesh Krishnamurthy, Frederick Reiss, Shariq Rizvi, Eugene Wu, Owen Cooper, Anil Edakkunni, and Wei Hong UC Berkeley, Intel Research Berkeley

1/7/05Shawn Jeffery, HiFi Project, UCB EECS Itinerary Introduction: High Fan-in Systems HiFi Overview Initial Prototype Ongoing Work and Future Directions Conclusions

1/7/05Shawn Jeffery, HiFi Project, UCB EECS Introduction Receptors everywhere! Wireless sensor networks, RFID technologies, digital home, network monitors,... Somehow need to make sense of this data to provide near real-time decision support

1/7/05Shawn Jeffery, HiFi Project, UCB EECS High Fan-in Systems Large numbers of receptors = large data volumes Hierarchical, successive aggregation The “Bowtie” Challenges in 3 dimensions: Geography Time Resources

1/7/05Shawn Jeffery, HiFi Project, UCB EECS Supply-Chain Management (SCM) Receptors Warehouses, Stores Dock doors, Shelves Regional Centers Headquarters

1/7/05Shawn Jeffery, HiFi Project, UCB EECS State of the Art Not seen as a data management issue Focus on protocol design Different “data models” at each level Reinventing “query languages” at each level Piecemeal/stovepipe approach Each type of receptor (RFID, sensors, etc) handled separately Current solutions tend to be hand-coded, script- based approaches  No end-to-end, integrated solution for managing distributed receptor data

1/7/05Shawn Jeffery, HiFi Project, UCB EECS Itinerary Introduction: High Fan-in Systems HiFi Overview Initial Prototype Ongoing Work and Future Directions Conclusions

1/7/05Shawn Jeffery, HiFi Project, UCB EECS HiFi: Cascading Stream Processing in a High Fan-in System A data management infrastructure for high fan-in environments Uniform Declarative Framework Every node is a data stream processor that speaks SQL-ese  stream-oriented queries at all levels Hierarchical, stream-based views as an organizing principle

1/7/05Shawn Jeffery, HiFi Project, UCB EECS Hierarchical Query Processing “I provide raw readings for Soda Hall” “I provide avg daily values for Berkeley” “I provide avg weekly values for California” “I provide national monthly values for the US” Continuous and Streaming Windows Sharing Hierarchical Temporal granularity vs. geographic scope SELECT S.area, AVG(S.temp) FROM SENSOR_STREAM S [range by ‘5 sec’ slide by ‘5 sec’] GROUP BY S.area

1/7/05Shawn Jeffery, HiFi Project, UCB EECS Basic HiFi Architecture HiFi Glue DSQP HiFi Glue DSQP MDR Hierarchical federation of nodes Each node: Data Stream Query Processor (DSQP) HiFi Glue Views drive system functionality Metadata Repository (MDR) HiFi Glue DSQP HiFi Glue DSQP Management Query Planning Archiving Internode coordination and communication

1/7/05Shawn Jeffery, HiFi Project, UCB EECS In the paper… HiFi Design Considerations Dealing with Real-World Data Hierarchical Windowed Views with Sharing System Management Topological Fluidity Query Planning and Data Placement Complex Event Processing Archiving and Prioritization Privacy and Access Control

1/7/05Shawn Jeffery, HiFi Project, UCB EECS Itinerary Introduction: High Fan-in Systems HiFi Overview Initial Prototype Ongoing Work and Future Directions Conclusions

1/7/05Shawn Jeffery, HiFi Project, UCB EECS Envisioning HiFiBuilding HiFi

1/7/05Shawn Jeffery, HiFi Project, UCB EECS A Tale of Two Systems TelegraphCQ Data stream processor Continuous, adaptive query processing with aggressive sharing TinyDB Declarative query processing for wireless sensor networks In-network aggregation

1/7/05Shawn Jeffery, HiFi Project, UCB EECS Initial Prototype TelegraphCQ TinyDB Stargates Sensor Networks & RFID Readers RFID Wrappers PC

1/7/05Shawn Jeffery, HiFi Project, UCB EECS Initial Prototype VLDB ‘04

1/7/05Shawn Jeffery, HiFi Project, UCB EECS HiFi Design Considerations Dealing with Real-World Data Hierarchical Windowed Views with Sharing System Management Topological Fluidity Query Planning and Data Placement Complex Event Processing Archiving and Prioritization Privacy and Access Control Dealing with Real-World Data Hierarchical Windowed Views with Sharing System Management Topological Fluidity Query Planning and Data Placement Complex Event Processing Archiving and Prioritization Privacy and Access Control

1/7/05Shawn Jeffery, HiFi Project, UCB EECS CSAVA: Processing RFID Data in HiFi RFID data is gross! Lost readings Errant readings Duplicate readings Use queries to make the data usable CSAVA: Clean  Smooth  Arbitrate  Validate  Analyze

1/7/05Shawn Jeffery, HiFi Project, UCB EECS CSAVA: Processing RFID Data in HiFi Clean CREATE VIEW cleaned_rfid_stream AS (SELECT receptor_id, tag_id FROM rfid_stream rs WHERE read_strength >= strength_T)

1/7/05Shawn Jeffery, HiFi Project, UCB EECS CSAVA: Processing RFID Data in HiFi Clean Smooth CREATE VIEW smoothed_rfid_stream AS (SELECT receptor_id, tag_id FROM cleaned_rfid_stream [range by ’5 sec’, slide by ’5 sec’] GROUP BY receptor_id, tag_id HAVING count(*) >= count_T)

1/7/05Shawn Jeffery, HiFi Project, UCB EECS CSAVA: Processing RFID Data in HiFi Clean Smooth Arbitrate CREATE VIEW arbitrated_rfid_stream AS (SELECT receptor_id, tag_id FROM smoothed_rfid_stream rs [range by ’5 sec’, slide by ’5 sec’] GROUP BY receptor_id, tag_id HAVING count(*) >= ALL (SELECT count(*) FROM smoothed_rfid_stream [range by ’5 sec’, slide by ’5 sec’] WHERE tag_id = rs.tag_id GROUP BY receptor_id))

1/7/05Shawn Jeffery, HiFi Project, UCB EECS CSAVA: Processing RFID Data in HiFi Arbitrate Validate CREATE VIEW validated_tags AS (SELECT tag_name, FROM arbitrated_rfid_stream rs [range by ’5 sec’, slide by ’5 sec’], known_tag_list tl WHERE tl.tag_id = rs.tag_id Clean Smooth

1/7/05Shawn Jeffery, HiFi Project, UCB EECS CSAVA: Processing RFID Data in HiFi Validate CREATE VIEW tag_count AS (SELECT tag_name, count(*) FROM validated_tags vt [range by ‘5 min’, slide by ‘1 min’] GROUP BY tag_name Analyze Arbitrate Clean Smooth

1/7/05Shawn Jeffery, HiFi Project, UCB EECS CSAVA: Processing RFID Data in HiFi Augment Convert Aggregate Validate Analyze Arbitrate Clean Smooth Augment Convert Aggregate

1/7/05Shawn Jeffery, HiFi Project, UCB EECS CSAVA: Bridging the Physical- Virtual Divide An example of HiFi processing, but instrumental in dealing with real world data Arbitrate Clean SmoothWindow Single Tuple Multiple Receptors CSAVA Generalization

1/7/05Shawn Jeffery, HiFi Project, UCB EECS Complexity of Hierarchical Windowed Query Processing Naïve dissemination (unchanged query) introduces a lag in query results

1/7/05Shawn Jeffery, HiFi Project, UCB EECS Additive Lag in Hierarchical Windowed Query Processing Level 0 Level 1 Level 2 Window Event Result Tuple(s) Additive Lag! Result Tuple(s) Window SELECT S.area, AVG(temp) FROM SENSOR_STREAM S [range by ‘5 sec’ slide by ‘5 sec’] GROUP BY S.area User Time

1/7/05Shawn Jeffery, HiFi Project, UCB EECS Sketch of a Solution Level 0 Level 1 Level 2 Event Result Tuple(s) Window SELECT S.area, AVG(temp) FROM SENSOR_STREAM S [range by ‘5 seconds’ slide by ‘5 seconds’] GROUP BY S.area User Solution is to use both time-based windows and NOW windows Time Result Tuple(s) NOW window Time-based window

1/7/05Shawn Jeffery, HiFi Project, UCB EECS System Management Our small deployment: 20+ individual devices (4 types of devices) 5 different platforms (OS + Hardware)  Management nightmare System-wide management is crucial Both coarse and fine-grained Where we’re headed: System monitoring needed: turn the lens inwards to introspect on system state Use uniform declarative framework to provide failover and load balancing

1/7/05Shawn Jeffery, HiFi Project, UCB EECS Itinerary Introduction: High Fan-in Systems HiFi Overview Initial Prototype Ongoing Work and Future Directions Conclusions

1/7/05Shawn Jeffery, HiFi Project, UCB EECS Ongoing Work and Future Directions Bridging the physical-virtual divide Generalize CSAVA-type processing to other receptors Hierarchical query processing Query planning, dissemination Complex event processing Unify event and data processing System deployment and management Archiving and prioritization

1/7/05Shawn Jeffery, HiFi Project, UCB EECS Conclusions Receptors everywhere  High Fan-In Systems Uniform declarative framework is the key to building these systems The HiFi project is exploring this approach Our initial prototype Leveraged TelegraphCQ and TinyDB Validated the HiFi approach Identified research directions Broad in scope = much work to be done!

Questions? hifi.cs.berkeley.edu