Eddies for Continuous Queries

Slides:



Advertisements
Similar presentations
Evaluating Window Joins over Unbounded Streams Author: Jaewoo Kang, Jeffrey F. Naughton, Stratis D. Viglas University of Wisconsin-Madison CS Dept. Presenter:
Advertisements

Register Allocation CS 671 March 27, CS 671 – Spring Register Allocation - Motivation Consider adding two numbers together: Advantages: Fewer.
Query Optimization CS634 Lecture 12, Mar 12, 2014 Slides based on “Database Management Systems” 3 rd ed, Ramakrishnan and Gehrke.
CS 540 Database Management Systems
한국기술교육대학교 컴퓨터 공학 김홍연 TinyDB : An Acquisitional Query Processing System for Sensor Networks. - Samuel R. Madden, Michael J. Franklin, Joseph M. Hellerstein,
Database Management Systems 3ed, R. Ramakrishnan and Johannes Gehrke1 Evaluation of Relational Operations: Other Techniques Chapter 14, Part B.
Implementation of Other Relational Algebra Operators, R. Ramakrishnan and J. Gehrke1 Implementation of other Relational Algebra Operators Chapter 12.
Database Management Systems, R. Ramakrishnan and Johannes Gehrke1 Evaluation of Relational Operations: Other Techniques Chapter 12, Part B.
Database Management Systems, R. Ramakrishnan and Johannes Gehrke1 Evaluation of Relational Operations: Other Techniques Chapter 12, Part B.
Query Execution Optimizing Performance. Resolving an SQL query Since our SQL queries are very high level, the query processor must do a lot of additional.
PSoup Kevin Menard CS 561 4/11/2005. Streaming Queries over Streaming Data Sirish Chandrasekaran UC Berkeley August 20, 2002 with Michael J. Franklin.
Evaluating Window Joins Over Unbounded Streams By Nishant Mehta and Abhishek Kumar.
1 Continuously Adaptive Continuous Queries (CACQ) over Streams Samuel Madden, Mehul Shah, Joseph Hellerstein, and Vijayshankar Raman Presented by: Bhuvan.
Performance Evaluation of IPv6 Packet Classification with Caching Author: Kai-Yuan Ho, Yaw-Chung Chen Publisher: ChinaCom 2008 Presenter: Chen-Yu Chaug.
Freddies: DHT-Based Adaptive Query Processing via Federated Eddies Ryan Huebsch Shawn Jeffery CS Peer-to-Peer Systems 12/9/03.
1 Evaluation of Relational Operations: Other Techniques Chapter 12, Part B.
The Design of an Acquisitional Query Processor For Sensor Networks Samuel Madden, Michael J. Franklin, Joseph M. Hellerstein, and Wei Hong Presentation.
Performance Issues in Adaptive Query Processing Fred Reiss U.C. Berkeley Database Group.
...Looking back Why use a DBMS? How to design a database? How to query a database? How does a DBMS work?
Query Execution Chapter 15 Section 15.1 Presented by Khadke, Suvarna CS 257 (Section II) Id
T HE Q UERY C OMPILER Prepared by : Ankit Patel (226)
An Array-Based Algorithm for Simultaneous Multidimensional Aggregates
Context Tailoring the DBMS –To support particular applications Beyond alphanumerical data Beyond retrieve + process –To support particular hardware New.
Module 7 Reading SQL Server® 2008 R2 Execution Plans.
Ripple Joins for Online Aggregation by Peter J. Haas and Joseph M. Hellerstein published in June 1999 presented by Ronda Hilton.
1 CPS216: Advanced Database Systems Notes 04: Operators for Data Access Shivnath Babu.
1 Fjording The Stream An Architecture for Queries over Streaming Sensor Data Samuel Madden, Michael Franklin UC Berkeley.
REED: Robust, Efficient Filtering and Event Detection in Sensor Networks Daniel Abadi, Samuel Madden, Wolfgang Lindner MIT United States VLDB 2005.
1 REED: Robust, Efficient Filtering and Event Detection in Sensor Networks Daniel Abadi, Samuel Madden, Wolfgang Lindner MIT United States VLDB 2005.
Query Execution Section 15.1 Shweta Athalye CS257: Database Systems ID: 118 Section 1.
An Example Data Stream Management System: TelegraphCQ INF5100, Autumn 2009 Jarle Søberg.
1 Continuously Adaptive Continuous Queries (CACQ) over Streams Samuel Madden SIGMOD 2002 June 4, 2002 With Mehul Shah, Joseph Hellerstein, and Vijayshankar.
Eddies: Continuously Adaptive Query Processing Ross Rosemark.
Evaluating Window Joins over Unbounded Streams Jaewoo Kang Jeffrey F. Naughton Stratis D. Viglas {jaewoo, naughton, Univ. of Wisconsin-Madison.
Query Processing CS 405G Introduction to Database Systems.
File Processing : Query Processing 2008, Spring Pusan National University Ki-Joune Li.
Relational Operator Evaluation. overview Projection Two steps –Remove unwanted attributes –Eliminate any duplicate tuples The expensive part is removing.
REED : Robust, Efficient Filtering and Event Detection in Sensor Network Daniel J. Abadi, Samuel Madden, Wolfgang Lindner Proceedings of the 31st VLDB.
CPS216: Advanced Database Systems Notes 02:Query Processing (Overview) Shivnath Babu.
Unary Query Processing Operators
CPS216: Data-intensive Computing Systems
CS 540 Database Management Systems
CS 440 Database Management Systems
Prepared by : Ankit Patel (226)
The Design of an Acquisitional Query Processor For Sensor Networks
Chapter 15 QUERY EXECUTION.
Evaluation of Relational Operations: Other Operations
CS222: Principles of Data Management Notes #13 Set operations, Aggregation Instructor: Chen Li.
Join Processing in Database Systems with Large Main Memories (part 2)
Lecture 17: Distributed Transactions
Implementation of Relational Operations (Part 2)
Relational Operations
Query Processing for High-Volume XML Message Brokering
Streaming Sensor Data Fjord / Sensor Proxy Multiquery Eddy
CS143:Evaluation and Optimization
Query Execution Presented by Jiten Oswal CS 257 Chapter 15
Relational Algebra Friday, 11/14/2003.
Implementation of Relational Operations
Lecture 13: Query Execution
Kabra and DeWitt presented by Zack Ives CSE 590DB, May 11, 1998
CS222P: Principles of Data Management Notes #13 Set operations, Aggregation, Query Plans Instructor: Chen Li.
Evaluation of Relational Operations: Other Techniques
Heavy Hitters in Streams and Sliding Windows
REED : Robust, Efficient Filtering and Event Detection
Measuring the Gap between FPGAs and ASICs
PSoup: A System for streaming queries over streaming data
CS222: Principles of Data Management Lecture #15 Query Optimization (System-R) Instructor: Chen Li.
Streams and Stuff Sirish and Sam and Mike.
Evaluation of Relational Operations: Other Techniques
All about Indexes Gail Shaw.
Presentation transcript:

Eddies for Continuous Queries Sam Madden CS286 Project S01

Motivation Want many queries over continuous streams of data Current Eddies Thread per query Scanner per source Share common work between modules Reduce memory burden Intra-query scheduling (Not focusing on joins – Need new operators to deal with endless streams)

Data Structures One Eddy per Telegraph Instance Only Source-module for each source (over all queries) One Filter per Source field (over all queries) Per-Source State Source -> Reachable modules Query -> Completion bitmask Per-Tuple State Output query mask Per Query State: Output queues Aggregate information

Tuple Flow Tuple Arrives Works for Joins Too (Somewhat Inefficiently?) Tagged with source id Routing policy chooses a filter to route to, based on modules reachable from source Filter marks query state as “output” for tuples which don’t pass Tuple output to queries which have completed, using source If more filters to check, tuple re-inserted into eddy Works for Joins Too (Somewhat Inefficiently?) Extend reachability graph across joins Project out unused sources when tuples are output

Combining Filters Given a Filter F over some field S.a, with n predicates generalized to be over ranges [a,b] (plus not-equals) Interval tree for >, >=, <, <= predicates, inserting from interval (a,], [a,], [- , b), or [- , b]. (O(log n)) When a tuple arrives, find intervals which it itersects. (O(n)) For = and , use a hash table For , output all tuples except those in table Saves routing, tuple parsing cost Simplifies optimization space

Routing Policy Random policy routes to each module with equal probability Ticket policy: from Eddy paper Route to modules with highest selectivity Estimate selectivity based on ratio of in/out tuples Use back-pressure to adjust delivery rates Multi-query Ticket policy Estimate selectivity based on ratio of (number of applied predicates /number of passed predicates) Based on Shankar’s implementation: back pressure not applied properly

Preliminary Results Simple, four query test: from s select s.index where s.a > 30 from s select s.index where s.b > 30 and s.a > 30 from s select s.index where s.c > 30 and s.b> 30 and s.a > 30 from s select s.index where s.d > 30 and s.c > 30 and s.b > 30 and s.a > 30 Becomes five modules: one scanner and four filters