CAPE: Continuous Query Engine with Heterogeneous-Grained Adaptivity Elke A. Rundensteiner, Luping Ding, Timothy Sutherland, Yali Zhu Brad Pielech, Nishant.

Slides:



Advertisements
Similar presentations
Analysis of : Operator Scheduling in a Data Stream Manager CS561 – Advanced Database Systems By Eric Bloom.
Advertisements

CS 1220 – Object Oriented Design 1 Project Info Term Project Overview 1.You need to read in a circuit description Consists of list of gates, what nodes.
Input/Output Management and Disk Scheduling
Dr Mohamed Menacer College of Computer Science and Engineering Taibah University CS-334: Computer.
DAX: Dynamically Adaptive Distributed System for Processing CompleX Continuous Queries Bin Liu, Yali Zhu, Mariana Jbantova, Brad Momberger, and Elke A.
Elke A. Rundensteiner Topics projects in database and Information systems, such as, web information systems, distributed databases, Etc. Database Systems.
Dynamic Plan Migration for Continuous Query over Data Streams Yali Zhu, Elke Rundensteiner and George Heineman Database System Research Group Worcester.
Continuous Stream Monitoring Technology Elke A. Rundensteiner Database Systems Research Laboratory Department of Computer Science Worcester Polytechnic.
VLDB Revisiting Pipelined Parallelism in Multi-Join Query Processing Bin Liu and Elke A. Rundensteiner Worcester Polytechnic Institute
SIGMOD'061 Run-Time Operator State Spilling for Memory Intensive Long-Running Queries Bin Liu, Yali Zhu and Elke A. Rundensteiner Database Systems Research.
SWiM Panel on Engine Implementation Jennifer Widom.
Continuous Stream Monitoring Technology Elke A. Rundensteiner Database Systems Research Laboratory Department of Computer Science Worcester Polytechnic.
Dynamic Plan Migration for Continuous Queries over Data Streams Yali Zhu, Elke Rundensteiner and George Heineman Database System Research Group, WPI. Massachusetts,
An Adaptive Multi-Objective Scheduling Selection Framework For Continuous Query Processing Timothy M. Sutherland Bradford Pielech Yali Zhu Luping Ding.
1 DCAPE: Distributed and Self-Tuned Continuous Query Processing Tim Sutherland,Bin Liu,Mariana Jbantova, and Elke A. Rundensteiner Department of Computer.
Chapter 15.7 Buffer Management ID: 219 Name: Qun Yu Class: CS Spring 2009 Instructor: Dr. T.Y.Lin.
Prefetching for Visual Data Exploration Punit R. Doshi, Elke A. Rundensteiner, Matthew O. Ward Computer Science Department Worcester Polytechnic Institute.
Operating Systems Concepts 1. A Computer Model An operating system has to deal with the fact that a computer is made up of a CPU, random access memory.
Chapter 9 Overview  Reasons to monitor SQL Server  Performance Monitoring and Tuning  Tools for Monitoring SQL Server  Common Monitoring and Tuning.
Memory Management Last Update: July 31, 2014 Memory Management1.
Chapter 3 Operating Systems Introduction to CS 1 st Semester, 2015 Sanghyun Park.
Efficient Scheduling of Heterogeneous Continuous Queries Mohamed A. Sharaf Panos K. Chrysanthis Alexandros Labrinidis Kirk Pruhs Advanced Data Management.
How to Resolve Bottlenecks and Optimize your Virtual Environment Chris Chesley, Sr. Systems Engineer
NiagaraCQ : A Scalable Continuous Query System for Internet Databases (modified slides available on course webpage) Jianjun Chen et al Computer Sciences.
Chapter 3: Operating-System Structures System Components Operating System Services System Calls System Programs System Structure Virtual Machines System.
Silberschatz, Galvin and Gagne ©2013 Operating System Concepts – 9 th Edition Chapter 8: Main Memory.
Freshness-Aware Scheduling of Continuous Queries in the Dynamic Web Mohamed A. Sharaf Alexandros Labrinidis Panos K. Chrysanthis Kirk Pruhs Advanced Data.
Chapter 1 Introduction Dr. Frank Lee. 1.1 Why Study Compiler? To write more efficient code in a high-level language To provide solid foundation in parsing.
CMPE 421 Parallel Computer Architecture
MapReduce – An overview Medha Atre (May 7, 2008) Dept of Computer Science Rensselaer Polytechnic Institute.
©Brooks/Cole, 2003 Chapter 7 Operating Systems. ©Brooks/Cole, 2003 Define the purpose and functions of an operating system. Understand the components.
Index Tuning for Adaptive Multi-Route Data Stream Systems Karen Works, Elke A. Rundensteiner, and Emmanuel Agu Database Systems Research.
1 Dynamically Adaptive Distributed System for Processing CompleX Continuous Queries Bin Liu, Yali Zhu, Mariana Jbantova, Brad Momberger, and Elke A. Rundensteiner.
Invitation to Computer Science 5 th Edition Chapter 6 An Introduction to System Software and Virtual Machine s.
Computer Science and Engineering Parallelizing Defect Detection and Categorization Using FREERIDE Leonid Glimcher P. 1 ipdps’05 Scaling and Parallelizing.
1 Elke. A. Rundensteiner Worcester Polytechnic Institute Elisa Bertino Purdue University 1 Rimma V. Nehme Microsoft.
Introduction to z/OS Basics © 2006 IBM Corporation Chapter 7: Batch processing and the Job Entry Subsystem (JES) Batch processing and JES.
1Computer Sciences Department. Book: INTRODUCTION TO THE THEORY OF COMPUTATION, SECOND EDITION, by: MICHAEL SIPSER Reference 3Computer Sciences Department.
Activatio n limit. maximum number of process instances that can concurrently be loaded into memory.
OPERATING SYSTEMS CS 3530 Summer 2014 Systems and Models Chapter 03.
Query Optimization CMPE 226 Database Systems By, Arjun Gangisetty
CS 440 Database Management Systems Lecture 5: Query Processing 1.
CS 540 Database Management Systems
ICOM 6005 – Database Management Systems Design Dr. Manuel Rodríguez-Martínez Electrical and Computer Engineering Department Lecture 7 – Buffer Management.
Automata & Formal Languages, Feodor F. Dragan, Kent State University 1 CHAPTER 3 The Church-Turing Thesis Contents Turing Machines definitions, examples,
Safety Guarantee of Continuous Join Queries over Punctuated Data Streams Hua-Gang Li *, Songting Chen, Junichi Tatemura Divykant Agrawal, K. Selcuk Candan.
Capacity Planning in a Virtual Environment Chris Chesley, Sr. Systems Engineer
COMPUTER SYSTEM FUNDAMENTAL Genetic Computer School THE PROCESSING UNIT LESSON 2.
OPERATING SYSTEMS CS 3502 Fall 2017
Module 11: File Structure
WWW and HTTP King Fahd University of Petroleum & Minerals
Data Structure Interview Question and Answers
Memory Management 6/20/ :27 PM
Intro to Processes CSSE 332 Operating Systems
Virtual Memory Networks and Communication Department.
7 Operating system Foundations of Computer Science ã Cengage Learning.
Joining Punctuated Streams
Operating Systems (CS 340 D)
Evaluating Window Joins over Punctuated Streams
Chapter 9: Virtual-Memory Management
Query Processing B.Ramamurthy Chapter 12 11/27/2018 B.Ramamurthy.
Evaluating Window Joins over Punctuated Streams
Operating Systems.
Overview Continuation from Monday (File system implementation)
Jigar.B.Katariya (08291A0531) E.Mahesh (08291A0542)
Introduction to Operating Systems
Lecture 3: Main Memory.
Query Processing.
Adaptive Query Processing (Background)
Presentation transcript:

CAPE: Continuous Query Engine with Heterogeneous-Grained Adaptivity Elke A. Rundensteiner, Luping Ding, Timothy Sutherland, Yali Zhu Brad Pielech, Nishant Mehta Natasha Bogdanova, Mariana Jbantova Department of Computer Science, Worcester Polytechnic Institute 100 Institute Road, Worcester, MA Tel: , Fax: {rundenst, lisading, tims, yaliz, winners, nishantm, natasha,

Uncertainties in Stream Query Processing Register Continuous Queries Stream Query Engine Stream Query Engine Streaming Data Streaming Result May have different QoS Requirements. May have time- varying rates and data distribution. Available resources for executing each operator may vary over time. Adaptations are required for stream query engine.

What is CAPE? C onstraint-aware A daptive Continuous Query P rocessing E ngine Exploit semantic constraints such as sliding windows and punctuations to reduce resource usage and improve response time. Incorporate heterogeneous-grained adaptivity at all query processing levels. - Adaptive query operator execution - Adaptive query plan re-optimization - Adaptive operator scheduling - Adaptive query plan distribution Process queries in a real-time manner by employing well-coordinated heterogeneous-grained adaptations.

CAPE System Architecture Distribution Manager Plan Reoptimizer Operator Scheduler Operator Configurator

PROBLEM: CAPE handles very large amounts of data, so need backup method when it runs out of memory SOLUTION: Queue Manager, which decides whether data in queue needs to go to file or remain in memory Queue Manager: Purpose

Ideally: Queue Manager: Structure Really:

Keep track of a memory threshold variable –How much memory we want to keep free –Once available memory goes below threshold, tuples are sent to disk Have an update method, which is called every time QM needs to make a decision –Ensures most recent memory info is used Use Storage Manager when tuples need to go to file to minimize I/O costs Queue Manager: Decision Making

Storage Manager is called by QM when tuples need to be written to/ read from disk (Adapted for CAPE from Nishant Mehta’s Storage Manager) Parses tuples and generates symbol trees based on schema –Side Effect: Need a new instance of Storage Manager for every schema Provides an efficient way to read/write files –Implements random access for tuple files Storage Manager: Overview

Tuples are stored in queue (main memory) until memory threshold is reached Then, tuples are written to file and a place holder is put in the queue Dequeue simply reads off the tuples from the front of the queue and from file if necessary Queue Manager: Enqueue/Dequeue

Cursors allow multiple operators to access a queue at the same time If one operator reads from file, those tuples are put in main memory so other operators do not need to read from file again Queue Manager: Cursors

CAPE Publications, TRs & URLs [RDZ04] E. A. Rundensteiner, L. Ding, Y. Zhu, T. Sutherland and B. Pielech, “CAPE: A Constraint-Aware Adaptive Stream Processing Engine”. Invited Book Chapter. July [ZRH04] Y. Zhu, E. A. Rundensteiner and G. T. Heineman, "Dynamic Plan Migration for Continuous Queries Over Data Streams”. SIGMOD 2004, pages [DMR+04] L. Ding, N. Mehta, E. A. Rundensteiner and G. T. Heineman, "Joining Punctuated Streams“. EDBT 2004, pages [DR04] L. Ding and E. A. Rundensteiner, "Evaluating Window Joins over Punctuated Streams“. CIKM 2004, to appear. [DRH03] L. Ding, E. A. Rundensteiner and G. T. Heineman, “MJoin: A Metadata-Aware Stream Join Operator”. DEBS [SPR04] T. Sutherland, B. Pielech and E. A. Rundensteiner, "Adaptive Scheduling Framework for A Continuous Query System“. Tech Report, WPI-CS-TR-04-16, [SR04] T. Sutherland and E. A. Rundensteiner, "D-CAPE: A Self-Tuning Continuous Query Plan Distribution Architecture“. Tech Report, WPI-CS-TR-04-18, CAPE Project: