Control-Based Load Shedding in Data Stream Management Yicheng Tu †, Song Liu ‡, Sunil Prabhakar †, Bin Yao ‡ † Indiana Center of Database Systems, Department.

Slides:



Advertisements
Similar presentations
Semantics and Evaluation Techniques for Window Aggregates in Data Streams Jin Li, David Maier, Kristin Tufte, Vassilis Papadimos, Peter A. Tucker SIGMOD.
Advertisements

Active Learning for Streaming Networked Data Zhilin Yang, Jie Tang, Yutao Zhang Computer Science Department, Tsinghua University.
UNIVERSAL SYSTEMS MODEL
DEXA 2005 Control-based Quality Adaptation in Data Stream Management Systems (DSMS) Yicheng Tu†, Mohamed Hefeeda‡, Yuni Xia†, Sunil Prabhakar†, and Song.
The Design of the Borealis Stream Processing Engine Daniel J. Abadi1, Yanif Ahmad2, Magdalena Balazinska1, Ug ̆ur C ̧ etintemel2, Mitch Cherniack3, Jeong-Hyon.
Backlog Estimation and Management for Real-Time Data Services Kyoung-Don Kang, Jisu Oh, and Yan Zhou Department of Computer Science State University of.
Design of LFC using Optimal Control Theory The optimal controller is designed to minimize the quadratic performance index of the following form For linear.
The Design of the Borealis Stream Processing Engine Brandeis University, Brown University, MIT Magdalena BalazinskaNesime Tatbul MIT Brown.
Caching the MDSPlus Data via Hibernate By Ajith M Jose Comp6703 Project Client: Raju Karia Supervisor: Dr. Henry Gardner (Development of “WebScope”)
Scaling up a Web-Based Intelligent Tutoring System Jozsef Patvarczki, Shane Almeida, and Neil Heffernan Computer Science Department Our research team has.
Chapter Summer 2. Comparator 3. Block Blocks in Series
Differentiated Multimedia Web Services Using Quality Aware Transcoding S. Chandra, C.Schlatter Ellis and A.Vahdat InfoCom 2000, IEEE Journal on Selected.
1 Wavelet synopses with Error Guarantees Minos Garofalakis Phillip B. Gibbons Information Sciences Research Center Bell Labs, Lucent Technologies Murray.
Chapter 11 1 Closed-Loop Responses of Simple Control Systems In this section we consider the dynamic behavior of several elementary control problems for.
Bandwidth Allocation in a Self-Managing Multimedia File Server Vijay Sundaram and Prashant Shenoy Department of Computer Science University of Massachusetts.
Applying Control Theory to Stream Processing Systems Wei Xu Bill Kramer Joe Hellerstein.
Energy-efficient Self-adapting Online Linear Forecasting for Wireless Sensor Network Applications Jai-Jin Lim and Kang G. Shin Real-Time Computing Laboratory,
CH 1 Introduction Prof. Ming-Shaung Ju Dept. of Mechanical Engineering NCKU.
Software Verification and Validation (V&V) By Roger U. Fujii Presented by Donovan Faustino.
Greenbench: A Benchmark for Observing Power Grid Vulnerability Under Data-Centric Threats Mingkui Wei, Wenye Wang Department of Electrical and Computer.
CSE 425: Industrial Process Control 1. About the course Lect.TuLabTotal Semester work 80Final 125Total Grading Scheme Course webpage:
Computer System Lifecycle Chapter 1. Introduction Computer System users, administrators, and designers are all interested in performance evaluation. Whether.
Simple Interface for Polite Computing (SIPC) Travis Finch St. Edward’s University Department of Computer Science, School of Natural Sciences Austin, TX.
Naixue GSU Slide 1 ICVCI’09 Oct. 22, 2009 A Multi-Cloud Computing Scheme for Sharing Computing Resources to Satisfy Local Cloud User Requirements.
Adaptive Control of Virtualized Resources in Utility Computing Environments HP Labs: Xiaoyun Zhu, Mustafa Uysal, Zhikui Wang, Sharad Singhal University.
Efficient Scheduling of Heterogeneous Continuous Queries Mohamed A. Sharaf Panos K. Chrysanthis Alexandros Labrinidis Kirk Pruhs Advanced Data Management.
Linear System Theory Instructor: Zhenhua Li Associate Professor Mobile : School of Control Science and Engineering, Shandong.
Foundations of Technology, Third Edition/ Technology, Engineering, and Design © 2011 International Technology and Engineering Educators Association STEM.
An Integration Framework for Sensor Networks and Data Stream Management Systems.
Final Exam, May 25, 2007 Quality Management in Multimedia Databases and Data Stream Management Systems Yicheng Tu Department of Computer Sciences Purdue.
EmNet: Satisfying The Individual User Through Empathic Home Networks J. Scott Miller, John R. Lange & Peter A. Dinda Department of Electrical Engineering.
Join Synopses for Approximate Query Answering Swarup Achrya Philip B. Gibbons Viswanath Poosala Sridhar Ramaswamy Presented by Bhushan Pachpande.
Budget-based Control for Interactive Services with Partial Execution 1 Yuxiong He, Zihao Ye, Qiang Fu, Sameh Elnikety Microsoft Research.
A new model and architecture for data stream management.
U N I V E R S I T Y O F S O U T H F L O R I D A Database-centric Data Analysis of Molecular Simulations Yicheng Tu *, Sagar Pandit §, Ivan Dyedov *, and.
Smita Vijayakumar Qian Zhu Gagan Agrawal 1.  Background  Data Streams  Virtualization  Dynamic Resource Allocation  Accuracy Adaptation  Research.
University of Catania Computer Engineering Department 1 Educational tools for complex topics: a case study for Network Based Control Systems Prof. Orazio.
11 Experimental and Analytical Evaluation of Available Bandwidth Estimation Tools Cesar D. Guerrero and Miguel A. Labrador Department of Computer Science.
U N I V E R S I T Y O F S O U T H F L O R I D A The basic idea is to start from a difference equation with unknown parameters and orders in the following.
Unit 2 Architectural Styles and Case Studies | Website for Students | VTU NOTES | QUESTION PAPERS | NEWS | RESULTS 1.
Load Shedding Techniques for Data Stream Systems Brian Babcock Mayur Datar Rajeev Motwani Stanford University.
Mechanical Engineering Department Automatic Control Dr. Talal Mandourah 1 Lecture 1 Automatic Control Applications: Missile control Behavior control Aircraft.
Load Shedding in Stream Databases – A Control-Based Approach Yicheng Tu, Song Liu, Sunil Prabhakar, and Bin Yao Department of Computer Science, Purdue.
X. Li, W. LiuICC May 11, 2003A Joint Layer Design Smart Contention Resolution Random Access Wireless Networks With Unknown Multiple Users: A Joint.
Chapter 20 1 Overall Objectives of Model Predictive Control 1.Prevent violations of input and output constraints. 2.Drive some output variables to their.
University of Virginia PID Controllers Jack Stankovic University of Virginia Spring 2015.
A new model and architecture for data stream management.
Date: 2012/08/21 Source: Zhong Zeng, Zhifeng Bao, Tok Wang Ling, Mong Li Lee (KEYS’12) Speaker: Er-Gang Liu Advisor: Dr. Jia-ling Koh 1.
Aurora: a new model and architecture for data stream management Daniel J. Abadi 1, Don Carney 2, Ugur Cetintemel 2, Mitch Cherniack 1, Christian Convey.
Graph Data Management Lab, School of Computer Science Branch Code: A Labeling Scheme for Efficient Query Answering on Tree
Predictive Application- Performance Modeling in a Computational Grid Environment (HPDC ‘99) Nirav Kapadia, José Fortes, Carla Brodley ECE, Purdue Presented.
Control-Based Load Shedding in Data Stream Management Systems Yicheng Tu and Sunil Prabhakar Department of Computer Sciences, Purdue University April 3,
UNIT: User-ceNtrIc Transaction Management in Web-Database Systems Huiming Qu, Alexandros Labrinidis, Daniel Mosse Advanced Data Management Technologies.
Learning Photographic Global Tonal Adjustment with a Database of Input / Output Image Pairs.
Automatic Control Theory School of Automation NWPU Teaching Group of Automatic Control Theory.
Instance Discovery and Schema Matching With Applications to Biological Deep Web Data Integration Tantan Liu, Fan Wang, Gagan Agrawal {liut, wangfa,
Control-based Quality Adaptation in Data Stream Management Systems (DSMS) Yicheng Tu†, Song Liu‡, Sunil Prabhakar†, and Bin Yao‡ † Department of Computer.
Control-Based Load Shedding in Data Stream Management Systems Yicheng Tu and Sunil Prabhakar Department of Computer Sciences, Purdue University April 3,
Written By: Presented By: Swarup Acharya,Amr Elkhatib Phillip B. Gibbons, Viswanath Poosala, Sridhar Ramaswamy Join Synopses for Approximate Query Answering.
SRD-DFA Achieving Sub-Rule Distinguishing with Extended DFA Structure Author: Gao Xia, Xiaofei Wang, Bin Liu Publisher: IEEE DASC (International Conference.
Quality management in database systems A thesis proposal Yicheng Tu January 24, 2006 Advisor: Prof. Sunil Prabhakar.
SKEE 3143 Control Systems Design Chapter 2 – PID Controllers Design
A Framework for Automatic Resource and Accuracy Management in A Cloud Environment Smita Vijayakumar.
Data Stream Management System (DSMS)
Load Shedding Techniques for Data Stream Systems
Load Shedding in Stream Databases – A Control-Based Approach
Smita Vijayakumar Qian Zhu Gagan Agrawal
National University of Laos
Autonomic Workload Execution Control Using Throttling
Actively Learning Ontology Matching via User Interaction
Presentation transcript:

Control-Based Load Shedding in Data Stream Management Yicheng Tu †, Song Liu ‡, Sunil Prabhakar †, Bin Yao ‡ † Indiana Center of Database Systems, Department of Computer Sciences, 305 N. University Street, West Lafayette, IN ‡ School of Mechanical Engineering, 140 S. Intramural Drive, West Lafayette, IN Introduction Data Stream Management Systems (DSMSs) process large number of data streams to answer user-specified queries. These systems are generally built following a query-passive data-active model, in which all data are pushed to the database server for processing and query results are sent to the users continuously. Data processing delay is critical in DSMSs since query results generated from old data are useless to users. In case of overloading, data tuples have to be discarded without processing in order to achieve desired processing delay. This is called load shedding. Key Questions: When? How much? Where? We focus on the first two questions. Acknowledgements This is joint work with my advisor, Prof. Sunil Prabhakar Dr. Song Liu and Prof. Bin Yao of the School of Mechanical Engineering in Purdue University. The author would also like to thank Ms. Nesime Tatbul and Prof. Ston Zdonik, both from the Computer Science department of Brown University, for providing the Aurora/Borealis source code. Figure 1. Pushed-based DSMS system model. Objective To design and implement a load shedding framework that minimize the data loss; maintains processing delays in rejection to disturbances: - bursty data arrivals; - internal dynamics of DSMS. is robust, i.e., works for a wide range of input streams. Figure 2. Examples of disturbances in data processing in DSMS. Top: bursty arrival rates; Bottom: unit processing costs. Our approach - View it as a feedback control problem - Develop a dynamic model for a specific DSMS - Design controller via rigorous control-theoretical methods - Work on a real DSMS – the open-source Borealis system Figure 3. The feedback control loop for load shedding. Output ( y ): average tuple delay; Input ( u ): tuple injection rate to DSMS; target delay value ( y r ) and control error ( e ). Results - Obtained a first-order linear model for Borealis - Pole placement-based design ended up a PD controller: where c and H are system-specific constants and T is the control period. - Identified and solved several DSMS-specific problems - Control framework evaluated with real and synthetic data Figure 4. Performance of our load shedding solution (CTRL), AURORA, an open-loop solution that represents state-of-the- art in DSMS load shedding, and BASELINE, a naïve feedback-based solution. Figure 5. Relative performance of CTRL to AURORA and BASELINE. A, B, C: various aspects of delay violations; D: percentage of data discarded. Figure 6. Robustness of CTRL and AURORA tested with input streams of different burstiness (smaller bias factor represents more bursty stream). Conclusions 1. First database work that uses feedback-control- theoretical methods; 2. Rigorous system modeling and controller design generate a PD controller that controls average tuple delays by adjusting the amount of load shedding; 3. Control framework implemented and evaluated in real DSMS. Experiments show that feedback-control-based method significantly improves control of delays with the same amount of data loss as compared to current solutions. 4.The above solution is also robust.