Freshness-Aware Scheduling of Continuous Queries in the Dynamic Web Mohamed A. Sharaf Alexandros Labrinidis Panos K. Chrysanthis Kirk Pruhs Advanced Data.

Slides:



Advertisements
Similar presentations
Pricing for Utility-driven Resource Management and Allocation in Clusters Chee Shin Yeo and Rajkumar Buyya Grid Computing and Distributed Systems (GRIDS)
Advertisements

Liz Marai 07/08/08 Data Management and Visualization at Pitt CS Liz Marai Pitt Computer Science CMU Robotics Institute (adj) LSST, 08/07/2008.
Analysis of : Operator Scheduling in a Data Stream Manager CS561 – Advanced Database Systems By Eric Bloom.
Efficient Evaluation of k-Range Nearest Neighbor Queries in Road Networks Jie BaoChi-Yin ChowMohamed F. Mokbel Department of Computer Science and Engineering.
School of Computer Science and Engineering Finding Top k Most Influential Spatial Facilities over Uncertain Objects Liming Zhan Ying Zhang Wenjie Zhang.
Rensselaer Polytechnic Institute CSCI-4210 – Operating Systems David Goldschmidt, Ph.D.
1 Uniprocessor Scheduling Types of scheduling –The aim of processor scheduling is to assign processes to be executed by the processor so as to optimize.
Rensselaer Polytechnic Institute CSCI-4210 – Operating Systems David Goldschmidt, Ph.D.
1 Size-Based Scheduling Policies with Inaccurate Scheduling Information Dong Lu *, Huanyuan Sheng +, Peter A. Dinda * * Prescience Lab, Dept. of Computer.
Silberschatz, Galvin and Gagne  2002 Modified for CSCI 399, Royden, Operating System Concepts Operating Systems Lecture 19 Scheduling IV.
Static Optimization of Conjunctive Queries with Sliding Windows over Infinite Streams Presented by: Andy Mason and Sheng Zhong Ahmed M.Ayad and Jeffrey.
Operating System Concepts with Java – 7 th Edition, Nov 15, 2006 Silberschatz, Galvin and Gagne ©2007 Processes and Their Scheduling.
© Ibrahim Korpeoglu Bilkent University
Operating System Process Scheduling (Ch 4.2, )
Operating System I Process Scheduling. Schedulers F Short-Term –“Which process gets the CPU?” –Fast, since once per 100 ms F Long-Term (batch) –“Which.
Scheduling in Batch Systems
Service Differentiated Peer Selection An Incentive Mechanism for Peer-to-Peer Media Streaming Ahsan Habib, Member, IEEE, and John Chuang, Member, IEEE.
CS 3013 & CS 502 Summer 2006 Scheduling1 The art and science of allocating the CPU and other resources to processes.
Chapter 6: CPU Scheduling. 5.2 Silberschatz, Galvin and Gagne ©2005 Operating System Concepts – 7 th Edition, Feb 2, 2005 Chapter 6: CPU Scheduling Basic.
Silberschatz, Galvin and Gagne  Operating System Concepts Chapter 6: CPU Scheduling Basic Concepts Scheduling Criteria Scheduling Algorithms.
Cs238 CPU Scheduling Dr. Alan R. Davis. CPU Scheduling The objective of multiprogramming is to have some process running at all times, to maximize CPU.
6/25/2015Page 1 Process Scheduling B.Ramamurthy. 6/25/2015Page 2 Introduction An important aspect of multiprogramming is scheduling. The resources that.
Chain: Operator Scheduling for Memory Minimization in Data Stream Systems Authors: Brian Babcock, Shivnath Babu, Mayur Datar, and Rajeev Motwani (Dept.
CS 104 Introduction to Computer Science and Graphics Problems Operating Systems (2) Process Management 10/03/2008 Yang Song (Prepared by Yang Song and.
An Adaptive Multi-Objective Scheduling Selection Framework For Continuous Query Processing Timothy M. Sutherland Bradford Pielech Yali Zhu Luping Ding.
Job scheduling Queue discipline.
Operating Systems Process Scheduling (Ch 4.2, )
Bandwidth Allocation in a Self-Managing Multimedia File Server Vijay Sundaram and Prashant Shenoy Department of Computer Science University of Massachusetts.
U NIVERSITY OF M ASSACHUSETTS, A MHERST Department of Computer Science Emery Berger University of Massachusetts, Amherst Operating Systems CMPSCI 377 Lecture.
Operating System Process Scheduling (Ch 4.2, )
Rensselaer Polytechnic Institute CSCI-4210 – Operating Systems David Goldschmidt, Ph.D.
Efficient Scheduling of Heterogeneous Continuous Queries Mohamed A. Sharaf Panos K. Chrysanthis Alexandros Labrinidis Kirk Pruhs Advanced Data Management.
Chapter 6: CPU Scheduling
Computer Architecture and Operating Systems CS 3230: Operating System Section Lecture OS-3 CPU Scheduling Department of Computer Science and Software Engineering.
COT 4600 Operating Systems Spring 2011 Dan C. Marinescu Office: HEC 304 Office hours: Tu-Th 5:00-6:00 PM.
Load Balancing of In-Network Data-Centric Storage Schemes in Sensor Networks Mohamed Aly In collaboration with Kirk Pruhs and Panos K. Chrysanthis Advanced.
Spring Efficient Dissemination of Enterprise Summary Data to Mobile Clients Mohamed A. Sharaf University of Pittsburgh.
Preference-Aware Query and Update Scheduling in Web-databases Huiming Qu Department of Computer Science University of Pittsburgh Joint work with Prof.
Enabling Class of Service for CIOQ Switches with Maximal Weighted Algorithms Thursday, October 08, 2015 Feng Wang Siu Hong Yuen.
CPU Scheduling CSCI 444/544 Operating Systems Fall 2008.
Silberschatz and Galvin  Operating System Concepts Module 5: CPU Scheduling Basic Concepts Scheduling Criteria Scheduling Algorithms Multiple-Processor.
1 Our focus  scheduling a single CPU among all the processes in the system  Key Criteria: Maximize CPU utilization Maximize throughput Minimize waiting.
Zone Sharing: A Hot-Spots Decomposition Scheme for Data-Centric Storage in Sensor Networks Mohamed Aly Nicholas Morsillo Panos K. Chrysanthis Kirk Pruhs.
Operating Systems 1 K. Salah Module 2.2: CPU Scheduling Scheduling Types Scheduling Criteria Scheduling Algorithms Performance Evaluation.
1 11/29/2015 Chapter 6: CPU Scheduling l Basic Concepts l Scheduling Criteria l Scheduling Algorithms l Multiple-Processor Scheduling l Real-Time Scheduling.
ITFN 2601 Introduction to Operating Systems Lecture 4 Scheduling.
Silberschatz and Galvin  Operating System Concepts Module 5: CPU Scheduling Basic Concepts Scheduling Criteria Scheduling Algorithms Multiple-Processor.
1 CS.217 Operating System By Ajarn..Sutapart Sappajak,METC,MSIT Chapter 5 CPU Scheduling Slide 1 Chapter 5 CPU Scheduling.
1 Adaptive Parallelism for Web Search Myeongjae Jeon Rice University In collaboration with Yuxiong He (MSR), Sameh Elnikety (MSR), Alan L. Cox (Rice),
Adaptivity in continuous query systems Luis A. Sotomayor & Zhiguo Xu Professor Carlo Zaniolo CS240B - Spring 2003.
Dr. Anis Koubâa CS433 Modeling and Simulation
CS333 Intro to Operating Systems Jonathan Walpole.
UNIT: User-ceNtrIc Transaction Management in Web-Database Systems Huiming Qu, Alexandros Labrinidis, Daniel Mosse Advanced Data Management Technologies.
Rate-Based Query Optimization for Streaming Information Sources Stratis D. Viglas Jeffrey F. Naughton.
Lecture 4 CPU scheduling. Basic Concepts Single Process  one process at a time Maximum CPU utilization obtained with multiprogramming CPU idle :waiting.
1 Performance Impact of Resource Provisioning on Workflows Gurmeet Singh, Carl Kesselman and Ewa Deelman Information Science Institute University of Southern.
CPU Scheduling G.Anuradha Reference : Galvin. CPU Scheduling Basic Concepts Scheduling Criteria Scheduling Algorithms Multiple-Processor Scheduling Real-Time.
CPU scheduling.  Single Process  one process at a time  Maximum CPU utilization obtained with multiprogramming  CPU idle :waiting time is wasted 2.
Basic Concepts Maximum CPU utilization obtained with multiprogramming
Lecturer 5: Process Scheduling Process Scheduling  Criteria & Objectives Types of Scheduling  Long term  Medium term  Short term CPU Scheduling Algorithms.
INTRO TO PROCESS SCHEDULING Module 2.4 COP4600 – Operating Systems Richard Newman.
OPERATING SYSTEMS CS 3502 Fall 2017
The Impact of Replacement Granularity on Video Caching
CPU Scheduling G.Anuradha
Brian Babcock, Shivnath Babu, Mayur Datar, and Rajeev Motwani
Query Optimization Minimizing Memory and Latency in DSMS
Process Scheduling B.Ramamurthy 4/11/2019.
Process Scheduling B.Ramamurthy 4/7/2019.
Uniprocessor scheduling
Don Porter Portions courtesy Emmett Witchel
Presentation transcript:

Freshness-Aware Scheduling of Continuous Queries in the Dynamic Web Mohamed A. Sharaf Alexandros Labrinidis Panos K. Chrysanthis Kirk Pruhs Advanced Data Management Technologies Lab Department of Computer Science University of Pittsburgh HDMS 2005

HDMS  Athens, Greece -  Sharaf, Labrinidis, Chrysanthis, Pruhs University of Pittsburgh 2 Motivation Price < $5

HDMS  Athens, Greece -  Sharaf, Labrinidis, Chrysanthis, Pruhs University of Pittsburgh 3 Motivation Web Server Wish List 1 Price < $5 Title = Star Wars I Price < $5 TP T Wish List 2 P Price < $7 Input data streams Price = $8   Price = $4  Price = $8Price = $4Price = $2  Price < $5  Price = $5

HDMS  Athens, Greece -  Sharaf, Labrinidis, Chrysanthis, Pruhs University of Pittsburgh 4 Continuous Queries A continuous query is a long-running standing query whose execution is triggered with the arrival of new data The output of a continuous query is a continuous data stream (e.g., s or update personalized Web page) A Web server should propagate updates to users as soon as they become available, however delays occur due to: 1) Time spent by a query processing updates 2) Time spent by a query waiting to be executed

HDMS  Athens, Greece -  Sharaf, Labrinidis, Chrysanthis, Pruhs University of Pittsburgh 5 Scheduling Multiple Continuous Queries The execution order of continuous queries determines the overall behavior of the system. For example: In Aurora: the Minimum Latency scheduler reduces response time [Carney et. al., VLDB’03] In STREAM: the Chain scheduler minimizes memory usage [Babcock et. al., SIGMOD’03] Problem Statement: Devise a policy for scheduling the execution of multiple continuous queries (MCQ) with the objective of maximizing the overall quality of data (QoD) of output data streams

HDMS  Athens, Greece -  Sharaf, Labrinidis, Chrysanthis, Pruhs University of Pittsburgh 6 Outline Motivation Quality of Data (QoD) Freshness-Aware Scheduling of MCQ (FAS-MCQ) Experimental Evaluation Conclusions and Future Work

HDMS  Athens, Greece -  Sharaf, Labrinidis, Chrysanthis, Pruhs University of Pittsburgh 7 Quality of Data QoD based on freshness (deviation from the ideal) At any time instance, the output data stream is fresh when it matches the ideal one, otherwise it is stale TxTx TyTy t1t1 t2t2 t3t3 Delay (t i )= wait time + processing time Freshness = 1- (t 1 + t 2 + t 3 )/(T y - T x ) Ideal output data stream Actual output data stream Fresh Stale

HDMS  Athens, Greece -  Sharaf, Labrinidis, Chrysanthis, Pruhs University of Pittsburgh 8 Outline Motivation Quality of Data (QoD) Freshness-Aware Scheduling of MCQ (FAS-MCQ) Experimental Evaluation Conclusions and Future Work

HDMS  Athens, Greece -  Sharaf, Labrinidis, Chrysanthis, Pruhs University of Pittsburgh 9 How updates are processed Q1Q1 N1N1 W1W1 C1C1 S 1 =1 Cost of executing operator Q1 Selectivity of operator Q1 Wait time of top update in input queue for operator Q1 Number of pending updates Incoming updates Operator Q1

HDMS  Athens, Greece -  Sharaf, Labrinidis, Chrysanthis, Pruhs University of Pittsburgh 10 Freshness-Aware Scheduling of MCQ (FAS-MCQ) Compute loss in freshness (L) under two policies: Policy X: first Q 1 then Q 2 Policy Y: first Q 2 then Q 1 L X = (W 1 + N 1 *C 1 ) + (W 2 + N 2 *C 2 + N 1 *C 1 ) Current loss in freshness Loss until processing pending tuples Q 1 ’s loss Loss due to waiting for Q 1 to finish execution Q 2 ’s loss Q1Q1 N1N1 W1W1 C1C1 Q2Q2 N2N2 W2W2 C2C2 S 1 =1S 2 =1

HDMS  Athens, Greece -  Sharaf, Labrinidis, Chrysanthis, Pruhs University of Pittsburgh 11 Freshness-Aware Scheduling of MCQs Under policy X: first Q 1 then Q 2 L X = (W 1 + N 1 *C 1 ) + (W 2 + N 1 *C 1 + N 2 *C 2 ) Under policy Y: first Q 2 then Q 1 L Y = (W 2 + N 2 *C 2 ) + (W 1 + N 2 *C 2 + N 1 *C 1 ) For L X N 1 * C 1 < N 2 * C 2 Q1Q1 N1N1 W1W1 C1C1 Q2Q2 N2N2 W2W2 C2C2 Priority of Q i = 1/(N i * C i ) S 1 =1S 2 =1

HDMS  Athens, Greece -  Sharaf, Labrinidis, Chrysanthis, Pruhs University of Pittsburgh 12 Impact of Selectivity A query is a tree of operators Each query operator is associated with: Cost (c): processing time Selectivity (s): probability of producing an output after processing an input update Maximum cost: C = c 1 + c 2 + c 3 Total selectivity: S = s 1 * s 2 * s 3 Average/Expected Cost: C avg = c 1 + (c 2 * s 1 ) + (c 3 * s 1 * s 2 ) 1 2 3

HDMS  Athens, Greece -  Sharaf, Labrinidis, Chrysanthis, Pruhs University of Pittsburgh 13 Selectivity-Aware FAS-MCQ Priority of Q i = 1/(N i * C i avg ) But we need to consider selectivity: If S i = 0, no appending and the output data stream is fresh If S i = 1, appending and the output data stream is stale Compute the staleness probability (P i ) = 1 - (1-S i ) N i Q1Q1 N1N1 W1W1 C 1 avg Q2Q2 N2N2 W2W2 C 2 avg S1S1 S2S2 Priority of Q i = P i / (N i * C i avg )

HDMS  Athens, Greece -  Sharaf, Labrinidis, Chrysanthis, Pruhs University of Pittsburgh 14 Summary Priority of Q i = P i / (N i * C i avg ) FAS-MCQ behaves as follows: If all queries have the same P (staleness probability) and N (number of pending updates), then FAS-MCQ selects the query with the lowest cost If all queries have the same P and C (expected cost) then FAS-MCQ selects the query with the lowest number of pending updates If all queries have the same N and C, then FAS-MCQ selects the query with the highest staleness probability

HDMS  Athens, Greece -  Sharaf, Labrinidis, Chrysanthis, Pruhs University of Pittsburgh 15 Intuitions underlying FAS-MCQ Priority of Q i = P i / (N i * C i avg ) The priority of a query increases if it has: Small processing cost (C i ), Small number of pending updates (N i ), High staleness probability (P i )

HDMS  Athens, Greece -  Sharaf, Labrinidis, Chrysanthis, Pruhs University of Pittsburgh 16 Outline Motivation Quality of Data (QoD) Freshness-Aware Scheduling of MCQ (FAS-MCQ) Experimental Evaluation Conclusions and Future Work

HDMS  Athens, Greece -  Sharaf, Labrinidis, Chrysanthis, Pruhs University of Pittsburgh 17 Simulation Testbed Simulated the execution of 250 continuous queries with variable costs and selectivities 10 input data streams following Poisson distribution Half of the input streams are bursty Experiments to show: Impact of utilization, Impact of Selectivity, (Impact of burstiness), Fairness

HDMS  Athens, Greece -  Sharaf, Labrinidis, Chrysanthis, Pruhs University of Pittsburgh 18 Scheduling Algorithms FAS-MCQ: Freshness-Aware Scheduling of Multiple Continuous Queries RR: Round-Robin (Aurora) SRPT: Shortest Remaining Processsing Time FCFS: First-Come First-Served

HDMS  Athens, Greece -  Sharaf, Labrinidis, Chrysanthis, Pruhs University of Pittsburgh 19 Impact of Utilization (Selectivity = 1) 22% Quality of Data

HDMS  Athens, Greece -  Sharaf, Labrinidis, Chrysanthis, Pruhs University of Pittsburgh 20 Impact of Selectivity 50% Quality of Data

HDMS  Athens, Greece -  Sharaf, Labrinidis, Chrysanthis, Pruhs University of Pittsburgh 21 Fairness Standard Deviation for QoD

HDMS  Athens, Greece -  Sharaf, Labrinidis, Chrysanthis, Pruhs University of Pittsburgh 22 Conclusions We proposed a policy for freshness-aware scheduling of multiple continuous queries. Our policy exploits the properties of continuous queries (i.e., cost and selectivity) as well as the properties of input data streams (variability of updates) We showed experimentally that our proposed policy outperforms the traditional scheduling policies Future: study multi-stream queries and different definitions for QoD

HDMS  Athens, Greece -  Sharaf, Labrinidis, Chrysanthis, Pruhs University of Pittsburgh 23   Advanced Data Management Technologies Lab