Adaptive Stream Processing using Dynamic Batch Sizing Tathagata Das, Yuan Zhong, Ion Stoica, Scott Shenker.

Slides:



Advertisements
Similar presentations
Feedback Control Real-Time Scheduling: Framework, Modeling, and Algorithms Chenyang Lu, John A. Stankovic, Gang Tao, Sang H. Son Presented by Josh Carl.
Advertisements

Unsupervised Learning Clustering K-Means. Recall: Key Components of Intelligent Agents Representation Language: Graph, Bayes Nets, Linear functions Inference.
Thank you for your introduction.
Esma Yildirim Department of Computer Engineering Fatih University Istanbul, Turkey DATACLOUD 2013.
Matei Zaharia, Mosharaf Chowdhury, Tathagata Das, Ankur Dave, Justin Ma, Murphy McCauley, Michael Franklin, Scott Shenker, Ion Stoica Spark Fast, Interactive,
Discretized Streams: Fault-Tolerant Streaming Computation at Scale Wenting Wang 1.
Discretized Streams Fault-Tolerant Streaming Computation at Scale Matei Zaharia, Tathagata Das (TD), Haoyuan (HY) Li, Timothy Hunter, Scott Shenker, Ion.
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: The FIR Adaptive Filter The LMS Adaptive Filter Stability and Convergence.
Discretized Streams An Efficient and Fault-Tolerant Model for Stream Processing on Large Clusters Matei Zaharia, Tathagata Das, Haoyuan Li, Scott Shenker,
Gossip Scheduling for Periodic Streams in Ad-hoc WSNs Ercan Ucan, Nathanael Thompson, Indranil Gupta Department of Computer Science University of Illinois.
Advanced Computer Networking Congestion Control for High Bandwidth-Delay Product Environments (XCP Algorithm) 1.
Cost-based Workload Balancing for Ray Tracing on a Heterogeneous Platform Mario Rincón-Nigro PhD Showcase Feb 17 th, 2012.
Efficient Autoscaling in the Cloud using Predictive Models for Workload Forecasting Roy, N., A. Dubey, and A. Gokhale 4th IEEE International Conference.
Applying Genetic Algorithms to Decision Making in Autonomic Computing Systems Authors: Andres J. Ramirez, David B. Knoester, Betty H.C. Cheng, Philip K.
Karl Schnaitter and Neoklis Polyzotis (UC Santa Cruz) Serge Abiteboul (INRIA and University of Paris 11) Tova Milo (University of Tel Aviv) Automatic Index.
Mesos A Platform for Fine-Grained Resource Sharing in Data Centers Benjamin Hindman, Andy Konwinski, Matei Zaharia, Ali Ghodsi, Anthony D. Joseph, Randy.
AQM for Congestion Control1 A Study of Active Queue Management for Congestion Control Victor Firoiu Marty Borden.
Fault-tolerant Adaptive Divisible Load Scheduling Xuan Lin, Sumanth J. V. Acknowledge: a few slides of DLT are from Thomas Robertazzi ’ s presentation.
Building a Controlled Delay Assured Forwarding Class in DiffServ Networks Parag Kulkarni Nazeeruddin Mohammad Sally McClean Gerard Parr Michaela Black.
Goals of Adaptive Signal Processing Design algorithms that learn from training data Algorithms must have good properties: attain good solutions, simple.
1 Introduction to Load Balancing: l Definition of Distributed systems. Collection of independent loosely coupled computing resources. l Load Balancing.
Present by Chen, Ting-Wei Adaptive Task Checkpointing and Replication: Toward Efficient Fault-Tolerant Grids Maria Chtepen, Filip H.A. Claeys, Bart Dhoedt,
Effective Gaussian mixture learning for video background subtraction Dar-Shyang Lee, Member, IEEE.
RRAPID: Real-time Recovery based on Active Probing, Introspection, and Decentralization Takashi Suzuki Matthew Caesar.
The Power of Choice in Data-Aware Cluster Scheduling
Storage in Big Data Systems
Routing Protocol Evaluation David Holmer
A Distributed Clustering Framework for MANETS Mohit Garg, IIT Bombay RK Shyamasundar School of Tech. & Computer Science Tata Institute of Fundamental Research.
A Framework for Elastic Execution of Existing MPI Programs Aarthi Raveendran Tekin Bicer Gagan Agrawal 1.
FPGA-Based System Design: Chapter 6 Copyright  2004 Prentice Hall PTR Topics n Design methodologies.
A Framework for Elastic Execution of Existing MPI Programs Aarthi Raveendran Graduate Student Department Of CSE 1.
Mesos A Platform for Fine-Grained Resource Sharing in the Data Center Benjamin Hindman, Andy Konwinski, Matei Zaharia, Ali Ghodsi, Anthony Joseph, Randy.
Chih-Ming Chen, Student Member, IEEE, Ying-ping Chen, Member, IEEE, Tzu-Ching Shen, and John K. Zao, Senior Member, IEEE Evolutionary Computation (CEC),
Smita Vijayakumar Qian Zhu Gagan Agrawal 1.  Background  Data Streams  Virtualization  Dynamic Resource Allocation  Accuracy Adaptation  Research.
Applying Control Theory to the Caches of Multiprocessors Department of EECS University of Tennessee, Knoxville Kai Ma.
High-speed TCP  FAST TCP: motivation, architecture, algorithms, performance (by Cheng Jin, David X. Wei and Steven H. Low)  Modifying TCP's Congestion.
Operating Systems David Goldschmidt, Ph.D. Computer Science The College of Saint Rose CIS 432.
VGreen: A System for Energy Efficient Manager in Virtualized Environments G. Dhiman, G Marchetti, T Rosing ISLPED 2009.
MROrder: Flexible Job Ordering Optimization for Online MapReduce Workloads School of Computer Engineering Nanyang Technological University 30 th Aug 2013.
Lecture 5 - Single Variable Problems CVEN 302 June 12, 2002.
Eddies: Continuously Adaptive Query Processing Ross Rosemark.
Resilient Distributed Datasets: A Fault- Tolerant Abstraction for In-Memory Cluster Computing Matei Zaharia, Mosharaf Chowdhury, Tathagata Das, Ankur Dave,
ICDCS 2014 Madrid, Spain 30 June-3 July 2014
Version 1.1 Improving our knowledge of metaheuristic approaches for cell suppression problem Andrea Toniolo Staggemeier Alistair R. Clark James Smith Jonathan.
Adaptive Sleep Scheduling for Energy-efficient Movement-predicted Wireless Communication David K. Y. Yau Purdue University Department of Computer Science.
1 Adaptive Parallelism for Web Search Myeongjae Jeon Rice University In collaboration with Yuxiong He (MSR), Sameh Elnikety (MSR), Alan L. Cox (Rice),
A Platform for Fine-Grained Resource Sharing in the Data Center
For a good summary, visit:
1 Sheer volume and dynamic nature of video stresses network resources PIE: A lightweight latency control to address the buffer problem issue Rong Pan,
Querying the Internet with PIER CS294-4 Paul Burstein 11/10/2003.
Equalizer: Dynamically Tuning GPU Resources for Efficient Execution Ankit Sethia* Scott Mahlke University of Michigan.
1 A Grid-Based Middleware’s Support for Processing Distributed Data Streams Liang Chen Advisor: Gagan Agrawal Computer Science & Engineering.
Optimization-based Cross-Layer Design in Networked Control Systems Jia Bai, Emeka P. Eyisi Yuan Xue and Xenofon D. Koutsoukos.
PACMan: Coordinated Memory Caching for Parallel Jobs Ganesh Ananthanarayanan, Ali Ghodsi, Andrew Wang, Dhruba Borthakur, Srikanth Kandula, Scott Shenker,
Dynamic Resource Allocation for Shared Data Centers Using Online Measurements By- Abhishek Chandra, Weibo Gong and Prashant Shenoy.
Accelerating K-Means Clustering with Parallel Implementations and GPU Computing Janki Bhimani Miriam Leeser Ningfang Mi
Online Parameter Optimization for Elastic Data Stream Processing Thomas Heinze, Lars Roediger, Yuanzhen Ji, Zbigniew Jerzak (SAP SE) Andreas Meister (University.
Resilient Distributed Datasets A Fault-Tolerant Abstraction for In-Memory Cluster Computing Matei Zaharia, Mosharaf Chowdhury, Tathagata Das, Ankur Dave,
Optimizing Distributed Actor Systems for Dynamic Interactive Services
Introduction to Load Balancing:
Group 4 Anders Linseth Magdalena Lipska Nicolai Friis Tobias Melcher
PA an Coordinated Memory Caching for Parallel Jobs
Liang Chen Advisor: Gagan Agrawal Computer Science & Engineering
A Framework for Automatic Resource and Accuracy Management in A Cloud Environment Smita Vijayakumar.
CPU Scheduling G.Anuradha
Smita Vijayakumar Qian Zhu Gagan Agrawal
Discretized Streams: A Fault-Tolerant Model for Scalable Stream Processing Zaharia, et al (2012)
Cloud Computing Large-scale Resource Management
Self-Managed Systems: an Architectural Challenge
Fast, Interactive, Language-Integrated Cluster Computing
Presentation transcript:

Adaptive Stream Processing using Dynamic Batch Sizing Tathagata Das, Yuan Zhong, Ion Stoica, Scott Shenker

Big Data- The 3 V’s

Batched stream Processing Examples: Comet, Spark Streaming -Run streaming computations as a number of short, deterministic batch jobs -Spark- State between batches kept in memory as Resilient Distributed Datasets Batch interval – Size of the batch in seconds

Latency = Batching delay + Queueing delay + Processing time Lower batch intervals lead to lower latency Stability- batch processing time must not exceed the batch interval Stability Condition Line

Static batch intervals- Offline Learning Limitations Specific to cluster resources Unpredictable changes in incoming data rates hard to model

The Control Module Control Module – Learns behaviour of the system to decide batch interval Desirable Properties Low Latency Agility Generality Ease of Configuration

Dynamic Batch Sizing Goals - Achieve minimum batch interval - Ensure system stability - Speed - Adapt to changing workloads

Early Solutions Controls based on Binary Signals Controls based on Gradient information -Large Gradient- Reduce the batch interval by a configured step size -Small Gradient- Increase or decrease the batch interval based on operating point

Fixed Point Iteration Relaxing the requirements Batch intervals chosen based on the intersection between workload function ω(x) and the condition line batch processing time=px batch size and p<1

Type I intersections - Find a point x* such that ω(x*) = px Type I intersection is reduced to solving the fixed point function f(x) = ω(x) /p such that f(x*) = ω(x*) /p = x* Start with an initial guess x 1 Iterate using x n+1 = f(x n ) = ω(x*) /p for n=1,2,… - Converges quickly

Type II intersections Condition: ω(x 1 ) /x 1 < ω(x 2 ) /x 2 ω(x 1 ) > px 1 and ω(x 2 ) > px 2 Next batch interval X n+1 = (1-r) x min (x 1, x 2 )

-Checking Type II inconclusive if the batch intervals of the last 2 completed jobs are the same -Principle of slow start -Algorithm does not converge if Type I intersection does not exist

System Architecture Adaptive window based aggregation

Evaluation 20 m1.x large EC2 instances p= 0.7 r = 0.25 High vale of p = Greater robustness, Larger batch intervals High value of r = Quick Type I Convergence

Comparisons with static batch intervals Comparison of the end-to-end latency observed with static and time-varying data rates and static/dynamic batch intervals Static data rates Time varying data rates

Reduction in queueing delay Queueing delay starts building up -Sinusoidal input data rate between 1 and 6.4 MB/s -Static batch interval = 1.3 s -Dynamically adapting batch size -Reduction in end to end latency

Changing workloads -Reduce workload -Constant data rate 6MB/s -Number of keys varied between 500 and 40000

Resource Variations -Background jobs consuming 25% of the cluster resources Timeline of batch interval and other times for the reduce workload under variations in the available workload

Comments + No prior knowledge of the workload + Ability to adapt to a varying number of workloads + Achieves low latency, agility and ease of configuration + No modifications required to the programming interface -The control loop introduces an overhead -Failure of the control module -Slow start/ Initial parameters -Batching strategy for lost streams and sudden unexpected spikes?

Discussion/Questions Possible challenges when applied to the continuous operator model?

Comparison with gradient based algorithm