Integrating Scale Out and Fault Tolerance in Stream Processing using Operator State Management Author: Raul Castro Fernandez, Matteo Migliavacca, et al.

Slides:



Advertisements
Similar presentations
Load Management and High Availability in Borealis Magdalena Balazinska, Jeong-Hyon Hwang, and the Borealis team MIT, Brown University, and Brandeis University.
Advertisements

LIBRA: Lightweight Data Skew Mitigation in MapReduce
MapReduce Online Created by: Rajesh Gadipuuri Modified by: Ying Lu.
MapReduce Online Veli Hasanov Fatih University.
High-Performance Complex Event Processing over Streams Eugene Wu, Yanlei Diao, ShariqRizvi Presented by Ming Li and Mo Liu Presented by Ming Li and Mo.
Software Quality Assurance Plan
Sylvia Ratnasamy, Paul Francis, Mark Handley, Richard Karp, Scott Schenker Presented by Greg Nims.
Discretized Streams: Fault-Tolerant Streaming Computation at Scale Wenting Wang 1.
Discretized Streams Fault-Tolerant Streaming Computation at Scale Matei Zaharia, Tathagata Das (TD), Haoyuan (HY) Li, Timothy Hunter, Scott Shenker, Ion.
Discretized Streams An Efficient and Fault-Tolerant Model for Stream Processing on Large Clusters Matei Zaharia, Tathagata Das, Haoyuan Li, Scott Shenker,
Probabilistic Aggregation in Distributed Networks Ling Huang, Ben Zhao, Anthony Joseph and John Kubiatowicz {hling, ravenben, adj,
Chapter 10: Stream-based Data Management Title: Design, Implementation, and Evaluation of the Linear Road Benchmark on the Stream Processing Core Authors:
Scalable Distributed Stream System Mitch Cherniack, Hari Balakrishnan, Magdalena Balazinska, Don Carney, Uğur Çetintemel, Ying Xing, and Stan Zdonik Proceedings.
Probabilistic Data Aggregation Ling Huang, Ben Zhao, Anthony Joseph Sahara Retreat January, 2004.
MS CLOUD DB - AZURE SQL DB Fault Tolerance by Subha Vasudevan Christina Burnett.
Distributed Computations MapReduce
1 04/18/2005 Flux Flux: An Adaptive Partitioning Operator for Continuous Query Systems M.A. Shah, J.M. Hellerstein, S. Chandrasekaran, M.J. Franklin UC.
Recovery Techniques in Distributed Databases Naveen Jones December 5, 2011.
Shuchang Shan † ‡, Yu Hu †, Xiaowei Li † † Key Laboratory of Computer System and Architecture, Institute of Computing Technology, Chinese Academy of Sciences.
Advanced Topics: MapReduce ECE 454 Computer Systems Programming Topics: Reductions Implemented in Distributed Frameworks Distributed Key-Value Stores Hadoop.
Introduction to Parallel Programming MapReduce Except where otherwise noted all portions of this work are Copyright (c) 2007 Google and are licensed under.
An adaptive framework of multiple schemes for event and query distribution in wireless sensor networks Vincent Tam, Keng-Teck Ma, and King-Shan Lui IEEE.
Secure Web Applications via Automatic Partitioning Stephen Chong, Jed Liu, Andrew C. Meyers, Xin Qi, K. Vikram, Lantian Zheng, Xin Zheng. Cornell University.
Implementation Yaodong Bi. Introduction to Implementation Purposes of Implementation – Plan the system integrations required in each iteration – Distribute.
GeoGrid: A scalable Location Service Network Authors: J.Zhang, G.Zhang, L.Liu Georgia Institute of Technology presented by Olga Weiss Com S 587x, Fall.
Providing Resiliency to Load Variations in Distributed Stream Processing Ying Xing, Jeong-Hyon Hwang, Ugur Cetintemel, Stan Zdonik Brown University.
1 Fast Failure Recovery in Distributed Graph Processing Systems Yanyan Shen, Gang Chen, H.V. Jagadish, Wei Lu, Beng Chin Ooi, Bogdan Marius Tudor.
Active Monitoring in GRID environments using Mobile Agent technology Orazio Tomarchio Andrea Calvagna Dipartimento di Ingegneria Informatica e delle Telecomunicazioni.
An Integration Framework for Sensor Networks and Data Stream Management Systems.
Service Architecture of Grid Faults Diagnosis Expert System Based on Web Service Wang Mingzan, Zhang ziye Northeastern University, Shenyang, China.
Young Suk Moon Chair: Dr. Hans-Peter Bischof Reader: Dr. Gregor von Laszewski Observer: Dr. Minseok Kwon 1.
Pregel: A System for Large-Scale Graph Processing Grzegorz Malewicz, Matthew H. Austern, Aart J. C. Bik, James C. Dehnert, Ilan Horn, Naty Leiser, and.
MapReduce M/R slides adapted from those of Jeff Dean’s.
Benchmarking MapReduce-Style Parallel Computing Randal E. Bryant Carnegie Mellon University.
SALSASALSASALSASALSA Design Pattern for Scientific Applications in DryadLINQ CTP DataCloud-SC11 Hui Li Yang Ruan, Yuduo Zhou Judy Qiu, Geoffrey Fox.
MapReduce Kristof Bamps Wouter Deroey. Outline Problem overview MapReduce o overview o implementation o refinements o conclusion.
Data Streams: Lecture 101 Window Aggregates in NiagaraST Kristin Tufte, Jin Li Thanks to the NiagaraST PSU.
Toward Fault-tolerant P2P Systems: Constructing a Stable Virtual Peer from Multiple Unstable Peers Kota Abe, Tatsuya Ueda (Presenter), Masanori Shikano,
Streamflow - Programming Model for Data Streaming in Scientific Workflows Chathura Herath.
ACM SIGMOD International Conference on Management of Data, Beijing, June 14 th, Keyword Search on Relational Data Streams Alexander Markowetz Yin.
Research of P2P Architecture based on Cloud Computing Speaker : 吳靖緯 MA0G0101.
Design Issues of Prefetching Strategies for Heterogeneous Software DSM Author :Ssu-Hsuan Lu, Chien-Lung Chou, Kuang-Jui Wang, Hsiao-Hsi Wang, and Kuan-Ching.
1 Fault-Tolerant Mechanism for Hierarchical Branch and Bound Algorithm Université A/Mira de Béjaïa CEntre de Recherche sur l’Information Scientifique et.
A N I N - MEMORY F RAMEWORK FOR E XTENDED M AP R EDUCE 2011 Third IEEE International Conference on Coud Computing Technology and Science.
1 Semantics and Evaluation Techniques for Window Aggregates in Data Streams Jin Li, David Maier, Kristin Tufte, Vassilis Papadimos, Peter Tucker This work.
MapReduce: Simplified Data Processing on Large Clusters By Dinesh Dharme.
Load Rebalancing for Distributed File Systems in Clouds.
Adaptive Online Scheduling in Storm Paper by Leonardo Aniello, Roberto Baldoni, and Leonardo Querzoni Presentation by Keshav Santhanam.
Escalator Questionnaire. Why a Questionnaire? To understand better the challenges of smooth upgrade in the OPNFV context –Part 1: Questions investigating.
BlinkDB: Queries with Bounded Errors and Bounded Response Times on Very Large Data Authored by Sameer Agarwal, et. al. Presented by Atul Sandur.
1 Out of Order Processing for Stream Query Evaluation Jin Li (Portland State Universtiy) Joint work with Theodore Johnson, Vladislav Shkapenyuk, David.
NOVA University of Lisbon
TensorFlow– A system for large-scale machine learning
S. Sudarshan CS632 Course, Mar 2004 IIT Bombay
International Conference on Data Engineering (ICDE 2016)
Applying Control Theory to Stream Processing Systems
Modeling Stream Processing Applications for Dependability Evaluation
PREGEL Data Management in the Cloud
PA an Coordinated Memory Caching for Parallel Jobs
Supporting Fault-Tolerance in Streaming Grid Applications
COS 518: Advanced Computer Systems Lecture 11 Michael Freedman
Capital One Architecture Team and DataTorrent
MapReduce Simplied Data Processing on Large Clusters
湖南大学-信息科学与工程学院-计算机与科学系
Cse 344 May 4th – Map/Reduce.
Discretized Streams: A Fault-Tolerant Model for Scalable Stream Processing Zaharia, et al (2012)
with Raul Castro Fernandez* Matteo Migliavacca+ and Peter Pietzuch*
An Analysis of Stream Processing Languages
COS 518: Advanced Computer Systems Lecture 12 Michael Freedman
The Gamma Database Machine Project
Presentation transcript:

Integrating Scale Out and Fault Tolerance in Stream Processing using Operator State Management Author: Raul Castro Fernandez, Matteo Migliavacca, et al. Published conference: SIGMOD’13 Reporter: Ma Yuanwen

Introduction Stream – A sequence of tuples – Sensor network, stock trading system Query plan – A query is specified as a directed acyclic graph Distributed stream processing – A query is deployed on a set of nodes State management – Checkpoint, back up, restore, partition Scale out – Split the instance, when the instance is overload Fault tolerance – Recover from failures without affecting processing results

Outline Background – Problem Statement – System Model State Management – Query state – State Operations Scale out and fault tolerance – Fault tolerance scale out algorithm – System architecture – Bottleneck detection and scaling policy Evaluation Conclusions

Problem Statement Operators – Stateless operators (e.g. filter or map) and stateful operators(e.g. join or aggregation ) – Sliding window based state – Entire history based state Intra-query parallelism – Query graph and execution graph Fault tolerance – Passive standby strategy – Active standby strategy – Upstream backup strategy Report the words frequencies in the recent 1 hour about every 10 minutes

System model(1) nameAgesex Li Lei16male Han Mei15female Jim17male nameagesex Li Lei16male nameagesex Han Mei15female nameagesex Jim17male

System model(2)

Query state (1)

Query state (2)

State operations Operator state backup and restore – Checkpoint the state of an operator and backup the state to an upstream operator – Restore state for failure and scale out Operator state partitioning – When a stateful operator scales out, it’s processing state must be split across the new partitioned operators

Operator state backup and restore

Operator state partitioning

Scale out and Fault Tolerance Scale out – SPS partitions operator on-demand in response to bottleneck operators Fault Tolerance – If a node hosting an operator fails, the SPS must replace it with an operator on a new node Operator recover becomes special case of scale out, in which a failed Operator is scale out to a parallelization of 1

Fault-tolerant scale out algorithm

System architecture Query manager – Perform a mapping of query operators to nodes and maintain the execution graph Deployment manager – use the execution graph to initialize nodes, deploy operators, set up stream communication and start processing

Bottleneck detection and scaling policy

Goals and deployment of evaluation The goals of experimental evaluation are to investigate – The effectiveness of stateful operator scale out approach – The recovery time of the stateful recovery mechanism – The impact of state management approach on tuple processing latency Experiment deployment

Experiment data Linear road benchmark (LRB) – It models a road toll network – Queries: (1) Provide toll notifications to vehicles within 5s; (2) detect accidents within 5s; (3) answer balance account queries about paid toll amounts – The input rate for a single express-way (L=1) begins at 15 tuples/s and increase to 1700 tuples/s Wikipedia – A map/reduce-style top-k query That outputs every 30 seconds the ranking of the most visited Wikipedia language versions based on Wikipedia data traces

Dynamic scale out (1)

Dynamic scale out (2)

Failure recovery Word count

State management overhead

Conclusions Provide state management of stateful operators – Checkpoint, back up, restore, partition Present an integrated approach for scale out and failure recovery