Introducing Flink on Mesos Eron Wright – DELL

Slides:



Advertisements
Similar presentations
Resource Management with YARN: YARN Past, Present and Future
Advertisements

Copyright 2009 FUJITSU TECHNOLOGY SOLUTIONS PRIMERGY Servers and Windows Server® 2008 R2 Benefit from an efficient, high performance and flexible platform.
Technical Architectures
1 Operating Systems Ch An Overview. Architecture of Computer Hardware and Systems Software Irv Englander, John Wiley, Bare Bones Computer.
MCTS Guide to Microsoft Windows Server 2008 Network Infrastructure Configuration Chapter 7 Configuring File Services in Windows Server 2008.
Microsoft Load Balancing and Clustering. Outline Introduction Load balancing Clustering.
Google App Engine Google APIs OAuth Facebook Graph API
 Cloud computing  Workflow  Workflow lifecycle  Workflow design  Workflow tools : xcp, eucalyptus, open nebula.
M i SMob i S Mob i Store - Mobile i nternet File Storage Platform Chetna Kaur.
— Build your own enterprise-class PaaS platform. Master Cloudset Cloudset1 Cloudset Resource Pool cloud Dedicated resource can be assigned to a cloudset.
1 Apache. 2 Module - Apache ♦ Overview This module focuses on configuring and customizing Apache web server. Apache is a commonly used Hypertext Transfer.
Presented by: Sanketh Beerabbi University of Central Florida COP Cloud Computing.
HPDC 2007 / Grid Infrastructure Monitoring System Based on Nagios Grid Infrastructure Monitoring System Based on Nagios E. Imamagic, D. Dobrenic SRCE HPDC.
Guide to Linux Installation and Administration, 2e1 Chapter 2 Planning Your System.
Module 11: Implementing ISA Server 2004 Enterprise Edition.
ArcGIS Server for Administrators
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
20409A 7: Installing and Configuring System Center 2012 R2 Virtual Machine Manager Module 7 Installing and Configuring System Center 2012 R2 Virtual.
Aneka Cloud ApplicationPlatform. Introduction Aneka consists of a scalable cloud middleware that can be deployed on top of heterogeneous computing resources.
Apache Struts Technology A MVC Framework for Java Web Applications.
/16 Final Project Report By Facializer Team Final Project Report Eagle, Leo, Bessie, Five, Evan Dan, Kyle, Ben, Caleb.
Claudio Grandi INFN Bologna Virtual Pools for Interactive Analysis and Software Development through an Integrated Cloud Environment Claudio Grandi (INFN.
Cofax Scalability Document Version Scaling Cofax in General The scalability of Cofax is directly related to the system software, hardware and network.
Spark on Entropy : A Reliable & Efficient Scheduler for Low-latency Parallel Jobs in Heterogeneous Cloud Huankai Chen PhD Student at University of Kent.
Structured Container Delivery Oscar Renalias Accenture Container Lead (NOTE: PASTE IN PORTRAIT AND SEND BEHIND FOREGROUND GRAPHIC FOR CROP)
Introduction to Operating Systems Concepts
Introduction Service Management Automation
Microsoft Build /9/2017 5:00 AM © 2016 Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY,
Flink Security Enhancements
Scalable containers with Apache Mesos and DC/OS
How Alluxio (formerly Tachyon) brings a 300x performance improvement to Qunar’s streaming processing Xueyan Li (Qunar) & Chunming Li (Garena)
Dockerize OpenEdge Srinivasa Rao Nalla.
Introduction to Distributed Platforms
OpenLegacy Training Day Four Introduction to Microservices
The PaaS Layer in the INDIGO-DataCloud
Large-scale file systems and Map-Reduce
Consulting Services JobScheduler Architecture Decision Template
Direct Attached Storage and Introduction to SCSI
Running Apache Flink® Everywhere
Apache Hadoop YARN: Yet Another Resource Manager
Introduction to Microservices Prepared for
Direct Attached Storage and Introduction to SCSI
Solving ETL Bottlenecks with SSIS Scale Out
Using docker containers
DUCKS – Distributed User-mode Chirp-Knowledgeable Server
湖南大学-信息科学与工程学院-计算机与科学系
20409A 7: Installing and Configuring System Center 2012 R2 Virtual Machine Manager Module 7 Installing and Configuring System Center 2012 R2 Virtual.
Chapter 9: IOS Images and Licensing
Chapter 2: System Structures
Simplified Development Toolkit
Introduction to Apache
RM3G: Next Generation Recovery Manager
ESIS Consulting LLC (C) ESIS Consulting LLC. All rights reserved
Introduction Apache Mesos is a type of open source software that is used to manage the computer clusters. This type of software has been developed by the.
Introduction to Operating Systems
Operating Systems : Overview
Container cluster management solutions
Specialized Cloud Architectures
Introduction to Operating Systems
Operating Systems : Overview
Operating Systems : Overview
Operating Systems : Overview
Operating Systems : Overview
Operating Systems : Overview
OpenStack Summit Berlin – November 14, 2018
Robert Down & Pranay Sadarangani Nov 8th 2011
Harrison Howell CSCE 824 Dr. Farkas
MapReduce: Simplified Data Processing on Large Clusters
Pig Hive HBase Zookeeper
Presentation transcript:

Introducing Flink on Mesos Eron Wright – DELL

2of 15 What is Apache Mesos? A popular cluster manager (similar to YARN) Makes available CPU, memory, & disk resources Unique capabilities for storage services Emerging as a foundation for data-centric, converged infrastructure Provides a programming model for using cluster resources A Mesos program is called a “framework” Packaged into an open-source distribution called DCOS Prescribes best practices related to Mesos frameworks, related services, etc.

3of 15 Why Flink on Mesos? Flink works best on a cluster manager –Easy to scale each job independently –Externalize scheduling logic (fairness, quota, …) –Good job isolation Flink can benefit from unique Mesos capabilities –Disk resources –Dynamic resource management –Unique management features (e.g. inverse offers for controlled downscaling & maintenance)

Demo

Flink Master Process

6of 15 Introduction Flink Master Process The Flink Master Process is: –The “Application Master” for a single Flink cluster –A Mesos framework! Hosts numerous components: –Job Manager –Resource Manager (acts as Mesos scheduler) –Artifact Server (HTTP server for Mesos fetcher) Responsible for TM scaling and recovery –Handles JobManager scale change requests –Stores task state in ZooKeeper host1 host2 Master JM RM HTTPD TM Mesos

7of 15 How it Works Flink Master Process Offer handling: –Uses Netflix Fenzo as an optimizerNetflix Fenzo –Gathers offers until all tasks launched Recovery: –Stores intentional state in ZooKeeper –Master uses leader election –Mesos allows some time for recovery before killing tasks Monitoring: –Detects task failure; launches replacement automatically. host1 host2 Master TM 4. Launch Mesos 2. Resource Offers 1. Register 5. Fetch (HTTP) 6. Status update 3. Optimize

8of 15 Configuration Flink Master Process (Con’t) Framework Info –mesos.resourcemanager.framework.secret –mesos.resourcemanager.framework.principal –mesos.resourcemanager.framework.role Mesos Master Info – mesos.master : (IP address or ZK lookup info) –mesos.failover-timeout Note: no port configuration is necessary; Mesos automatically assigns ports.

Dispatcher

10of 15 Introduction Dispatcher A highly-available service for launching Flink clusters. A Mesos framework! Accessed via REST by the CLI DCOS compatibility: –HTTP-based –Accessible via the Admin Router –(future) JWT authentication Aligned with FLIP-6 host1 1D 1C 1B 1A host2 2D 2C 2B 2A host3 3D 3C 3B 3A host4 4D 4C 4B 4A Dispatcher Master TM Master CLI TM Mesos

11of 15 Framework Hierarchy Dispatcher (Con’t) Nesting of frameworks is a common Mesos pattern. Here, Marathon launches the dispatcher, which launches the Flink Master Process, etc. Architecturally, it avoids a dependency on the Marathon API. For example, Aurora could be used here in place of Marathon. Dispatcher Master Maratho n TM (Task)

12of 15 Launching a Session Dispatcher (Con’t) Use: mesos-session.sh CLI uploads files to dispatcher via HTTP –Flink Configuration –Supplemental files ( --ship ) –Keytabs –Certificates Dispatcher adds additional elements: –Configuration ›ZooKeeper Namespace –Flink JAR –… host1 1D 1C 1B 1A host2 2D 2C 2B 2A host3 3D 3C 3B 3A host4 4D 4C 4B 4A Dispatcher Master TM CLI HTTP(S) TM HTTP(S) Mesos

13of 15 Dispatcher Deployment Modes Dispatcher (Con’t) Dispatcher is usable in two ways Remote Mode: –Recommended for detached execution Local Mode: –Recommended for simple, interactive sessions (e.g. flink shell) 3C 3B 3A 4C 4B 4A Dispatcher Master CLI HTTP(S) 3C 3B 3A 4C 4B 4A Master CLI + Dispatcher Local ModeRemote Mode

Summary

15of 15 Future Directions Dynamic Scaling –Add/remove Task Managers in response to scale changes over a job’s lifetime –Support Mesos maintenance procedures (e.g. inverse offers) Dispatcher Evolution (FLIP-6) –Generalize to support all deployment scenarios, unified CLI –Provide a centralized Web UI (incl. job history) –Authentication Support (e.g. OAuth 2.0) Docker Image Support –Tracking the “Mesos unified containerizer” Mesos Disk Support –Allocate multiple disks for Task Manager temp space –Scale up the I/O

16of 15 Project Status Targeted for: Flink 1.2 Contributors: –Eron Wright (Dell EMC) –Maximilian Michels (data Artisans) Design Doc: –Mesos Integration on Google DocsMesos Integration on Google Docs JIRAs: –FLINK-1984 – Integrate Flink with Apache MesosFLINK-1984 Code: –