Copyright © Univa Corporation, 2016. All Rights Reserved www.univa.com Using Containers for HPC Workloads HEPiX – Apr 21, 2016 Fritz Ferstl – CTO, Univa.

Slides:



Advertisements
Similar presentations
Tivoli Software from IBM Storage Resource Management Webcast
Advertisements

Introduction to Grid Application On-Boarding Nick Werstiuk
A Workflow Engine with Multi-Level Parallelism Supports Qifeng Huang and Yan Huang School of Computer Science Cardiff University
SLA-Oriented Resource Provisioning for Cloud Computing
System Center 2012 R2 Overview
Current impacts of cloud migration on broadband network operations and businesses David Sterling Partner, i 3 m 3 Solutions.
Profit from the cloud TM Parallels Dynamic Infrastructure AndOpenStack.
Faster, More Scalable Computing in the Cloud Pavan Pant, Director Product Management.
Application Models for utility computing Ulrich (Uli) Homann Chief Architect Microsoft Enterprise Services.
Building a Cluster Support Service Implementation of the SCS Program UC Computing Services Conference Gary Jung SCS Project Manager
What is Cloud Computing? o Cloud computing:- is a style of computing in which dynamically scalable and often virtualized resources are provided as a service.
Microsoft Virtual Server 2005 Product Overview Mikael Nyström – TrueSec AB MVP Windows Server – Setup/Deployment Mikael Nyström – TrueSec AB MVP Windows.
Copyright © 2010 Platform Computing Corporation. All Rights Reserved. Platform Computing Ken Hertzler VP Product Management.
© 2010 VMware Inc. All rights reserved Confidential VMware Vision Jarod Martin Senior Solutions Engineer.
INTRODUCTION TO CLOUD COMPUTING Cs 595 Lecture 5 2/11/2015.
Sanbolic Enabling the Always-On Enterprise Company Overview.
CHAPTER FIVE Enterprise Architectures. Enterprise Architecture (Introduction) An enterprise-wide plan for managing and implementing corporate data assets.
Cloud computing is the use of computing resources (hardware and software) that are delivered as a service over the Internet. Cloud is the metaphor for.
Bob Thome, Senior Director of Product Management, Oracle SIMPLIFYING YOUR HIGH AVAILABILITY DATABASE.
 Cloud computing  Workflow  Workflow lifecycle  Workflow design  Workflow tools : xcp, eucalyptus, open nebula.
Introduction To Windows Azure Cloud
Web Services Igor Wasinski Olumide Asojo Scott Hannan.
Taking the Complexity out of Cluster Computing Vendor Update HPC User Forum Arend Dittmer Director Product Management HPC April,
Workshop on the Future of Scientific Workflows Break Out #2: Workflow System Design Moderators Chris Carothers (RPI), Doug Thain (ND)
Cloud Age Time to change the programming paradigm?
 Apache Airavata Architecture Overview Shameera Rathnayaka Graduate Assistant Science Gateways Group Indiana University 07/27/2015.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
GAAIN Virtual Appliances: Virtual Machine Technology for Scientific Data Analysis Arihant Patawari USC Stevens Neuroimaging and Informatics Institute July.
Accumulus Delivers Enterprise Class Subscription Billing and Automation Solutions for Gaming, Retail, and More on the Scalable Microsoft Azure Platform.
International Symposium on Grid Computing (ISGC-07), Taipei - March 26-29, 2007 Of 16 1 A Novel Grid Resource Broker Cum Meta Scheduler - Asvija B System.
SAM for SQL Workloads Presenter Name.
Robert Mahowald August 26, 2015 VP, Cloud Software, IDC
MidVision Enables Clients to Rent IBM WebSphere for Development, Test, and Peak Production Workloads in the Cloud on Microsoft Azure MICROSOFT AZURE ISV.
Enabling the Cloud OS Today  New high-density Web Sites with elastic cloud scaling and complete dev-ops experiences  New rich IaaS experience for self-service.
Tackling I/O Issues 1 David Race 16 March 2010.
© 2009 IBM Corporation IBM Cloud Computing Tivoli Service Automation Manager V7.2 The Core of the Service Management System for Cloud Computing.
Microsoft Azure and ServiceNow: Extending IT Best Practices to the Microsoft Cloud to Give Enterprises Total Control of Their Infrastructure MICROSOFT.
Cisco Consulting Services for Application-Centric Cloud Your Company Needs Fast IT Cisco Application-Centric Cloud Can Help.
#msitconf. Damien Caro Technical Evangelist Manager, Что будет, если приложение поместить в контейнер? What happens if the application.
INTRODUCTION TO GRID & CLOUD COMPUTING U. Jhashuva 1 Asst. Professor Dept. of CSE.
© 2012 Eucalyptus Systems, Inc. Cloud Computing Introduction Eucalyptus Education Services 2.
DECTRIS Ltd Baden-Daettwil Switzerland Continuous Integration and Automatic Testing for the FLUKA release using Jenkins (and Docker)
Structured Container Delivery Oscar Renalias Accenture Container Lead (NOTE: PASTE IN PORTRAIT AND SEND BEHIND FOREGROUND GRAPHIC FOR CROP)
Docker for Ops: Operationalize Your Apps in Production Vivek Saraswat Sr. Product Evan Hazlett Sr. Software
Run Azure Services in your datacenter
Univa Grid Engine Makes Work Management Automatic and Efficient, Accelerates Deployment of Cloud Services with Power of Microsoft Azure MICROSOFT AZURE.
Containers as a Service with Docker to Extend an Open Platform
Organizations Are Embracing New Opportunities
Accelerate your DevOps with OpenShift by Red Hat
Hybrid Management and Security
Docker Birthday #3.
Bridges and Clouds Sergiu Sanielevici, PSC Director of User Support for Scientific Applications October 12, 2017 © 2017 Pittsburgh Supercomputing Center.
Platform as a Service.
Microsoft SharePoint Server 2016
Containers in HPC By Raja.
2017 Real Questions
Kubernetes Container Orchestration
Cloud Computing Dr. Sharad Saxena.
Securing Cloud-Native Applications Jason Schmitt CEO
Enterprise Application Stores
Microsoft Ignite NZ October 2016 SKYCITY, Auckland.
Developing for the cloud with Visual Studio
JOINED AT THE HIP: DEVSECOPS AND CLOUD-BASED ASSETS
Serverless Architecture in the Cloud
IBM Power Systems.
Harrison Howell CSCE 824 Dr. Farkas
Productive + Hybrid + Intelligent + Trusted
Containers and DevOps.
Containers on Azure Peter Lasne Sr. Software Development Engineer
Presentation transcript:

Copyright © Univa Corporation, All Rights Reserved Using Containers for HPC Workloads HEPiX – Apr 21, 2016 Fritz Ferstl – CTO, Univa

Copyright © Univa Corporation, All Rights Reserved 2 Who is Univa

Copyright © Univa Corporation, All Rights Reserved Market Leading Intelligent Workload Management 500+ Customers – 82% Revenue from Enterprise 3 Proven Technology: 500+ Customers

Copyright © Univa Corporation, All Rights Reserved 4 Container Use Cases Use CaseDescription SandboxingEmbed OS, SW stack and configuration dependencies with application Resource Isolation Restrict resource usage within container Limit interference of applications on same node App PackingPack more applications on a node than possible with VMs Portability / Cloud-Native Run same container on modern versions of Linux without changes Including on-premise or public clouds MicroservicesStateless single app containers working together to provide a scalable, highly available and portable service DevOps Application-centric versus machine/server-centric design Support for automatic container builds Built-in version tracking Reusable components Public registry for sharing containers A growing tools ecosystem from the published API

Copyright © Univa Corporation, All Rights Reserved 5 HPC Use Case Considerations  What works well:  New applications architected with containers in mind  Sequential or multi-threaded  Simple IO and networking dependencies  What can work but with challenges:  Distributed memory apps (MPI, etc) run on single host  Strong reliance on network access to POSIX filesystems (NFS)  GPU integration (issues are socket affinity, for instance)  What doesn't really work (yet):  Cross-host distributed memory apps  Strong reliance on network access to POSIX filesystems (NFS) ➠ Choices  Re-architect your software  Look towards HPC-specific container technology (Shifter, Singularity)  Wait for mainstream container innovation (Docker has announced pure ipvlan support)  Focus adoption on new software

Copyright © Univa Corporation, All Rights Reserved Bio-Tech Use Case 6

Copyright © Univa Corporation, All Rights Reserved 7 Use Case Description  Centre for Genomic Regulation (CRG)  Barcelona, Spain   Life-sciences research  Deployment of scientific data analysis pipelines on distributed clusters  Genome sequence discovery  Massive data analysis  large cluster  Life-science analysis pipelines have standardized on Grid Engine

Copyright © Univa Corporation, All Rights Reserved 8 Why Containers?  Large amount of different versions  Life-science analysis pipelines are open source and constantly evolving  Scientists make their own modifications  Large matrix of SW stack dependencies  Operating system  Workflow consist of many steps  SW stack layers with diverse version  Need for reproducibility and versioning  This is life-sciences research, after all  Key Challenge: maintain many versions of production environment over a long time  Sandboxing

Copyright © Univa Corporation, All Rights Reserved 9 Use Case Environment  Computing environment:  ~3,000 cores, heterogeneous systems  Managed with Univa Grid Engine  Non-trivial networking and shared storage  Cluster is a shared resource  Many users  Advanced policies, e.g. fair-sharing, back-filling and dependable resource controls  Advanced job types, e.g. array jobs  Detailed accounting and billing for resource consumption  High performance environment  Can't loose performance

Copyright © Univa Corporation, All Rights Reserved 10 Challenges and Solution Challenges  Sanboxing – maintain many production environments for a long time  At minimal or no performance impact:  From running applications in a container  From network and shared file system access from within a container  From starting the same containers over and over on nodes o Avoid to reload images Solution  CRG Nextflow workflow management  Integrated with Univa Grid Engine  And integrated with Docker  Make Univa Grid Engine Docker-aware  Enable Docker jobs  Container image cache-aware scheduling

Copyright © Univa Corporation, All Rights Reserved 11 Results  4% increase of Docker application run-time vs native run-time with cached images  12,5% increase with container bootstraping, i.e. downloading from image repository  Image-cache aware scheduling has solid benefit on utilization and throughput  Cost is considered low vs benefit by CRG  Use case requirements really can't be satisfied without containers

Copyright © Univa Corporation, All Rights Reserved Container Edition Univa Grid Engine 12

Copyright © Univa Corporation, All Rights Reserved 13 Docker with Univa Grid Engine  Launch Docker Container on best machine in cluster  Minimize layer downloads (not in early access version)  Reduces the time wasted (it can be minutes) waiting for the Docker image to download from the Docker registry. Container runs faster increasing throughput in the cluster.  Run Docker Containers in a Univa Grid Engine Cluster  Business Critical containers are prioritized over other containers. Increases efficiency of the overall organization.  Job Control and Limits for Docker Containers  Provides user and administrator control over containers running on Grid Engine Hosts.  Currently in Early Access; Release in May 2016

Copyright © Univa Corporation, All Rights Reserved 14 Docker Integration with Univa  Accounting for Docker Containers  Keeps track of containers. Share policies require accounting.  Data file Management for Docker Containers  Transparent access to input, output and error files. Simplifies the management of input and output files for Docker Containers and ensures any output or error files are moved to a location where the user can access them.  Interactive Docker Containers  Good for debugging when containers don’t work correctly!  Currently in Early Access; Release in May 2016

Copyright © Univa Corporation, All Rights Reserved Managing cloud-native and container-only micro-service frameworks Based on Googles Kubernetes

Copyright © Univa Corporation, All Rights Reserved Univa Container Solutions Easy installation, preconfigured solution including pre-integration with cloud services. Build a container cluster on premise or in the cloud. The fastest way to build a container cluster!! Respond Quickly: Easy to resize, adapt, dynamic provisioning Orchestrate and Optimize: Best use of resources and keep track of containers The most advanced container orchestration! ! 16

Copyright © Univa Corporation, All Rights Reserved Solving the set up problem – in minutes “Implementing Docker and Kubernetes is complicated. It's even more complicated for enterprises that have diverse workloads, various app stacks, heterogeneous infrastructures, and limited resources.” InfoWorld May

Copyright © Univa Corporation, All Rights Reserved 18 CloudOn Premises Servers / VMs After “The next step is large scale orchestration and scale”. 451 Research Before Containers and other workloads need resources Run containers at scale Blend containers with other workloads Maximize resources / use of cloud Navops orchestration solution

Copyright © Univa Corporation, All Rights Reserved 19 Conclusions  Containers are well suited to address use cases like sandboxing  A bit of a paradigm shift in SW development, networking, storage, security, provisioning, etc  Production use requires enterprise-ready container orchestration  Univa Grid Engine Container Edition for  Mixed Workloads  "Fat" Containers (no microservice)  "Jobs"  Navops for container-only, microservices-based SW architectures

Copyright © Univa Corporation, All Rights Reserved THANK YOU