Creating a Dynamic HPC Infrastructure with Platform Computing

Presentation transcript:

Creating a Dynamic HPC Infrastructure with Platform Computing
Chris Porter
Sept. 8th, 2011

The Business Problems
- Rigidity: Workload requirements are dynamic and only becoming more so
- Inefficiency: Unbalanced utilization of distributed resources (hot and cold spots)
- Incapacity: Peak demand occasionally exceeds local AND total supply of resources

Platform’s Roadmap to HPC in the Cloud
(Chart: efficiency, flexibility and service levels plotted against time. Stage labels: Cluster, Grid, Private Cloud, Public Cloud.)
1. Harness the power of commodity clusters
2. Employ effective resource management and sharing
3. Make infrastructure dynamic
4. Make infrastructure dynamic within a shared private cloud
5. Burst to external providers when needed
Chart annotations: “Platform LSF + Adaptive Cluster” (shown twice), “We Are Here”, “Coming soon!”

Typical Solutions
Static infrastructure = trade-offs
(Chart axes: Utilization vs. Service Level)
- Static HPC Capacity: Effective sharing and high utilization, but simply more demand than supply. The Problem? Capacity constrained by the static dedicated HPC resources.
- Large Job Starvation: A shared cluster with diverse sizes of jobs. The Problem? Resource fragmentation leading to large job starvation.
- Application Stack Silos: A single cluster with smart sharing, but with applications that require specific environments. The Problem? Better still, but capacity is constrained by the static application stacks and OS versions.
- Queue Sprawl - Multiple Host Groups: A single cluster with partitioned resources and dedicated queues based on OS and possibly application. The Problem? Better, but still expensive. Under-utilized, limited sharing. High administrative costs due to “queue sprawl”.
- Cluster Sprawl - Multiple Static Clusters: To ensure acceptable service levels, each department has their own cluster. The Problem? Costly, under-utilized silos that are expensive to manage and maintain.
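The cost of silos described on this slide can be illustrated with simple arithmetic. The sketch below is not from the presentation: the department demand and capacity figures are hypothetical, and the function name `unmet_and_idle` is made up for illustration. It compares three separate clusters with one pooled cluster of the same total size, ignoring OS and application stack constraints.

```python
def unmet_and_idle(demand, capacity):
    """Return (unmet demand, idle capacity) in slots.

    Each group may only use its own capacity, so a surplus in one
    group cannot cover a shortfall in another.
    """
    unmet = sum(max(d - c, 0) for d, c in zip(demand, capacity))
    idle = sum(max(c - d, 0) for d, c in zip(demand, capacity))
    return unmet, idle


# Hypothetical numbers: three departments, each with its own 80-slot cluster.
demand = [120, 30, 50]
capacity = [80, 80, 80]

# Cluster sprawl: every department is limited to its own cluster.
siloed = unmet_and_idle(demand, capacity)

# One shared cluster: total demand is matched against total capacity.
pooled = unmet_and_idle([sum(demand)], [sum(capacity)])

print("siloed (unmet, idle):", siloed)
print("pooled (unmet, idle):", pooled)
```

With these made-up figures the siloed layout leaves work waiting while other clusters sit idle, whereas the pooled layout serves all demand with capacity to spare. Real clusters also have to satisfy per-job OS and application requirements, which this sketch leaves out.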

Introducing Platform Adaptive Cluster
With Platform Adaptive Cluster, you can avoid this trade-off. It delivers both better economic value and service levels!
(Chart axes: Utilization vs. Service Levels / QoS)

Platform Adaptive Cluster
Transform a static cluster into a fully dynamic cloud environment
- Reduce complexity of user environment
- Control resource allocation policies at the group level
- Benefit from a mature, stable HPC cloud product
- Increase user service level
- Reduce cost & save power
- Increase resource sharing & utilization
- Redeploy servers quickly & efficiently
- Achieve self-service for users
- Allow the environment to flex as requirements change over time
- Automate administration

Create a Dynamic HPC Environment
Platform LSF + Adaptive Cluster
(Diagram labels: hosts running RHEL 5.5 and RHEL 4.8; a “Big Mem Job”)
Dynamic Provisioning of OS
- Problem: Workload out of balance with OS provisioning
- Multi-boot or use over-the-wire provisioning
- Job requirements drive the provisioning requirements
Memory Consolidation
- Problem: Large memory jobs starved by running jobs
- Plenty of memory is available in total, but not enough is available on any one server
- Use job containers to move the smaller jobs
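The memory consolidation idea on this slide can be sketched in a few lines of Python. This is a toy model under stated assumptions, not Platform LSF or Adaptive Cluster code: the `Host` class, the `consolidate_for` function, the greedy strategy and all host and job names are hypothetical. It looks for a host that can hold the big-memory job once some of its running jobs are moved elsewhere.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class Host:
    name: str
    total_mem_gb: int
    jobs: dict = field(default_factory=dict)  # job name -> memory in GB

    @property
    def free_mem_gb(self) -> int:
        return self.total_mem_gb - sum(self.jobs.values())


def consolidate_for(big_job_mem_gb: int, hosts: list[Host]):
    """Return (target host name, [(job, src, dst), ...]) or None.

    Greedy sketch: try hosts in order of free memory; on each candidate,
    move its running jobs (largest first) to other hosts until the big
    job would fit. The input hosts are never modified.
    """
    for target in sorted(hosts, key=lambda h: h.free_mem_gb, reverse=True):
        if target.total_mem_gb < big_job_mem_gb:
            continue  # this host could never hold the big job
        free = {h.name: h.free_mem_gb for h in hosts}  # scratch copy
        moves = []
        for job, mem in sorted(target.jobs.items(), key=lambda kv: -kv[1]):
            if free[target.name] >= big_job_mem_gb:
                break
            # Best fit: the other host with the least free memory that still fits.
            fits = [n for n in free if n != target.name and free[n] >= mem]
            if not fits:
                continue
            dst = min(fits, key=lambda n: free[n])
            free[dst] -= mem
            free[target.name] += mem
            moves.append((job, target.name, dst))
        if free[target.name] >= big_job_mem_gb:
            return target.name, moves
    return None


# Hypothetical cluster: 80 GB free in total, but at most 32 GB on one host.
hosts = [
    Host("a", 64, {"j1": 24, "j2": 16}),
    Host("b", 64, {"j3": 40}),
    Host("c", 64, {"j4": 32}),
]
plan = consolidate_for(48, hosts)
print(plan)
```

In this example a 48 GB job cannot start anywhere until one running job is moved, which is the starvation case the slide describes. How jobs are actually moved (the slide mentions job containers) is outside the scope of the sketch.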

Use Case: Large EDA Customer
Problem
- Service level requirements for high priority workload conflict with the need to keep utilization high
- Resources are reserved for critical workload
- When there is no priority workload, utilization is low
Alternatives
- Use the reserved resources and force priority workload to wait
- Preempt low priority workload: kill it or make it wait for the priority workload
Solution: Platform LSF + Platform Adaptive Cluster
- Use reserved infrastructure for low priority workload
- When priority workload arrives, migrate low priority workload to other resources
- Utilization is high, and no workload is starved or lost
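The policy in this use case can be modeled with a small simulation. The following is a minimal sketch under simplifying assumptions, not the product's scheduler: the `Cluster` class, its method names, the one-job-per-host model and the job names are all hypothetical. Low priority jobs may borrow idle reserved hosts, and an arriving priority job triggers a migration instead of a kill.

```python
from __future__ import annotations


class Cluster:
    """Toy model: one job slot per host; reserved hosts are held for priority work."""

    def __init__(self, shared: list[str], reserved: list[str]):
        self.shared = list(shared)
        self.reserved = list(reserved)
        self.slots = {host: None for host in shared + reserved}  # host -> job
        self.low_priority = set()  # names of running low priority jobs
        self.migrations = []       # (job, from_host, to_host)

    def _idle(self, hosts):
        return next((h for h in hosts if self.slots[h] is None), None)

    def submit_low(self, job):
        # Prefer shared hosts, then borrow an idle reserved host.
        host = self._idle(self.shared) or self._idle(self.reserved)
        if host is not None:
            self.slots[host] = job
            self.low_priority.add(job)
        return host

    def submit_priority(self, job):
        host = self._idle(self.reserved)
        if host is None:
            host = self._migrate_one_low_priority_job()
        if host is not None:
            self.slots[host] = job
        return host  # None means the priority job has to wait in this toy model

    def _migrate_one_low_priority_job(self):
        dest = self._idle(self.shared)
        if dest is None:
            return None  # assumption: no migration without a free shared host
        for src in self.reserved:
            victim = self.slots[src]
            if victim in self.low_priority:
                self.slots[dest], self.slots[src] = victim, None
                self.migrations.append((victim, src, dest))
                return src
        return None

    def finish(self, job):
        for host, running in self.slots.items():
            if running == job:
                self.slots[host] = None
        self.low_priority.discard(job)


cluster = Cluster(shared=["s1", "s2"], reserved=["r1"])
for name in ("sim-a", "sim-b", "sim-c"):
    cluster.submit_low(name)          # sim-c borrows the reserved host r1
cluster.finish("sim-a")               # frees shared host s1
placed = cluster.submit_priority("tapeout")
print(placed, cluster.migrations)
```

The sketch keeps every host busy while no priority work is present and loses no low priority job when priority work arrives. The case where no shared host is free is deliberately left as "priority job waits" here; the slide does not say how the product handles it.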

Any Questions?
Chris Porter
cporter@platform.com