Managing Scale and Complexity of Next Generation HPC Systems and Clouds Peter ffoulkes Vice President of Marketing April 2011.

Slides:



Advertisements
Similar presentations
1/17/20141 Leveraging Cloudbursting To Drive Down IT Costs Eric Burgener Senior Vice President, Product Marketing March 9, 2010.
Advertisements

©2009 HP Confidential template rev Ed Turkel Manager, WorldWide HPC Marketing 4/7/2011 BUILDING THE GREENEST PRODUCTION SUPERCOMPUTER IN THE.
System Center 2012 R2 Overview
Cloud Computing to Satisfy Peak Capacity Needs Case Study.
1 Vladimir Knežević Microsoft Software d.o.o.. 80% Održavanje 80% Održavanje 20% New Cost Reduction Keep Business Up & Running End User Productivity End.
Clouds C. Vuerli Contributed by Zsolt Nemeth. As it started.
Cloud SUT proposal OSGcloud group. Objective To fill in the Research the group about the thinking within the OSG working group To solicit new ideas/proposals.
Tunis, Tunisia, 28 April 2014 Business Values of Virtualization Mounir Ferjani, Senior Product Manager, Huawei Technologies 2.
High memory instances Monthly SLA : Virtual Machines Validated & supported Microsoft workloads Price reduction: standard Windows (22%) & Linux (29%)
Unified Automation Intelligence™ for Data Center, Cloud and HPC Peter ffoulkes, VP of Marketing September 2010.
Copyright 2009 FUJITSU TECHNOLOGY SOLUTIONS PRIMERGY Servers and Windows Server® 2008 R2 Benefit from an efficient, high performance and flexible platform.
FI-WARE – Future Internet Core Platform FI-WARE Cloud Hosting July 2011 High-level description.
© 2010 VMware Inc. All rights reserved Confidential VMware Vision Jarod Martin Senior Solutions Engineer.
Wally Kowal, President and Founder Canadian Cloud Computing Inc.
Be Smart, Use PwrSmart What Is The Cloud?. Where Did The Cloud Come From? We get the term “Cloud” from the early days of the internet where we drew a.
Cloud Basics.  Define what the Cloud is  Describe the essential characteristics are of the Cloud  Describe the service models of the Cloud  Describe.
SPRING 2011 CLOUD COMPUTING Cloud Computing San José State University Computer Architecture (CS 147) Professor Sin-Min Lee Presentation by Vladimir Serdyukov.
Cloud computing Tahani aljehani.
Next step of e-government.. Importance Foreword Cloud computing  Characteristics  Service  Users  Benefit Challenges in E-government Cloud government.
Duncan Fraiser, Adam Gambrell, Lisa Schalk, Emily Williams
EA and IT Infrastructure - 1© Minder Chen, Stages in IT Infrastructure Evolution Mainframe/Mini Computers Personal Computer Client/Sever Computing.
Plan Introduction What is Cloud Computing?
Cloud Computing in Large Scale Projects George Bourmas Sales Consulting Manager Database & Options.
Effectively and Securely Using the Cloud Computing Paradigm.
An Oracle SPARC/Solaris Private Cloud Reference Architecture/Implementation Harry J Foxwell, PhD Principal Consultant for Cloud Computing.
Cloud Computing Why is it called the cloud?.
CLOUD COMPUTING & COST MANAGEMENT S. Gurubalasubramaniyan, MSc IT, MTech Presented by.
Introduction to Cloud Computing
Cloud Computing.
VIRTUALIZATION AND CLOUD COMPUTING Dr. John P. Abraham Professor, Computer Engineering UTPA.
Component 4: Introduction to Information and Computer Science Unit 10: Future of Computing Lecture 2 This material was developed by Oregon Health & Science.
Cloud Computing. What is Cloud Computing? Cloud computing is a model for enabling convenient, on-demand network access to a shared pool of configurable.
+ CS 325: CS Hardware and Software Organization and Architecture Cloud Architectures.
The Legal Issues Facing Digital Forensic Investigations In A Cloud Environment Presented by Janice Rafraf 15/05/2015Janice Rafraf1.
Component 4: Introduction to Information and Computer Science Unit 10b: Future of Computing.
M.A.Doman Short video intro Model for enabling the delivery of computing as a SERVICE.
From Virtualization Management to Private Cloud with SCVMM 2012 Dan Stolts Sr. IT Pro Evangelist Microsoft Corporation
HPC Business update HP Confidential – CDA Required
Plan  Introduction  What is Cloud Computing?  Why is it called ‘’Cloud Computing’’?  Characteristics of Cloud Computing  Advantages of Cloud Computing.
2009 Federal IT Summit Cloud Computing Breakout October 28, 2009.
Looking Ahead: A New PSU Research Cloud Architecture Chuck Gilbert - Systems Architect and Systems Team Lead Research CI Coordinating Committee Meeting.
© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice. HP Update IDC HPC Forum.
PaaSport Introduction on Cloud Computing PaaSport training material.
| nectar.org.au NECTAR TRAINING Module 1 Overview of cloud computing and NeCTAR services.
Cloud computing Cloud Computing1. NIST: Five essential characteristics On-demand self-service Computing capabilities, disks are demanded over the network.
CLOUD COMPUTING RICH SANGPROM. What is cloud computing? “Cloud computing is a model for enabling ubiquitous, convenient, on-demand network access to a.
3/12/2013Computer Engg, IIT(BHU)1 CLOUD COMPUTING-1.
CISC 849 : Applications in Fintech Namami Shukla Dept of Computer & Information Sciences University of Delaware A Cloud Computing Methodology Study of.
Web Technologies Lecture 13 Introduction to cloud computing.
Bay Ridge Security Consulting (BRSC) Cloud Computing.
1 TCS Confidential. 2 Objective : In this session we will be able to learn:  What is Cloud Computing?  Characteristics  Cloud Flavors  Cloud Deployment.
1© Copyright 2015 EMC Corporation. All rights reserved. FEDERATION ENTERPRISE HYBRID CLOUD OPERATION SERVICES FULL RANGE OF SERVICES TO ASSIST YOUR STAFF.
CLOUD COMPUTING WHAT IS CLOUD COMPUTING?  Cloud Computing, also known as ‘on-demand computing’, is a kind of Internet-based computing,
Capacity Planning in a Virtual Environment Chris Chesley, Sr. Systems Engineer
Distributed Geospatial Information Processing (DGIP) Prof. Wenwen Li School of Geographical Sciences and Urban Planning 5644 Coor Hall
Innovative Partnership Solution-Driven Commitment Agile Value Sustainable.
Software as a Service (SaaS) Fredrick Dande, MBA, PMP.
Discussion Context NIST Cloud definition and extension to address network and infrastructure issues Discussion of the ISPD-RG Infrastructure definition.
Template V.17, July 29, 2011 What’s the Cloud Got to do with HR Transformation? Heath Brownsworth, Director Technology Strategy.
© 2012 Eucalyptus Systems, Inc. Cloud Computing Introduction Eucalyptus Education Services 2.
Welcome To We have registered over 5,000 domain names and host over 1,500 cloud servers for individuals and organizations, Our fast and reliable.
Advanced cloud infrastructures and services SAULIUS ŽIŪKAS.
Designing Cisco Data Center Unified Fabric
CLOUD COMPUTING Presented to Graduate Students Mechanical Engineering Dr. John P. Abraham Professor, Computer Engineering UTPA.
Introduction To Cloud Computing By Diptee Chikmurge And Minakshi Vharkate Asst.Professor MIT AOE Alandi(D),Pune.
Introduction to Cloud Computing
Cloud Computing.
CNIT131 Internet Basics & Beginning HTML
Dr. John P. Abraham Professor, Computer Engineering UTPA
Cloud Computing: Concepts
Presentation transcript:

Managing Scale and Complexity of Next Generation HPC Systems and Clouds Peter ffoulkes Vice President of Marketing April 2011

The World’s Most Capable Computing Systems Are Powered by Moab 2  The world’s largest HPC system, No. 2-ranked Jaguar, with over 18,500 nodes, 224,000 cores and a speed of 1.75 petaflop/s  Half of the top 10 systems  Over one third of the top 50 systems (17 systems)  38% of the compute cores in the top 100 systems Source: Nov 2010 rankings from The Top 500 List (Nov 2010) 2Oak Ridge National Laboratory 5Lawrence Berkeley National Lab 7Los Alamos National Laboratory 8University of Tennessee 10Los Alamos National Laboratory 12Lawrence Livermore National Lab 14Sandia National Laboratories 16Lawrence Livermore National Lab 23Forschungszentrum Jülich 26Lawrence Berkeley National Lab 30Oak Ridge National Laboratory 31Sandia National Laboratories 32NOAA / Oak Ridge National Lab 39SciNet, University of Toronto

Oak Ridge National Laboratory  Jaguar, the second most capable HPC system in the world, running at petaflop/s  18,686 nodes, 224,256 processing cores, 300TB of memory  Diversity of users was severely limiting system workload- management capability 3 Moab resolved Jaguar’s workload management problems and increased system utilization, decreased downtime, and allowed more control over resources

What’s Next… 4 Tsubami 2.0 2,816 (6 core) CPUs (16,896 cores) combined with 4,224 of NVIDIA’s Tesla M2050 (448 core) general-purpose GPUS, dual-rail, non-blocking fabric employing two Voltaire 40 Gb/s InfiniBand connections on each node Tianhe 1A 3,600 nodes, 14,336 (6 core) CPUs, 7,168 (448 core) GPUs - 86,000 general purpose cores and 7,168 GPUs, 160 Gbit/second Galaxy interconnect developed in China

Managing Scale and Complexity  Moab 6.0 A new command communication architecture that delivers a 100-fold increase in internal communications throughput Support for the most commonly used Moab commands and grids deploying multiple Moab instances, dramatically increasing the manageability of complex supercomputing environments Support for hybrid installations deploying GPGPU technologies in conjunction with TORQUE

Managing GPGPUs  Moab 6.0 and TORQUE Specify GPGPUs in the same manner as CPUs GPGPUs are requested as a defined resource Applications receive indexed GPGPU information about which GPGPU(s) to access Moab’s intelligent scheduling ensures GPGPUs never get oversubscribed GPGPU usage is recorded in utilization reports

Managing Scale and Complexity  Moab 6.0 New on-demand dynamic provisioning and management capabilities that support both virtual and physical resources, including VM migration for load balancing, workload packing and consolidation Idle-resource management to deliver increased utilization, efficiency and energy conservation for HPC and enterprise cloud deployments Improved administration and reporting, including new parameterized administration functions; enhanced limits for event, group and account management; and new formats for job and reservation event reporting

Managing Scale and Complexity  Moab Viewpoint 2.0 HPC as a service and HPC cloud capability: Creation, management and status reporting of reservations and job queues for HPC and batch workloads and system maintenance On-demand dynamic management of VMs and physical nodes Increased scalability to support management of tens of thousands of nodes and hundreds of thousands of VMs Flexible security management for flexible security options at installation, including built-in security, single Sign On (SSO), or Lightweight Directory Access Protocol (LDAP) models Service-based administration and reporting for easy access and management of HPC and cloud resources

University of Cambridge: Cosmos 9 Overview COSMOS has expanded: New SGI Altix UV1000, 6-core NehalemEX chips, 768 cores, 2TGB of global shared memory Existing SGI Altix 4700, 920 cores and 2.5TB RAM Both compute systems are supported by 64TB of high performance storage. Challenge Managing both cluster-based workloads and SMP shared memory workloads in the same environment

University of Birmingham 10 Overview The University of Birmingham’s 1500 core cluster runs a mixed workload, from many - often hundreds - of short single-core parameter-sweep jobs to massively parallel multi-core computations, some running for over a week. Challenges The workload is variable, especially at different times of the year, and keeping the whole cluster powered up during less busy periods is wasteful of power. A sophisticated system for managing the power requirements is required to be aware of the scheduled as well as the active workload to ensure that resources are always available when required without power being wasted. Solution Moab Adaptive HPC Suite™ Results An annual saving of about 10% of the current power costs, amounting to £50,000 from powering off nodes that are not in use. Further savings from ancillary supplies, especially the air conditioning in the datacentre are expected.

What’s in a cloud: Vapor-ware or silver lining? Is the Future of Computing Clear or is it Obscured by Clouds? National Institute of Standards and Technology: “Cloud computing is a model for enabling convenient, on-demand network access to a shared pool of configurable computing resources (e.g., networks, servers, storage, applications, and services) that can be rapidly provisioned and released with minimal management effort or service provider interaction. This cloud model promotes availability and is composed of five essential characteristics, three service models, and four deployment models.” Essential Characteristics: On-demand self-service, Broad network access, Resource pooling, Rapid elasticity, Measured Service. Service Models: Cloud Software as a Service (SaaS), Cloud Platform as a Service (PaaS), Cloud Infrastructure as a Service (IaaS) Deployment Models: Private cloud, Community cloud, Public cloud, Hybrid cloud. Note: Cloud software takes full advantage of the cloud paradigm by being service oriented with a focus on statelessness, low coupling, modularity, and semantic interoperability. The NIST Definition of Cloud Computing.

Agile Automated Adaptive Delivers business services rapidly, efficiently and successfully Eliminates human error, enables scaling and capacity, reduces management complexity and cost Anticipates and adapts intelligently to dynamic business service needs and conditions Three Essential Cloud Characteristics

13 SciNet—University of Toronto Solution Energy-aware, stateless, on-demand multi- OS provisioning Moab Adaptive HPC Suite™ and xCAT provisioning software 4,000 server supercomputer system 30,000 Intel Xeon 5500 cores, – a theoretical peak of 306 TFlops Results A state-of-the-art data center that saves enough energy to power more than 700 homes yearly. On-demand provisioning allows users to make their OS choice part of their automated job template. SciNet always has several different flavors of Linux running simultaneously. “Why should we pay for cooling when it’s so cold outside? Toronto is pretty cold for at least half of the year. We could have bought a humongous pile of cheap x86 boxes but couldn’t power, maintain or operate them in any logical way.” Dr. Daniel Gruner, PhD, chief technology officer of software for SciNet.

 Who: Top 3 financial services company  What: Moab - Automation Intelligence Manager will manage 80-90% of workloads (Up to 10,000+ applications of more than 100,000 servers across more than 10 datacenters)  Use Case: Iaas, PaaS, AaaS, using Workload-Driven Cloud 2.0  Objective: Increase agility, reduce risk and save over $1 billion dollars in 3 years. A Global Bank based in the USA

16