1 AppliedMicro X-Gene ® ARM Processors Optimized Scale-Out Solutions for Supercomputing.

Slides:



Advertisements
Similar presentations
The Development of Mellanox - NVIDIA GPUDirect over InfiniBand A New Model for GPU to GPU Communications Gilad Shainer.
Advertisements

Transforming Business with Advanced Analytics: Introducing the New Intel® Xeon® Processor E7 v2 Family Seetha Rama Krishna Director, APAC HPC Solutions.
1 Agenda … HPC Technology & Trends HPC Platforms & Roadmaps HP Supercomputing Vision HP Today.
©2009 HP Confidential template rev Ed Turkel Manager, WorldWide HPC Marketing 4/7/2011 BUILDING THE GREENEST PRODUCTION SUPERCOMPUTER IN THE.
© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice. The future is mission.
Lenovo ® ThinkServer ® RD350 and RD450 Speaker Name, Date.
Appro Xtreme-X Supercomputers A P P R O I N T E R N A T I O N A L I N C.
GPU Computing with Hartford Condor Week 2012 Bob Nordlund.
PVOCL: Power-Aware Dynamic Placement and Migration in Virtualized GPU Environments Palden Lama, Xiaobo Zhou, University of Colorado at Colorado Springs.
1© Copyright 2011 EMC Corporation. All rights reserved. EMC SQL Server Data Warehouse Fast Track Solutions Nov 30, 2011 Ling Wu, EMC.
Linux Clustering A way to supercomputing. What is Cluster? A group of individual computers bundled together using hardware and software in order to make.
Cisco and NetApp Confidential. Distributed under non-disclosure only. Name Date FlexPod Entry-level Solution FlexPod Value, Sized Right for Smaller Workloads.
1 petaFLOPS+ in 10 racks TB2–TL system announcement Rev 1A.
11 Establishing the Framework for Datacenter of the Future Richard Curran Director Product Marketing, Intel EMEA.
Aim High…Fly, Fight, Win NWP Transition from AIX to Linux Lessons Learned Dan Sedlacek AFWA Chief Engineer AFWA A5/8 14 MAR 2011.
Accelerating SQL Database Operations on a GPU with CUDA Peter Bakkum & Kevin Skadron The University of Virginia GPGPU-3 Presentation March 14, 2010.
© 2009 Cisco Systems, Inc. All rights reserved. Cisco Public Presentation_ID 1 Session ID 20PT Unified Computing System Marco Kuendig
© 2013 Mellanox Technologies 1 NoSQL DB Benchmarking with high performance Networking solutions WBDB, Xian, July 2013.
Appro Products and Solutions Anthony Kenisky, Vice President of Sales Appro, Premier Provider of Scalable Supercomputing Solutions: 4/16/09.
Bob Thome, Senior Director of Product Management, Oracle SIMPLIFYING YOUR HIGH AVAILABILITY DATABASE.
Microsoft Virtual Academy. Microsoft Virtual Academy.
LARGE SCALE DEPLOYMENT OF DAP AND DTS Rob Kooper Jay Alemeda Volodymyr Kindratenko.
Copyright 2009 Fujitsu America, Inc. 0 Fujitsu PRIMERGY Servers “Next Generation HPC and Cloud Architecture” PRIMERGY CX1000 Tom Donnelly April
11 Maximizing Datacenter Efficiencies Realizing the Cloud. Richard Curran Director Product Marketing, Intel EMEA.
Enterprise Storage A New Approach to Information Access Darren Thomas Vice President Compaq Computer Corporation.
Workload Optimized Processor
Use/User:LabServerField Engineer Electrical Engineer Software Engineer Mechanical Engineer Requirements: Small form factor.
© 2012 IBM Corporation IBM Flex System™ The elements of an IBM PureFlex System.
Gilad Shainer, VP of Marketing Dec 2013 Interconnect Your Future.
FlashSystem family 2014 © 2014 IBM Corporation IBM® FlashSystem™ V840 Product Overview.
Appro Products and Solutions Anthony Kenisky, Vice President of Sales Appro, Premier Provider of Scalable Supercomputing Solutions: 9/1/09.
Are Supercomputers returning to Investment Banking?
High Performance Computing Processors Felix Noble Mirayma V. Rodriguez Agnes Velez Electric and Computer Engineer Department August 25, 2004.
Taking the Complexity out of Cluster Computing Vendor Update HPC User Forum Arend Dittmer Director Product Management HPC April,
 Cloud computing is the use of computing resources (hardware and software) that are delivered as a service over a network (typically the Internet). 
11 Copyright © 2009 Juniper Networks, Inc. ANDY INGRAM VP FST PRODUCT MARKETING & BUSINESS DEVELOPMENT.
March 9, 2015 San Jose Compute Engineering Workshop.
Guangdeng Liao, Xia Zhu, Steen Larsen, Laxmi Bhuyan, Ram Huggahalli University of California, Riverside Intel Labs.
1 Recommendations Now that 40 GbE has been adopted as part of the 802.3ba Task Force, there is a need to consider inter-switch links applications at 40.
© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice. HP Update IDC HPC Forum.
Mellanox Connectivity Solutions for Scalable HPC Highest Performing, Most Efficient End-to-End Connectivity for Servers and Storage April 2010.
Revision - 01 Intel Confidential Page 1 Intel HPC Update Norfolk, VA April 2008.
CloudLab Aditya Akella. CloudLab 2 Underneath, it’s GENI Same APIs, same account system Even many of the same tools Federated (accept each other’s accounts,
© Copyright 2014 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice. Leading in the compute.
Tackling I/O Issues 1 David Race 16 March 2010.
1 Benchmarking Cloud Serving Systems with YCSB Brian F. Cooper, Adam Silberstein, Erwin Tam, Raghu Ramakrishnan and Russell Sears Yahoo! Research.
Open Source and Business Issues © 2004 Northrop Grumman Corp. All rights reserved. 1 Grid and Beowulf : A Look into the Near Future NorthNorth F-18C Weapons.
Hartree Centre systems overview. Public nameInternal nameTechnologyService type Blue WonderInvictax86 SandyBridgeproduction Blue WonderNapierx86 IvyBridgeproduction.
Red Hat Enterprise Linux Presenter name Title, Red Hat Date.
© 2008 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice ProLiant G5 to G6 Processor Positioning.
The Moonshot BladeSystem By Hewlett-Packard Carl J. Hoppe 7 October 2013 COSC
V2 March © 2015 Citrix | Confidential The perfect solution for the trading floor: HPE Moonshot Trader Workstation with Citrix XenDesktop.
Extreme Scale Infrastructure
Designing the Physical Architecture
Valid Until End of January 2017
Low-Cost High-Performance Computing Via Consumer GPUs
Company Product with Intel Solution Product Focus
40% More Performance per Server 40% Lower HW costs and maintenance
Flex System Enterprise Chassis
Appro Xtreme-X Supercomputers
Windows Server* 2016 & Intel® Technologies
OCP: High Performance Computing Project
Low Latency Analytics HPC Clusters
HARDWARE DESIGNED FOR A SOFTWARE DEFINED WORLD
Server Innovation Accelerates IT Transformation
12/26/2018 5:07 AM Leap forward with fast, agile & trusted solutions from Intel & Microsoft* Eman Yarlagadda (for Christine McMonigal) Hybrid Cloud – Product.
IBM Power Systems.
SERVER INNOVATION ACCELERATES IT TRANSFORMATION
Ampere for the openEDGE
Presentation transcript:

1 AppliedMicro X-Gene ® ARM Processors Optimized Scale-Out Solutions for Supercomputing

2 AppliedMicro X-Gene ® Processor Philosophy Few workloads are compute bound –Most are limited by memory capacity, bandwidth, or I/O –HPC workloads are better served by GPGPU Scale-out versus scale-up –High density –Performance per Watt –Performance per $ Balance –Strong CPU with an optimized ARMv8 core –Large memory capacity / bandwidth – adequate memory is not an upsell –Low power – power efficiency is not an upsell Open Source –Open Source Software –Open Source Hardware

33 A bit about X-Gene ® ARM technology being deployed in the enterprise, today…

4 Emergence of the Optimized ARM Scale-Out Data Center Processor Architecture Strong compute Large memory Low power Cost-effective Software Ecosystem Mature, optimized toolchain Broad Linux support Open-source workloads System Architecture High Density Power-efficient Top-Tier Suppliers

5 X-Gene ® Technology in the Enterprise Demonstrating Value in Leading IT organizations - Today Node Density / Rack 900% Higher Power Consumption 85% Lower Acquisition Cost 45% Lower Source: PayPal

6 Real Workload Performance Web Server (WRK Benchmark) AppliedMicro X-Gene ® 2 Intel Xeon ® E5-2630v3 Bandwidth (higher is better) Latency (lower is Better) Performance (higher is better) KRPS ms Gbps X-Gene GHz) 4 node 1U / ½ width sled 64GB DDR x 10GbE (integrated) Wall power: ~190 Watts Xeon e5-2630v3 2.4 GHz) 2P 1U / ½ width sled 64GB DDR x 10GbE (NIC) Wall power: ~180 Watts Up to 35% Higher Performance | Lower TCO Standard CPU benchmarks do not always translate to delivered workload performance Up to 35% Higher Performance | Lower TCO Standard CPU benchmarks do not always translate to delivered workload performance Source: AppliedMicro

7 Real Workload Performance In-Memory Database (MongoDB - YCSB) 1U / 2P Rack Server Intel™ Xeon® E5-2630v3 16C/32T 2.4GHz Turbo/HT 64GB DDR port 10GbE Mellanox NIC Ubuntu LTS 24-port 10GbE Netgear™ Switch 5 Clients Intel™ Xeon e3-1270v3 4C/8T 3.5GHz Turbo /HT 32GB DDR port 10GbE Mellanox NIC Ubuntu LTS HP Moonshot m400 – 1 cartridge AppliedMicro X-Gene® CPU 8-Core 2.4GHz 64GB DDR GbE Integrated Ethernet RHEL 7.1 Beta with UEFI support Hardware Topology

8 Real Workload Performance In-Memory Database (MongoDB - YCSB) 42U Rack, 9 Moonshot m400 chassis/rack Rack-Level Scalability 5x the throughput of a Haswell Xeon® e5 2P rack server implementation Rack-Level Scalability 5x the throughput of a Haswell Xeon® e5 2P rack server implementation

9 Lower TCO with X-Gene ® Technology Web / Application 30kW/Rack Traditional Intel Xeon ® E5 2P/1U 270 Web Servers + 45 Application Servers Nine racks 35 servers per rack 2 TOR switches per rack 315 total nodes Web servers (32GB) App servers (64GB) 45 HP Moonshot with m400 (X-Gene ® CPU) 270 Web Servers + 90 Application Servers* One rack 8 Moonshot Chassis 2 TOR Switches 360 total 1P m400 nodes 55%+ Hardware Acquisition Cost Savings Additional TCO reduction via simplified management and lower power 55%+ Hardware Acquisition Cost Savings Additional TCO reduction via simplified management and lower power Source: HP

10 X-Gene ® ARM Processor Software Ecosystem PEAP UBOOT Compilers Management UEFI

11 …but what about High Performance Computing?

12 AppliedMicro X-Gene ® HPC Philosophy Workloads are compute bound –…but ‘general purpose’ compute is not the path to exascale Scale-out versus scale-up –High density –Performance per Watt –Performance per $ Balance –Power-efficient CPU with an optimized, power-efficient ARMv8 core –High performance GPUs for the ‘heavy lifting’ –A better alternative to ‘brute force’ high performance computing Open Source –Open Source Software –Open Source Hardware

13 X-Gene ® Processor Platforms Multiple SKUs from Leading OEM and ODM Partners Multiple New Platforms in Development

14 The ARM Revolution has Expanded to Supercomputing 64-bit X-Gene ® ARM Servers in production today The “one size fits all” data center is no longer sufficient AppliedMicro is powering the transition –Proven: real customers in production today –Performance: balanced 64-bit ARM compute with large memory –Economics: TCO savings via both lowered CapEx and OpEx University of Utah Cloudlab on 315 ARM nodes: “HP Moonshot is a first-of-a-kind system that’s enabling us to extend the range of our calculations to solve really complex problems in a highly efficient 64-bit architecture.” James Ang, Technical Manager, Sandia National Laboratories "HP Moonshot offers capabilities that will be critical to the future of cloud computing. It empowers researchers to develop fundamental breakthroughs that have the potential to change the performance, reliability, and security of future clouds.” Robert Ricci, Research Asst. Professor of Comp. Science, University of Utah

15 X-Gene ® Processors in HPC The Efficiency of ARM & the Power of Tesla™ GPUs Speed-up Relative to CPU-Only Source: nVidia Workload Profile X-Gene ® CPU Nvidia Tesla K20 Xeon ® e Nvidia Tesla K20 Xeon ® e X-Gene ® CPU Nvidia Tesla K20 Xeon ® e Nvidia Tesla K20 Xeon ® e X-Gene ® CPU Nvidia Tesla K20 Xeon ® e Nvidia Tesla K20 Xeon ® e GPU CPU GPU CPU GPU CPU Workload Profile Workload Profile All code recompiled to ARM64, no optimizations CPU: 45 Watts $349 CPU: 45 Watts $349 CPU: 150 Watts $1,885 CPU: 150 Watts $1,885 CPU: 45 Watts $349 CPU: 45 Watts $349 CPU: 150 Watts $1,885 CPU: 150 Watts $1,885 CPU: 45 Watts $349 CPU: 45 Watts $349 CPU: 130 Watts $2,614 CPU: 130 Watts $2,614

16 Delivering Performance that Matters. There is a better answer to ‘brute force’ HPC: heterogeneous compute Platforms with X-Gene® ARM technology and Nvidia GPUs is in production The software ecosystem is established The results are compelling

17