SGI's Platform Strategy: Addressing the Productivity Gap in HPC Dave Parry Senior Vice President and General Manager Server and Platform Group Silicon.

Slides:



Advertisements
Similar presentations
Archive Task Team (ATT) Disk Storage Stuart Doescher, USGS (Ken Gacke) WGISS-18 September 2004 Beijing, China.
Advertisements

© 2006 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice Enigma Data’s SmartMove.
Commodity Computing Clusters - next generation supercomputers? Paweł Pisarczyk, ATM S. A.
♦ Commodity processor with commodity inter- processor connection Clusters Pentium, Itanium, Opteron, Alpha GigE, Infiniband, Myrinet, Quadrics, SCI NEC.
Ver 0.1 Page 1 SGI Proprietary Introducing the CRAY SV1 CRAY SV1-128 SuperCluster.
SGI’2000Parallel Programming Tutorial Supercomputers 2 With the acknowledgement of Igor Zacharov and Wolfgang Mertz SGI European Headquarters.
25 Years of Changing the World Q3 FY08. SGI PROPRIETARY Who Is SGI Our people provide the best compute, storage and visualization solutions on the planet…
LANs and WANs Network size, vary from –simple office system (few PCs) to –complex global system(thousands PCs) Distinguish by the distances that the network.
Seagate Hybrid Systems & Solutions
Silicon Graphics, Inc. Cracow ‘03 Grid Workshop SAN over WAN - a new way of solving the GRID data access bottleneck Dr. Wolfgang Mertz Business Development.
Copyright 2009 FUJITSU TECHNOLOGY SOLUTIONS PRIMERGY Servers and Windows Server® 2008 R2 Benefit from an efficient, high performance and flexible platform.
Silicon Graphics, Inc. Poster Presented by: SGI Proprietary Technologies for Breakthrough Research Rosario Caltabiano North East Higher Education & Research.
Multiprocessors ELEC 6200 Computer Architecture and Design Instructor: Dr. Agrawal Yu-Chun Chen 10/27/06.
Server Platforms Week 11- Lecture 1. Server Market $ 46,100,000,000 ($ 46.1 Billion) Gartner.
NPACI: National Partnership for Advanced Computational Infrastructure August 17-21, 1998 NPACI Parallel Computing Institute 1 Cluster Archtectures and.
AN INTRODUCTION TO LINUX OPERATING SYSTEM Zihui Han.
Mass RHIC Computing Facility Razvan Popescu - Brookhaven National Laboratory.
Design and Implementation of a Single System Image Operating System for High Performance Computing on Clusters Christine MORIN PARIS project-team, IRISA/INRIA.
Copyright © 2002 VERITAS Software Corporation. All rights reserved. VERITAS, the VERITAS logo, and all other VERITAS product names and slogans are trademarks.
SGI Proprietary SGI Update IDC HPC User Forum September, 2008.
1 Advanced Storage Technologies for High Performance Computing Sorin, Faibish EMC NAS Senior Technologist IDC HPC User Forum, April 14-16, Norfolk, VA.
Hosted by Case Study - Storage Consolidation Steve Curry Yahoo Inc.
SGI Contributions to Supercomputing by 2010 Steve Reinhardt Director of Engineering
Enterprise Storage A New Approach to Information Access Darren Thomas Vice President Compaq Computer Corporation.
Meeting the Data Protection Demands of a 24x7 Economy Steve Morihiro VP, Programs & Technology Quantum Storage Solutions Group
CLUSTER COMPUTING STIMI K.O. ROLL NO:53 MCA B-5. INTRODUCTION  A computer cluster is a group of tightly coupled computers that work together closely.
Silicon Graphics, Inc. Re-Configurable Application Specific Computing (RASC/FPGA) David Alexander Director of Engineering.
Russ Miller Center for Computational Research Computer Science & Engineering SUNY-Buffalo Hauptman-Woodward Medical Inst IDF: Multi-Core Processing for.
NCICB Systems Architecture Bill Britton Terrapin Systems LPG/NCICB Dedicated Support.
The Red Storm High Performance Computer March 19, 2008 Sue Kelly Sandia National Laboratories Abstract: Sandia National.
Copyright © 2001 VERITAS Software Corporation. All Rights Reserved. VERITAS, VERITAS SOFTWARE, the VERITAS logo and VERITAS The Intelligent Storage Software.
High Performance Computing Processors Felix Noble Mirayma V. Rodriguez Agnes Velez Electric and Computer Engineer Department August 25, 2004.
Taking the Complexity out of Cluster Computing Vendor Update HPC User Forum Arend Dittmer Director Product Management HPC April,
Engr. Gideon Fatunmbi 05/08/2015 FAS2200 Series Customer Presentation.
Copyright © 2002, Intel Corporation. All rights reserved. *Other brands and names are the property of their respective owners
Storage Trends: DoITT Enterprise Storage Gregory Neuhaus – Assistant Commissioner: Enterprise Systems Matthew Sims – Director of Critical Infrastructure.
GStore: GSI Mass Storage ITEE-Palaver GSI Horst Göringer, Matthias Feyerabend, Sergei Sedykh
Sandor Acs 05/07/
IBM Linux Update for CAVMEN Jim Savoie Linux Sales IBM Americas Group October 21, 2004.
Oracle RAC and Linux in the real enterprise October, 02 Mark Clark Director Merrill Lynch Europe PLC Global Database Technologies October, 02 Mark Clark.
1 U.S. Department of the Interior U.S. Geological Survey Contractor for the USGS at the EROS Data Center EDC CR1 Storage Architecture August 2003 Ken Gacke.
A High-Performance Scalable Graphics Architecture Daniel R. McLachlan Director, Advanced Graphics Engineering SGI.
1 Storage Strategy for the new Millennium. 2 Today’s issues Across the Enterprise. Managing the growth. Managing across the Enterprise. Resources and.
Hosting an Enterprise Financial Forecasting Application with Terminal Server Published: June 2003.
Headline in Arial Bold 30pt HPC User Forum, April 2008 John Hesterberg HPC OS Directions and Requirements.
Large Scale Parallel File System and Cluster Management ICT, CAS.
1 Public DAFS Storage for High Performance Computing using MPI-I/O: Design and Experience Arkady Kanevsky & Peter Corbett Network Appliance Vijay Velusamy.
O AK R IDGE N ATIONAL L ABORATORY U.S. D EPARTMENT OF E NERGY Facilities and How They Are Used ORNL/Probe Randy Burris Dan Million – facility administrator.
CLASS Information Management Presented at NOAATECH Conference 2006 Presented by Pat Schafer (CLASS-WV Development Lead)
Next Generation Operating Systems Zeljko Susnjar, Cisco CTG June 2015.
Intel Research & Development ETA: Experience with an IA processor as a Packet Processing Engine HP Labs Computer Systems Colloquium August 2003 Greg Regnier.
CASPUR Site Report Andrei Maslennikov Lead - Systems Amsterdam, May 2003.
Outline Why this subject? What is High Performance Computing?
ClinicalSoftwareSolutions Patient focused.Business minded. Slide 1 Opus Server Architecture Fritz Feltner Sept 7, 2007 Director, IT and Systems Integration.
IBM eServer xSeries Technical Conference © IBM Corporation Session ID: O24 Steve Dobbelstein Lake Buena Vista, FL September 8-12, 2003 Enterprise.
Mass Storage at SARA Peter Michielse (NCF) Mark van de Sanden, Ron Trompert (SARA) GDB – CERN – January 12, 2005.
Tackling I/O Issues 1 David Race 16 March 2010.
© 2009 IBM Corporation Statements of IBM future plans and directions are provided for information purposes only. Plans and direction are subject to change.
Background Computer System Architectures Computer System Software.
Solving Today’s Data Protection Challenges with NSB 1.
High Performance Storage System (HPSS) Jason Hick Mass Storage Group HEPiX October 26-30, 2009.
Univa Grid Engine Makes Work Management Automatic and Efficient, Accelerates Deployment of Cloud Services with Power of Microsoft Azure MICROSOFT AZURE.
DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING CLOUD COMPUTING
Application of General Purpose HPC Systems in HPEC
Tivoli Storage Manager 4.1
Appro Xtreme-X Supercomputers
IBM Linux Update for CAVMEN
XenData SX-550 LTO Archive Servers
Storage Trends: DoITT Enterprise Storage
IBM Power Systems.
Presentation transcript:

SGI's Platform Strategy: Addressing the Productivity Gap in HPC Dave Parry Senior Vice President and General Manager Server and Platform Group Silicon Graphics, Inc.

Cost Software IT & Engineering Personnel ~ $50/Hr 2002 IT Costs Now in People and Software Basic Hardware ~ $1/Hr Vector RISC Commodity Changing Economics in HPC

Worldwide Production of Information Exabytes Source: Gartner Group Datasets are Getting (Much) Bigger, Too Satellite Systems Archive Growth Source: NOAA

Programming Is Getting Harder (AKA The Folly of “Least Common Denominator Computing”) OpenMP™. a = b SHMEM or MPI2 (one-sided). C 1 "get(b)" i.e. i=shmem_int_get(b) i=MPI_get(b) MPI (two-sided). C 2 "recv"; to wait for C 1 ’s request. C 1 "send"; to ask C 2 for ”b". C 2 finds that ”b" is needed by C 1. C 2 does a local "get(b)". C 2 does a "send(b)". C 1 does a "recv(b)" i.e. a=MPI_recv(b) C1C1 C2C2 mem. space for “a” mem. space for “b” To copy/transfer the value stored in “b” to “a”...

Memory Is Getting “Slower” Origin ® MHz Origin ® MHz Origin MHz

Summing up the Productivity Picture Productivity = cost -1 * value * efficiency * usability Where: cost -1 == MFLOPS/dollar (Moore’s Law) value == hardware cost/cost of ownership efficiency == productive cycles/MFLOPS (Constant at 5–10%) usability == programming effort per productive cycle Productivity Moore’s Law Productivity Gap MFLOPs per acquisition dollar  Productive science per total dollar

Technology Directions to Close the Gap

Visualization Computation A Data-Centric View of Each Aspect of HPC Data Access Focus: One shared view of the data with pervasive access Focus: One shared view of the dataset with pervasive access Focus: One shared view of the visual model with pervasive access

? Image courtesy of Janssen Pharmaceuticals Visual Data Visual Data HPC/Capability HPC/Capacity We Want to Work Differently Grid Infrastructure

A Different View of System Architecture Scalable Shared Memory. Globally addressable. Thousands of ports. Flat & high bandwidth. Flexible & configurable Terascale to Petascale Data Set : Bring Function to Data Compute IO Graphics

Cost Software IT & Engineering Personnel ~ $50/Hr 2002 IT Costs Now in People and Software Basic Hardware ~ $1/Hr Vector RISC Commodity Changing Economics in HPC Challenges: “Impedance match” to HPC applications Availability of HPC-class architectures

Use an HPC Processor for HPC Applications

x advantage for Altix on 2P. Best Opteron result was run with single user mode and interleaved memory banks. SPECfp_rate_base2000 Use an HPC Processor for HPC Applications

Q22H 64p 128p 256p 1024p+ CY2001CY2003 Max NUMAlinked System Size Max Kernel Image or Partition Size 1H2H Max SMP System Size 2p SGI 750 Altix-Itanium2 512p Altix-Madison Combine Your HPC Processor with an HPC Architecture 1H2H CY2004 Altix-Madison9M

Altix, 1.3 Ghz is 1.46x faster than IBM eServer p690, 1.3 Ghz at 128P Altix, 1.5 Ghz is 16% faster than p690, 1.7 Ghz in spite of a lower peak flop rate. Combine Your HPC Processor with an HPC Architecture Source: July 24, 2003 and SGI performance reports Linpack HPC (NxN) Performance

World-record result for 64 and 32-processor systems SGI’s 1.5Ghz, 32P result is 2x better performance than IBM eServer p690, 1.7 Ghz SGI’s 1.3Ghz, 64P result is 1.95x better than Sun Fire 15K, 1.2 Ghz. Combine Your HPC Processor with an HPC Architecture SPECfp_rate_base2000 Performance

New Paradigms (usability) Single Physical Mem. Single O.S. Cache Coherent SGI® NUMA SGI® Origin 3000 SSI 512–1024P Cluster Tools IBM >32P Compaq >32P Sun >64P HP >64P 3 Run OpenMP™ codes Run MPI codes Single Address Space Single Admin. View Bus/Switch IBM® 32P Compaq 32P Sun™ 64P HP™ 64P Cluster D.I.Y. PCs connected 3 New_1? (App Level) New_2? (App Level)

Global Shared Memory between Supercluster Nodes C-Brick Power Bay R-Brick C-Brick R-Brick C-Brick Power Bay C-Brick Power Bay R-Brick C-Brick R-Brick C-Brick Power Bay C-Brick IX-Brick 64P Partition Operating System C-Brick Power Bay R-Brick C-Brick R-Brick C-Brick Power Bay C-Brick Power Bay R-Brick C-Brick R-Brick C-Brick Power Bay C-Brick IX-Brick 64P Partition MPI/shmem app OpenMP™ app CPU_SETS System layer MPI/shmem app Parallel Scheduler, Array Services

The 64P result is a world-record result for a microprocessor-based system and fifth overall 1.56x better performance than IBM eServer p690 at 32P * 128 CPU result uses MPI code to run on Altix Supercluster with two 64P nodes, for smaller CPU counts OpenMP code was used. STREAM Triad Results SSI and SuperCluster Configs

A Path to Architectural Convergence Defense and Homeland Security Media ManufacturingScienceEnergy Origin Altix Origin Altix Application Specific Compute Multi-Paradigm Architecture

A Different View of System Architecture Scalable Shared Memory. Globally addressable. Thousands of ports. Flat & high bandwidth. Flexible & configurable Terascale to Petascale Data Set : Bring Function to Data Reconfigurable Compute IO Graphics Compute Reconfigurable

Multi-Paradigm Computing UltraViolet Scalable Shared Memory. Globally addressable. Thousands of ports. Flat & high bandwidth. Flexible & configurable Terascale to Petascale Data Set : Bring Function to Data Reconfigurable Scalar Vector IO Graphics Vector Streaming Scalar

Visualization Computation A Data-Centric View of Each Aspect of HPC Data Access Focus: One shared view of the data with pervasive access Focus: One shared view of the dataset with pervasive access Focus: One shared view of the visual model with pervasive access

Innovation workflow means data must be shared Design Compute Data Imagine Post- process Visualize Decide Adapting to the way people work:  From the original concept to the final result, data is at the core of the workflow  Information is shared between groups, and data is moved between hosts  Data sets grow at each step  Processes are improved when data copy is avoided, shortening time to insight

SGI in Data Management Integrated HW / SW Solutions SGI ® Data Management Legato, XFS™ / XVM, Snapshot FailSafe™, Fail Over Data Migration Facility / Tape Management Facility SGI ® File Server Scalable Bandwidth Storage Management, SAN Topology, SAN Cluster Management, TP900, TP9100, TP9500, HDS 9960, Ciprico 7000 and TALON™, Brocade, STK, ADIC, SGI Firmware DAS Scalability to over 12 GByte/s and up to 18 M TB Backup Archive / HSM Data Sharing High Availability RAID, JBOD, Hub, Switch, HBA, Tape DAS, NAS, SAN SGI ® SAN Server 1000 Management Topology Monitoring CXFS™, Samba / Cifs, BDS, NFS, FTP,..

SAN with CXFS: High performance data sharing with unlimited scale LAN SAN A unique high performance solution : Each host share one or more volumes consolidated in one or more RAID array. Centralized storage management High modularity True High Performances Data sharing, near local File System performances. Fully Resilient (HA) Fully POSIX Compliant As easy to share files as with NFS, but faster Windows NT ® & 2k SGI ® IRIX Sun TM Solaris Linux 64 for Altix IBM AIX Linux 32 More Under Development True Heterogeneity

Faster than WAN FTP or NFS Single name space = easy to administer, no data copies CXFS Usage - Wide Area & GRID Data Sharing SAN across distances of up to 8000KM

Data Lifecycle Management Storage Hierarchy & TCO Model with DMF TP9400 STK L700 w/9840 Primary Storage Online - high-performance disk Demote > 7 days < 365 Demote > 1 Yr < 2 Yr Promote used last 24 hrs Promote used last 7 days Nearline Disk High Capacity, Low cost, Lower performance Tape Libraries high-performance archive DMF manages data from one platform to another based on: age of file size of file type of file Archive > 2 Yr

SGI ® High-Performance Data Management Leadership Top performance and virtually unlimited scalability –Broke 3 Gbyte/sec SAN barrier (2000) –Delivering first 12 GB/sec (15GB peak) SAN (2002) –First 2 GB SAN Fabric (2001) –Wide area data sharing (2002) –Broke backup record - 10 Tbyte in an hour (2003)

Summing up the Productivity Picture Productivity = cost -1 * value * efficiency * usability Productivity Moore’s Law Productivity Gap Moore’s Law Productivity

Productivity in weather and climate HPC - SGI Altix Brings serious supercomputing capability to Linux Robust multi-OS shared filesystem with unmatched scale Porting of many key development and administration tools Ease of use from largest node size in the industry Environmental codes being ported, optimized, scaled

POP performance 1 degree global problem Forecast years/wallclock day Altix 1.3GHz ES40 Altix 1.5GHz (scaled)

MM5 performance T3a case Altix 1.5Ghz IBM p Ghz Xeon 2.2Ghz/myrinet Athlon 1.4Ghz/Dolphin SCI (all are MPI)

Other applications

© 2003 Silicon Graphics, Inc. All rights reserved. Silicon Graphics, SGI, Origin, OpenGL, XFS, InfiniteReality, IRIX, and the SGI logo are registered trademarks and OpenMP, NUMAflex, CXFS, InfinitePerformance, and the Silicon Graphics logo are trademarks of Silicon Graphics, Inc., in the United States and/or other countries worldwide. R10000 is a registered trademark of MIPS Technologies, Inc. Pentium and Itanium are registered trademarks of Intel Corporation. Windows is a registered trademark or trademark of Microsoft Corporation in the United States and/or other countries worldwide. Linux is a registered trademark of Linus Torvalds. All other trademarks mentioned herein are the property of their respective owners. (06/03)