On the Path to Trinity - Experiences Bringing Codes to the Next Generation ASC Platform Courtenay T. Vaughan and Simon D. Hammond Sandia National Laboratories.

Slides:



Advertisements
Similar presentations
Sandia is a multiprogram laboratory operated by Sandia Corporation, a Lockheed Martin Company, for the United States Department of Energy under contract.
Advertisements

Michal Merta Alena Vašatová Václav Hapla David Horák
IBM 1350 Cluster Expansion Doug Johnson Senior Systems Developer.
Sandia National Laboratories is a multi-program laboratory managed and operated by Sandia Corporation, a wholly owned subsidiary of Lockheed Martin Corporation,
Sandia National Laboratories is a multi-program laboratory managed and operated by Sandia Corporation, a wholly owned subsidiary of Lockheed Martin Corporation,
1 Approved for unlimited release as SAND C Verification Practices for Code Development Teams Greg Weirs Computational Shock and Multiphysics.
Unstructured Data Partitioning for Large Scale Visualization CSCAPES Workshop June, 2008 Kenneth Moreland Sandia National Laboratories Sandia is a multiprogram.
Problem Uncertainty quantification (UQ) is an important scientific driver for pushing to the exascale, potentially enabling rigorous and accurate predictive.
Sandia National Laboratories is a multi-program laboratory managed and operated by Sandia Corporation, a wholly owned subsidiary of Lockheed Martin Corporation,
OpenFOAM on a GPU-based Heterogeneous Cluster
Exploring Communication Options with Adaptive Mesh Refinement Courtenay T. Vaughan, and Richard F. Barrett Sandia National Laboratories SIAM Computational.
Scientific Computing on Heterogeneous Clusters using DRUM (Dynamic Resource Utilization Model) Jamal Faik 1, J. D. Teresco 2, J. E. Flaherty 1, K. Devine.
1 Parallel Simulations of Underground Flow in Porous and Fractured Media H. Mustapha 1,2, A. Beaudoin 1, J. Erhel 1 and J.R. De Dreuzy IRISA – INRIA.
Sandia is a multiprogram laboratory operated by Sandia Corporation, a Lockheed Martin Company, for the United States Department of Energy’s National Nuclear.
Sandia is a multiprogram laboratory operated by Sandia Corporation, a Lockheed Martin Company, for the United States Department of Energy’s National Nuclear.
Iterative and direct linear solvers in fully implicit magnetic reconnection simulations with inexact Newton methods Xuefei (Rebecca) Yuan 1, Xiaoye S.
LTE Review (September 2005 – January 2006) January 17, 2006 Daniel M. Dunlavy John von Neumann Fellow Optimization and Uncertainty Estimation (1411) (8962.
Sandia National Laboratories is a multi-program laboratory managed and operated by Sandia Corporation, a wholly owned subsidiary of Lockheed Martin Corporation,
Dax: Rethinking Visualization Frameworks for Extreme-Scale Computing DOECGF 2011 April 28, 2011 Kenneth Moreland Sandia National Laboratories SAND P.
SAINT2002 Towards Next Generation January 31, 2002 Ly Sauer Sandia National Laboratories Sandia is a multiprogram laboratory operated by Sandia Corporation,
Massively Parallel Magnetohydrodynamics on the Cray XT3 Joshua Breslau and Jin Chen Princeton Plasma Physics Laboratory Cray XT3 Technical Workshop Nashville,
LLNL-PRES-XXXXXX This work was performed under the auspices of the U.S. Department of Energy by Lawrence Livermore National Laboratory under Contract DE-AC52-07NA27344.
Sandia is a multiprogram laboratory operated by Sandia Corporation, a Lockheed Martin Company, for the United States Department of Energy’s National Nuclear.
Sandia National Laboratories is a multi-program laboratory managed and operated by Sandia Corporation, a wholly owned subsidiary of Lockheed Martin Corporation,
Trilinos: From a User’s Perspective Russell Hooper Nov. 7, 2007 SAND P Sandia is a multiprogram laboratory operated by Sandia Corporation, a Lockheed.
Integrating Trilinos Solvers to SEAM code Dagoberto A.R. Justo – UNM Tim Warburton – UNM Bill Spotz – Sandia.
Principles of Scalable HPC System Design March 6, 2012 Sue Kelly Sandia National Laboratories Abstract: Sandia National.
HPC Technology Track: Foundations of Computational Science Lecture 1 Dr. Greg Wettstein, Ph.D. Research Support Group Leader Division of Information Technology.
Data Intensive Computing at Sandia September 15, 2010 Andy Wilson Senior Member of Technical Staff Data Analysis and Visualization Sandia National Laboratories.
Large-Scale Stability Analysis Algorithms Andy Salinger, Roger Pawlowski, Ed Wilkes Louis Romero, Rich Lehoucq, John Shadid Sandia National Labs Albuquerque,
Breakdown voltage calculations using PIC-DSMC Paul S. Crozier, Jeremiah J. Boerner, Matthew M. Hopkins, Christopher H. Moore, Lawrence C. Musson Sandia.
Page 1 Embedded Sensitivities and Optimization From Research to Applications Roscoe A. Bartlett Department of Optimization & Uncertainty Estimation Sandia.
The Red Storm High Performance Computer March 19, 2008 Sue Kelly Sandia National Laboratories Abstract: Sandia National.
© 2012 Maplesoft, a division of Waterloo Maple Inc. A CYBERNET group company General Purpose Symbolic Computation Environment Equations are not simply.
Sandia National Laboratories is a multi-program laboratory managed and operated by Sandia Corporation, a wholly owned subsidiary of Lockheed Martin Corporation,
ParCFD Parallel computation of pollutant dispersion in industrial sites Julien Montagnier Marc Buffat David Guibert.
Strategies for Solving Large-Scale Optimization Problems Judith Hill Sandia National Laboratories October 23, 2007 Modeling and High-Performance Computing.
System Architecture: Near, Medium, and Long-term Scalable Architectures Panel Discussion Presentation Sandia CSRI Workshop on Next-generation Scalable.
High Performance Computational Fluid-Thermal Sciences & Engineering Lab GenIDLEST Co-Design Virginia Tech AFOSR-BRI Workshop July 20-21, 2014 Keyur Joshi,
LAMMPS Users’ Workshop
Danny Dunlavy, Andy Salinger Sandia National Laboratories Albuquerque, New Mexico, USA SIAM Parallel Processing February 23, 2006 SAND C Sandia.
1 1 What does Performance Across the Software Stack mean?  High level view: Providing performance for physics simulations meaningful to applications 
Interactive Computational Sciences Laboratory Clarence O. E. Burg Assistant Professor of Mathematics University of Central Arkansas Science Museum of Minnesota.
New Features in ML 2004 Trilinos Users Group Meeting November 2-4, 2004 Jonathan Hu, Ray Tuminaro, Marzio Sala, Michael Gee, Haim Waisman Sandia is a multiprogram.
Sandia is a multiprogram laboratory operated by Sandia Corporation, a Lockheed Martin Company, for the United States Department of Energy’s National Nuclear.
Threading Opportunities in High-Performance Flash-Memory Storage Craig Ulmer Sandia National Laboratories, California Maya GokhaleLawrence Livermore National.
STK (Sierra Toolkit) Update Trilinos User Group meetings, 2014 R&A: SAND PE Sandia National Laboratories is a multi-program laboratory operated.
Sandia is a multi-program laboratory operated by Sandia Corporation, a Lockheed Martin Company, for the United States Department of Energy’s National Nuclear.
Multifidelity Optimization Via Pattern Search and Space Mapping Genetha Gray Computational Sciences & Mathematics Research Sandia National Labs, Livermore,
A Dirichlet-to-Neumann (DtN)Multigrid Algorithm for Locally Conservative Methods Sandia National Laboratories is a multi program laboratory managed and.
Site Report DOECGF April 26, 2011 W. Alan Scott Sandia National Laboratories Sandia National Laboratories is a multi-program laboratory managed and operated.
Photos placed in horizontal position with even amount of white space between photos and header Sandia National Laboratories is a multi-program laboratory.
Algebraic Solvers in FASTMath Argonne Training Program on Extreme-Scale Computing August 2015.
Other Tools HPC Code Development Tools July 29, 2010 Sue Kelly Sandia is a multiprogram laboratory operated by Sandia Corporation, a.
Sandia National Laboratories is a multi-program laboratory managed and operated by Sandia Corporation, a wholly owned subsidiary of Lockheed Martin Corporation,
ANSYS, Inc. Proprietary © 2004 ANSYS, Inc. Chapter 5 Distributed Memory Parallel Computing v9.0.
Sandia National Laboratories is a multi-program laboratory managed and operated by Sandia Corporation, a wholly owned subsidiary of Lockheed Martin Corporation,
Petascale Computing Resource Allocations PRAC – NSF Ed Walker, NSF CISE/ACI March 3,
Sandia National Laboratories is a multi-program laboratory managed and operated by Sandia Corporation, a wholly owned subsidiary of Lockheed Martin Corporation,
TEMPLATE DESIGN © H. Che 2, E. D’Azevedo 1, M. Sekachev 3, K. Wong 3 1 Oak Ridge National Laboratory, 2 Chinese University.
Automated File Server Disk Quota Management May 13 th, 2008 Bill Claycomb Computer Systems Analyst Infrastructure Computing Systems Department Sandia is.
Evolution at CERN E. Da Riva1 CFD team supports CERN development 19 May 2011.
Sandia is a multiprogram laboratory operated by Sandia Corporation, a Lockheed Martin Company, for the United States Department of Energy’s National Nuclear.
CHAPTER 2 - EXPLICIT TRANSIENT DYNAMIC ANALYSYS
Super Computing By RIsaj t r S3 ece, roll 50.
ACCELERATING SPARSE CHOLESKY FACTORIZATION ON GPUs
Unstructured Grids at Sandia National Labs
Ray-Cast Rendering in VTK-m
Meros: Software for Block Preconditioning the Navier-Stokes Equations
Ph.D. Thesis Numerical Solution of PDEs and Their Object-oriented Parallel Implementations Xing Cai October 26, 1998.
Presentation transcript:

On the Path to Trinity - Experiences Bringing Codes to the Next Generation ASC Platform Courtenay T. Vaughan and Simon D. Hammond Sandia National Laboratories SIAM Parallel Processing for Scientific Computing April 14, 2016 SAND C Sandia National Laboratories is a multi-program laboratory managed and operated by Sandia Corporation, a wholly owned subsidiary of Lockheed Martin Corporation, for the U.S. Department of Energy’s National Nuclear Security Administration under contract DE-AC04-94AL85000.

Trinity Cray XC40 Total of about nodes –About half are Intel Haswell with 2 processors per node and 16 cores per processor running at 2.3 GHz and 128 GB memory per node –About half are 60+ core Intel Knights Landing processors - will be delivered later this year –Cray Aries Dragonfly interconnect Expect greater than 30 PetaFlops peak

Cielo and Chama Cielo is a Cray XE6 and is our current generation capability machine –8894 nodes with 2 oct-core AMD Magny-Cours processors running at 2.4 GHz –Cray Gemini 3D torus interconnect Chama is current generation capacity machine –Tri-Lab Capacity Cluster –1232 nodes with 2 Intel Sandy Bridge 8 core processors running at 2.6 GHz –Qlogic QDR-InfiniBand Fat-Tree interconnect

Focus Codes Focus on SIERRA applications –SIERRA/Solid Mechanics (SM) –SIERRA/Aerodynamics –SIERRA/Structural Dynamics (SD) SIERRA is a large C++ framework –provides framework for several codes –Includes several Third Party Libraries –Contains common C++ classes and methods –Common infrastructure for parallel codes

SIERRA/SM (Solid Mechanics)  A general purpose massively parallel nonlinear solid mechanics finite element code for explicit transient dynamics, implicit transient dynamics and quasi-statics analysis.  Built upon extensive material, element, contact and solver libraries for analyzing challenging nonlinear mechanics problems for normal, abnormal, and hostile environments.  Similar to LSDyna or Abaqus commercial software systems. 5

Summary of Sierra/Aero Unstructured meshes One and two equation turbulence models LES and Hybrid RANS Uses either FETI or Trilinos for sparse matrix operations and solvers. Assembly is substantial portion of the computational cost. turbulent flow past a cavity

SIERRA/SD Domain Areas 7 General Structural Dynamics, Finite Elements –Vibrations, normal modes, implicitly integrated transient dynamics, frequency response analysis –Shells, Solids, Beams, Point Masses –Complicated Large Structures –Typically many constraint equations Acoustics and Structural Acoustics –Even larger systems –More constraints –Infinite Elements (nonsymmetric) Optimization, UQ and Inverse Methods –Adjoint methods –Material and Parameter inversion –Verification and Validation 12/10/14 Sierra/SD

Code Characteristics SIERRA/SM extensively uses sparse direct solvers –the iterative solve requires a local solve, coarse solve, and a preconditioner –The preconditioning step dominates the cost (>90%). SIERRA/Aero uses Trilinos solvers –GMRES for solver with Symmetric Gauss-Seidel preconditioner SIERRA/SD uses Domain Decomposition solver for eigensolve –84% of non-MPI time spent in solver which makes use of BLAS routines

Code Times (in seconds) Code/problemCoresCieloChamaTrinity Aero implicit Aero explicit Aero hex Aero mixed SM SM refined Salinas

Comments on Performance All of the codes show Trinity faster than Cielo by a factor of about 2 on half as many nodes –Effectively a factor of four per node Most of the codes show Trinity 5% to 12% faster than Chama on half as many nodes –About a factor of two per node Salinas about 18% slower on Trinity than on Chama –Slightly worse than the difference in clock speeds

Summary We ported three codes to our new capability machine and compared its performance to our current machines –We got about a factor of four performance improvement per node over our current capability machine and a factor of two over our current capacity machine We are investigating the performance of SIERRA/SD We have started working on the Knight’s Landing portion of the machine