Chroma: An Application of the SciDAC QCD API(s) Bálint Joó School of Physics University of Edinburgh UKQCD Collaboration Soon to be moving to the JLAB.

Slides:



Advertisements
Similar presentations
Pulan Yu School of Informatics Indiana University Bloomington Web service based Varuna.Net.
Advertisements

I/O and the SciDAC Software API Robert Edwards U.S. SciDAC Software Coordinating Committee May 2, 2003.
Addressing the Trust Asymmetry Problem In Grid Computing with Encrypted Computation Peter A. Dinda Prescience Lab Department of Computer Science Northwestern.
SciDAC Software Infrastructure for Lattice Gauge Theory
Autonomic Systems Justin Moles, Winter 2006 Enabling autonomic behavior in systems software with hot swapping Paper by: J. Appavoo, et al. Presentation.
Nuclear Physics in the SciDAC Era Robert Edwards Jefferson Lab SciDAC 2009 TexPoint fonts used in EMF. Read the TexPoint manual before you delete this.
Dynamical Chiral Fermions The `Grail’ – dyn. chiral fermions Generation of dyn. chiral fermions configs –RBC on the RIKEN QCDOC – Jan 05 (some %) –UKQCD.
Lattice QCD Comes of Age y Richard C. Brower XLIst Rencontres de Moriond March QCD and Hadronic interactions at high energy.
1 Coven a Framework for High Performance Problem Solving Environments Nathan A. DeBardeleben Walter B. Ligon III Sourabh Pandit Dan C. Stanzione Jr. Parallel.
BU SciDAC Meeting Balint Joo Jefferson Lab. Anisotropic Clover Why do it ?  Anisotropy -> Fine Temporal Lattice Spacing at moderate cost  Combine with.
GridRPC Sources / Credits: IRISA/IFSIC IRISA/INRIA Thierry Priol et. al papers.
QDP++ and Chroma Robert Edwards Jefferson Lab
Algorithms for Lattice Field Theory at Extreme Scales Rich Brower 1*, Ron Babich 1, James Brannick 2, Mike Clark 3, Saul Cohen 1, Balint Joo 4, Tony Kennedy.
HackLatt MILC with SciDAC C Carleton DeTar HackLatt 2008.
MILC Code Basics Carleton DeTar KITPC MILC Code Capabilities Molecular dynamics evolution –Staggered fermion actions (Asqtad, Fat7, HISQ,
Application architectures
Active Messages: a Mechanism for Integrated Communication and Computation von Eicken et. al. Brian Kazian CS258 Spring 2008.
HackLatt MILC Code Basics Carleton DeTar HackLatt 2008.
A status report of the QCDSF N f =2+1 Project Yoshifumi Nakamura (NIC/DESY) for the QCDSF collaboration Lattice Regensburg Aug. 3, 2007.
SciDAC Software Infrastructure for Lattice Gauge Theory DOE Grant ’01 -- ’03 (-- ’05?) All Hands Meeting: FNAL Feb. 21, 2003 Richard C.Brower Quick Overview.
I/O and the SciDAC Software API Robert Edwards U.S. SciDAC Software Coordinating Committee May 2, 2003.
The science of simulation falsification algorithms phenomenology machines better theories computer architectures non-perturbative QFT experimental tests.
The SAM-Grid Fabric Services Gabriele Garzoglio (for the SAM-Grid team) Computing Division Fermilab.
Lecture 29 Fall 2006 Lecture 29: Parallel Programming Overview.
Katanosh Morovat.   This concept is a formal approach for identifying the rules that encapsulate the structure, constraint, and control of the operation.
EUROPEAN UNION Polish Infrastructure for Supporting Computational Science in the European Research Space The Capabilities of the GridSpace2 Experiment.
Dr Chris Maynard Application Consultant, EPCC UKQCD software for lattice QCD P.A. Boyle, R.D. Kenway and C.M.
1 Chapter Client-Server Interaction. 2 Functionality  Transport layer and layers below  Basic communication  Reliability  Application layer.
1 COMPSCI 110 Operating Systems Who - Introductions How - Policies and Administrative Details Why - Objectives and Expectations What - Our Topic: Operating.
SciDAC Software Infrastructure for Lattice Gauge Theory Richard C. Brower Annual Progress Review JLab, May 14, 2007 Code distribution see
SALSA: Language and Architecture for Widely Distributed Actor Systems. Carlos Varela, Abe Stephens, Department of.
LWIP TCP/IP Stack 김백규.
Improved pipelining and domain decomposition in QuickPIC Chengkun Huang (UCLA/LANL) and members of FACET collaboration SciDAC COMPASS all hands meeting.
QCD Project Overview Ying Zhang September 26, 2005.
Lattice QCD in Nuclear Physics Robert Edwards Jefferson Lab CCP 2011 TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.:
Chroma I: A High Level View Bálint Joó Jefferson Lab, Newport News, VA given at HackLatt'06 NeSC, Edinburgh March 29, 2006.
SciDAC Software Infrastructure for Lattice Gauge Theory Richard C. Brower & Robert Edwards June 24, 2003.
GraphLab: how I understood it with sample code Aapo Kyrola, Carnegie Mellon Univ. Oct 1, 2009.
UKQCD QCDgrid Richard Kenway. UKQCD Nov 2001QCDgrid2 why build a QCD grid? the computational problem is too big for current computers –configuration generation.
ARTTREEKS A KINEMATIC SYNTHESIS SOLVER FOR TREE TOPOLOGIES. PRESENTED BY ALI TAMIMI.
Lattice QCD and the SciDAC-2 LQCD Computing Project Lattice QCD Workflow Workshop Fermilab, December 18, 2006 Don Holmgren,
ILDG Middleware Status Chip Watson ILDG-6 Workshop May 12, 2005.
HackLatt MILC Code Basics Carleton DeTar First presented at Edinburgh EPCC HackLatt 2008 Updated 2013.
Hadron Structure using Dynamical Chiral Fermions A. Alexandru, B. Bistrovic, J. Bratt, R. Brower, M. Burkardt, T. Draper, P. Dreher, R. Edwards, M. Engelhardt,
Distributed Information Systems. Motivation ● To understand the problems that Web services try to solve it is helpful to understand how distributed information.
Copyright © 2012 UNICOM Systems, Inc. Confidential Information z/Ware Product Overview illustro Systems International A Division of UNICOM Global.
Aspects of the Class Structure in Chroma Bálint Joó Jefferson Lab, Newport News, VA given at HackLatt'06 NeSC, Edinburgh March 29, 2006.
SciDAC Software Infrastructure for Lattice Gauge Theory Richard C. Brower QCD Project Review May 24-25, 2005 Code distribution see
Slide 1 Archive Computing: Scalable Computing Environments on Very Large Archives Andreas J. Wicenec 13-June-2002.
January 2006UKQCD meeting - Edinburgh Light Hadron Spectrum and Pseudoscalar Decay Constants with 2+1f DWF at L s = 8 Robert Tweedie RBC-UKQCD Collaboration.
MESQUITE: Mesh Optimization Toolkit Brian Miller, LLNL
Site Report on Physics Plans and ILDG Usage for US Balint Joo Jefferson Lab.
Connections to Other Packages The Cactus Team Albert Einstein Institute
A QCD Grid: 5 Easy Pieces? Richard Kenway University of Edinburgh.
1 University of Maryland Runtime Program Evolution Jeff Hollingsworth © Copyright 2000, Jeffrey K. Hollingsworth, All Rights Reserved. University of Maryland.
Status and plans at KEK Shoji Hashimoto Workshop on LQCD Software for Blue Gene/L, Boston University, Jan. 27, 2006.
1 Service Oriented Architecture SOA. 2 Service Oriented Architecture (SOA) Definition  SOA is an architecture paradigm that is gaining recently a significant.
QDP++ and Chroma Robert Edwards Jefferson Lab Collaborators: Balint Joo.
Onlinedeeneislam.blogspot.com1 Design and Analysis of Algorithms Slide # 1 Download From
UKQCD NeSCAC Irving, 24/1/061 January 06 UKQCD meeting Staggered fermion project Alan Irving University of Liverpool.
Copyright 2007, Information Builders. Slide 1 iWay Web Services and WebFOCUS Consumption Michael Florkowski Information Builders.
From Use Cases to Implementation 1. Structural and Behavioral Aspects of Collaborations  Two aspects of Collaborations Structural – specifies the static.
© P. Pongcharoen CCSI/1 Scheduling Complex Products using Genetic Algorithms with Alternative Fitness Functions P. Pongcharoen, C. Hicks, P.M. Braiden.
From Use Cases to Implementation 1. Mapping Requirements Directly to Design and Code  For many, if not most, of our requirements it is relatively easy.
Fermi National Accelerator Laboratory & Thomas Jefferson National Accelerator Facility SciDAC LQCD Software The Department of Energy (DOE) Office of Science.
LQCD Computing Project Overview
Project Management – Part I
Chroma: An Application of the SciDAC QCD API(s)
Software Design Lecture : 9.
Presentation transcript:

Chroma: An Application of the SciDAC QCD API(s) Bálint Joó School of Physics University of Edinburgh UKQCD Collaboration Soon to be moving to the JLAB JLAB SciDAC Meeting, June 1, 2005

An Overview of Chroma An Open Source Lattice QCD Toolbox Designed to be Portable Efficient Standards/Buzzword Compliant Achieved through Utilizing SciDAC QCD APIs Modern Software design techniques Portable third party libraries

Where Chroma Fits in the API layers SciDAC Level 2: QDP++ and QIO SciDAC Level 1: QMP Message Passing & QLA Chroma: Actions, Linear Operators, Inverters, Eigensolver, Monomials, MD Integrators, HMC, RHMC, Propagators, Sources, Spectroscopy, Three Point Functions and more Level 3: Wilson Dirac Operator (SSE and BAGEL), SSE DWF Inverter, and more yet to be defined....

Key Components... Action Factories Fermion Produce Linear Operator objects Compute Propagators Compute Force for Linear Operators Produce SystemSolver objects for computing propagators Gauge Compute action Compute Forces

Key Components... Monomials Fermion Abstract flavour information for Fermion Action eg: 2 flavour: φ † (M † M) -1 φ eg: Rational 1 flavour: R(M † M) ≈ [ φ † (M † M) -1 φ ] 1/2 Applies chain rule to compute relevant force can be overridden for optimisations Computes action as relevant Can use Chronological Predictor for force Gauge Forwards force calculation to GaugeActions

Key Components... Hamiltonians Sum of monomials Can mix and match monomials as desired eg: Nf=2+1 made up from Gauge Monomial(s) 2 flavour fermion monomial 1 flavour rational fermion monomial Used by MD Integrators Used by HMC algorithms

Key Components... InlineMeasurement Tasks Uniform Interface for measurements Specify list of tasks in XML XML converted to a list of function objects Application executes tasks in sequence Can be used to chain measurements either inline in an HMC evolution after the fact through the Chroma Application Allows XML scripting of complex tasks

Algorithms... Gauge Update Algorithms HMC ( Rational if Rational Monomial is used) Heatbath ( Not yet mature ) MD Integrators PQP Leapfrog, 2 Scale Sexton-Weingarten, Minimal Residual Norm (deForcrand et al) Chronological Predictors for solves Zero, Last Solution, Linear Extrapolation, Minimal Residual Extrapolation

Algorithms... Inverters Conjugate Gradients Multiple Shift (MultiMass) Conjugate Gradients BiCGStab SUMR and others Eigensolver Ritz with or without Kalkreuter Simma acceleration

Measurements and Utilities Spectroscopy for Wilson Like Fermions Hadronic Three Point function s Wilson Loops Smearing and Fuzzing...

Chroma Production Applications chroma Post production measurement Executes list of measurements in input file Measurements can communicate via files hmc Primary gauge production code Flavour choice through Hamiltonians and Monomials Usual bag of tricks...

Chroma RHMC Cross Validation

Platforms Exploited by Chroma QCDOC, using custom QMP, BAGEL Intel Clusters, QMP-MPICH, QMP-MVIA, SSE Assembler BlueGene/L, QMP-MPI, BAGEL (QCDOC code used, single FPU only) IBM Pseries (HPCx), BAGEL QMP-MPI

Chroma People Maintainers and Main Developers Robert Edwards, Bálint Joó Contributors S. Basak, G. Fleming, K. Orginos, D. Renner, D. Richards, I. Sato, A. Pochinsky (LHPC) S. Miller, C. McNeile, E. Gregory (UKQCD) – Staggered Fermion Measurements Z. Sroczynski – CPS & Chroma glue (UKQCD) J. Noaki (UKQCD) – Fuzzed Wilson Loops

Chroma Penetration & Uptake LHPC - Hadron structure project G.T. Fleming et. al. as per previous slide... UKQCD DWF locality Study (A. Yamaguchi, P. Boyle) Static Potential (J. Noaki) Staggered Measurement (C. McNeile et al) Joint USQCD & UKQCD Chiral fermion algorithms (R. G. Edwards, B. Joó, A. D. Kennedy, K. Orginos, U. Wenger)

Chroma and Efficiency Efficiency achieved through Calls to High Performance Libraries Intel SSE Optimized Wilson Dslash (C. McLendon – Jlab) Optimized DWF Inverter (A. Pochinsky – MIT) QCDOC, BlueGene/L, Pseries Optimized Wilson Dslash and Simple BLAS Produced by Peter Boyle's BAGEL Generator PLEASE DON'T ASK ME FOR BAGEL !!! ASK PETER ! Optimized expressions in QDP++

Chroma QCDOC Benchmark (Lat'04)

I/O Timings on 1 QCDOC Rack Singlefile I/O is painfully SLOW. Multifile is OK. But as QIO's performance will improves so will ours

Quick and Dirty BlueGene Benchmark Edinburgh 1 midplane, Co-processor mode 1 CPU Peak Performance=700MHzx2FPUs*2flops=2800 Mflops Max attainable with 1FPU=1400Mflops (Double Prec Mflops)

Near Future Work Polishing and More Optimization e.g.: more optimization of 5D operators (Level 3 style), more assembler, etc. Improved algorithms e.g.: exact handling of low eigenvalues of H in 5D operators and molecular dynamics. QDP++/QMP implementation optimization e.g: BlueGene/L native QMP? Help with QIO improvements?

Chroma Dreams... (Nightmares?) ChromaGUI – Create XML Files ChromaGrid – Grid Enabled Chroma? Send input XML File to remote Web Service Web Service launches chroma Web Service sends back URL to output Web Service XML file

Conclusions Chroma is not just a toy system Growing usage worldwide Ported to currently interesting systems Efficiency may need tweaking on some of the more recent platforms (eg BG/L) Beautiful example of how SciDAC APIs enable rapid exploitation of new systems Bright and colorful (chromatic) future