
ATLAS Meeting CERN, 17 October 2011 P. Mato, CERN

 For the last 40 years HEP event-processing frameworks have had the same structure
◦ initialize; loop events { loop modules {…} }; finalize (sketched below)
◦ O-O has not added anything substantial
◦ It is simple, intuitive, easy to manage, and scalable
 Current frameworks were designed in the late 1990s
◦ We now know better what is really needed
◦ Unnecessary complexity impacts performance
 Clear consensus that we need to adapt HEP applications to the new generation of CPUs
◦ Multi-process, multi-threading, GPUs, vectorization, etc.
◦ The one-job-per-core model scales well but requires too much memory and sequential file merging
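A minimal C++ sketch of the classic structure named above; the `Module` and `Event` names are illustrative, not any concrete framework's API:

```cpp
#include <memory>
#include <vector>

struct Event { /* per-event data */ };

struct Module {
    virtual ~Module() = default;
    virtual void initialize() {}
    virtual void execute(Event& evt) = 0;   // process one event
    virtual void finalize() {}
};

// initialize; loop events { loop modules { ... } }; finalize
void runEventLoop(std::vector<std::unique_ptr<Module>>& modules,
                  std::vector<Event>& events) {
    for (auto& m : modules) m->initialize();
    for (auto& evt : events)
        for (auto& m : modules) m->execute(evt);
    for (auto& m : modules) m->finalize();
}
```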

 Covering a broad scope of applications and environments
◦ Single-user desktop/laptop case, to minimize latency
◦ Production (throughput computing), to minimize memory and other resources
 Investigations are starting and prototypes are being developed
◦ Examples:
   Language investigations (Go, OpenCL, etc.)
   Parallelization and vectorization (e.g. the Geant prototype)
   Porting algorithms to GPUs
◦ Opportunity to collaborate and organize the work better

 Universal framework for simulation, reconstruction, analysis, and high-level trigger applications
 Common framework for use by any experiment
 Decomposition of the processing of each event into 'chunks' that can be executed concurrently
 Ability to process several events concurrently
 Optimal scheduling and associated data structures
 Minimize any processing requiring exclusive access to resources, because it breaks concurrency
 Support for various hardware/software technologies
 Facilitate the integration of existing LHC application code (the algorithmic part)
 Quick delivery of running prototypes: the 18-month LHC shutdown is the opportunity

 The current frameworks used by the LHC experiments support all data-processing applications
◦ High-level trigger, reconstruction, analysis, etc.
◦ Nothing really new here
 But simulation applications are designed with one big 'chunk' in which all Geant4 processing happens
◦ We aim to improve the full and fast simulation using the same set of common services and infrastructure
 Running on the major platforms
◦ Linux, Mac OS X, Windows

 Frameworks can be shared between experiments
◦ E.g. Gaudi is used by LHCb, ATLAS, HARP, MINERVA, GLAST, BES3, etc.
 We can do better this time :-)
◦ Expect to work closely with the LHC experiments
◦ Aim to support at least ATLAS and CMS
 Special emphasis on requirements from:
◦ New experiments
   E.g. Linear Collider, SuperB, etc.
◦ Different processing paradigms
   E.g. fixed-target experiments, astroparticle physics

 Framework with the ability to schedule modules/algorithms concurrently (a minimal scheduling sketch follows below)
◦ Full data-dependency analysis would be required (no global data or hidden dependencies)
◦ Need to resolve the DAGs (Directed Acyclic Graphs) both statically and dynamically
 Not much gain expected with 'tasks' as designed today
◦ Algorithm decomposition can be influenced by the framework capabilities
 'Tasks' could be processed by different hardware/software
◦ CPUs, GPUs, threads, processes, etc.
[Diagram: input, processing, and output stages laid out along a time axis]
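A hypothetical sketch of the idea, not any existing framework's API: each algorithm declares the data objects it reads and produces, and a scheduler launches, wave by wave, every algorithm whose inputs are already available. A real scheduler would dispatch dynamically as each result appears, and would detect cycles.

```cpp
#include <algorithm>
#include <cstddef>
#include <functional>
#include <future>
#include <set>
#include <string>
#include <vector>

struct Algorithm {
    std::string name;
    std::set<std::string> inputs;    // data objects read
    std::set<std::string> outputs;   // data objects produced
    std::function<void()> run;
};

void schedule(std::vector<Algorithm>& algs) {
    std::set<std::string> available;              // data produced so far
    std::vector<bool> done(algs.size(), false);
    bool progress = true;
    while (progress) {                            // one 'wave' per iteration
        progress = false;
        std::vector<std::future<void>> wave;
        for (std::size_t i = 0; i < algs.size(); ++i) {
            if (done[i]) continue;
            // ready if all declared inputs are already available
            bool ready = std::includes(available.begin(), available.end(),
                                       algs[i].inputs.begin(), algs[i].inputs.end());
            if (ready) {
                wave.push_back(std::async(std::launch::async, algs[i].run));
                done[i] = true;
                progress = true;
            }
        }
        for (auto& f : wave) f.get();             // barrier: wait for this wave
        for (std::size_t i = 0; i < algs.size(); ++i)
            if (done[i])
                available.insert(algs[i].outputs.begin(), algs[i].outputs.end());
    }
}
```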

 DAG of Brunel
◦ Obtained from the existing code instrumented with 'Auditors'
◦ Probably still missing 'hidden or indirect' dependencies (e.g. Tools)
 Can serve to give an idea of the potential 'concurrency'
◦ Assuming no changes to the current reconstruction algorithms

 Need to deal with the tails of sequential processing
◦ See Rene's presentation
 Introducing pipeline processing (see the sketch below)
◦ Never tried before!
◦ Exclusive access to resources can be pipelined, e.g. file writing
 Need to design a very powerful scheduler
[Diagram: pipelined event processing along a time axis]
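An illustrative sketch (not from the slides) of pipelining an exclusive resource: event-processing threads enqueue output records, and a single dedicated thread drains the queue to the file, so writes proceed behind ongoing processing instead of blocking it. The `SerializedWriter` name is invented for the example.

```cpp
#include <condition_variable>
#include <fstream>
#include <mutex>
#include <queue>
#include <string>
#include <thread>

class SerializedWriter {
    std::queue<std::string> pending_;
    std::mutex m_;
    std::condition_variable cv_;
    bool stop_ = false;
    std::ofstream out_;
    std::thread worker_;
public:
    explicit SerializedWriter(const std::string& path)
        : out_(path), worker_([this] { drain(); }) {}
    ~SerializedWriter() {
        { std::lock_guard<std::mutex> l(m_); stop_ = true; }
        cv_.notify_one();
        worker_.join();                  // flush remaining records, then stop
    }
    void write(std::string record) {     // called concurrently by event threads
        { std::lock_guard<std::mutex> l(m_); pending_.push(std::move(record)); }
        cv_.notify_one();
    }
private:
    void drain() {                       // single consumer: only thread touching the file
        std::unique_lock<std::mutex> l(m_);
        while (!stop_ || !pending_.empty()) {
            cv_.wait(l, [this] { return stop_ || !pending_.empty(); });
            while (!pending_.empty()) {
                std::string rec = std::move(pending_.front());
                pending_.pop();
                l.unlock();
                out_ << rec << '\n';     // exclusive resource, no lock held
                l.lock();
            }
        }
    }
};
```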

 Concrete algorithms can be parallelized with some effort
◦ Making use of threads, OpenMP, MPI, GPUs, etc.
◦ But it is difficult to integrate them into a complete application
   E.g. MT-G4 with parallel Gaudi
◦ Performance-wise, it only makes sense to parallelize the complete application and not only parts of it
 Developing and validating parallel code is difficult
◦ 'Physicists' should be spared from this
◦ In any case, concurrency will limit what can and cannot be done in the algorithmic code
 At the framework level you have the overall view and control of the application

 It is not simple, but we are not alone
◦ Technologies like Apple's Grand Central Dispatch (GCD) are designed to help write applications without having to fiddle directly with threads and locking (and getting it terribly wrong)
 New paradigms for concurrent programming (see the sketch below)
◦ The developer factors the processing into 'chunks' with their dependencies, and lets the framework (system) deal with the creation and management of a 'pool' of threads that takes care of executing the 'chunks'
◦ This tries to eliminate lock-based code and makes it more efficient
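A minimal C++11 sketch of that pattern (not actual GCD): user code only factors the work into independent 'chunks'; a small runtime owns the execution, so no explicit threads or locks appear in user code. Here `std::async` stands in for the framework-managed thread pool of a real implementation.

```cpp
#include <functional>
#include <future>
#include <vector>

// Submit each chunk to the runtime and wait for all of them to finish.
void runChunks(const std::vector<std::function<void()>>& chunks) {
    std::vector<std::future<void>> inflight;
    inflight.reserve(chunks.size());
    for (const auto& chunk : chunks)
        inflight.push_back(std::async(std::launch::async, chunk));
    for (auto& f : inflight) f.get();   // barrier: all chunks complete
}
```

A real runtime would bound the pool size and queue excess chunks rather than spawning one thread per chunk as `std::launch::async` may do.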

 With an approach like GCD we could exercise different factorizations
◦ Processing each event (set of primary particles) could be the 'chunk' (same as GeantMT)
 We could also go down to the sub-event level
◦ Development of Rene's ideas of 'baskets' of particles organized by particle type, volume shape, etc.
◦ Would need to develop an efficient summing ('reduce') of the results
◦ Would require studying the reproducibility of results (random-number sequence); a seeding sketch follows below
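An illustrative sketch of one way to keep results reproducible, assuming event-level chunks: each chunk gets its own generator seeded from the event number, so the random sequence is independent of which thread runs which chunk, and the per-chunk results are then 'reduced' by summing in a fixed order. All names here are invented for the example; deriving seeds directly from small event ids is statistically naive and a real framework would use a proper seeding scheme.

```cpp
#include <cstdint>
#include <future>
#include <random>
#include <vector>

double simulateChunk(std::uint64_t eventId) {
    std::mt19937_64 rng(eventId);        // deterministic per-event sequence
    std::uniform_real_distribution<double> u(0.0, 1.0);
    double energy = 0.0;
    for (int step = 0; step < 1000; ++step) energy += u(rng);
    return energy;                        // per-chunk partial result
}

double runAndReduce(std::uint64_t nEvents) {
    std::vector<std::future<double>> parts;
    for (std::uint64_t id = 0; id < nEvents; ++id)
        parts.push_back(std::async(std::launch::async, simulateChunk, id));
    double total = 0.0;
    for (auto& f : parts) total += f.get();   // reduce in fixed event order
    return total;
}
```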

 Collaboration of framework developers from CERN, FNAL, LBL, DESY, and possibly other labs
◦ Start with a small number of people (at the beginning)
◦ Open to people willing to collaborate
◦ Strong collaboration with ATLAS and CMS (and others)
   E.g. instrumentation of existing applications to provide requirements
◦ Strong collaboration with the Geant4 team
 Quick delivery of running prototypes (I and II)
◦ First prototype in 12 months :-)
 Agile project management with 'short' cycles
◦ Weekly meetings to review progress and update plans

 We need to evaluate some of the existing technologies and design partial prototypes of critical parts
◦ See next slide
 The idea would be to organize these R&D activities in short cycles
◦ Coordinating the interested people to cover all aspects
◦ Coming to conclusions (yes/no) within a few months
 Possibility to organize a kick-off workshop
◦ Define the initial R&D activities and the people involved
◦ Refine the overall vision
◦ Where and when?

 Investigate current LHC applications to gather requirements
◦ Dependencies, data-access patterns, opportunities for concurrency, etc.
 Investigate the design and implementation of state-of-the-art concurrency frameworks
◦ Scheduling (static, dynamic, adaptive), memory model, I/O
 Prototype framework elements
◦ Identify 'exemplar' algorithms to be parallelized
◦ Data structures and memory-allocation strategies
◦ New languages (C++11, Go, PyPy, …)
◦ and libraries (OpenCL, CnC, STM, …)
 Understanding I/O

[Timeline: Today: project definition → R&D, technology evaluation, and design of critical parts → Prototype I complete; initial adaptation of LHC and Geant4 applications during the LHC shutdown → Prototype II complete, with the experience of porting LHC applications → first production-quality release]

 Presented a proposal for the development of a new generic data-processing framework with concurrency, to exploit new CPU/GPU architectures
 The LHC experiments should be the main players: providing specific requirements, participating in the project development, and taking advantage of the new framework
◦ Would imply some re-engineering of parts of the experiments' applications
 Need an R&D program to evaluate existing technologies and develop partial prototypes of critical parts
 Some initial ideas for the project definition have been outlined