Remigius K Mommsen Fermilab A New Event Builder for CMS Run II A New Event Builder for CMS Run II on behalf of the CMS DAQ group.

Slides:



Advertisements
Similar presentations
The Development of Mellanox - NVIDIA GPUDirect over InfiniBand A New Model for GPU to GPU Communications Gilad Shainer.
Advertisements

G ö khan Ü nel / CHEP Interlaken ATLAS 1 Performance of the ATLAS DAQ DataFlow system Introduction/Generalities –Presentation of the ATLAS DAQ components.
♦ Commodity processor with commodity inter- processor connection Clusters Pentium, Itanium, Opteron, Alpha GigE, Infiniband, Myrinet, Quadrics, SCI NEC.
Performance Analysis of Virtualization for High Performance Computing A Practical Evaluation of Hypervisor Overheads Matthew Cawood University of Cape.
Institute of Computer Science Foundation for Research and Technology – Hellas Greece Computer Architecture and VLSI Systems Laboratory Exploiting Spatial.
CHEP04 - Interlaken - Sep. 27th - Oct. 1st 2004T. M. Steinbeck for the Alice Collaboration1/20 New Experiences with the ALICE High Level Trigger Data Transport.
An overview of Infiniband Reykjavik, June 24th 2008 R E Y K J A V I K U N I V E R S I T Y Dept. Computer Science Center for Analysis and Design of Intelligent.
LHCb readout infrastructure NA62 TDAQ WG Meeting April 1 st, 2009 Niko Neufeld, PH/LBC.
© 2013 Mellanox Technologies 1 NoSQL DB Benchmarking with high performance Networking solutions WBDB, Xian, July 2013.
CMS DAQ-2 Shifter Tutorial
LNL CMS G. MaronCPT Week CERN, 23 April Legnaro Event Builder Prototypes Luciano Berti, Gaetano Maron Luciano Berti, Gaetano Maron INFN – Laboratori.
The Application of DAQ-Middleware to the J-PARC E16 Experiment E Hamada 1, M Ikeno 1, D Kawama 2, Y Morino 1, W Nakai 3, 2, Y Obara 3, K Ozawa 1, H Sendai.
Towards a Common Communication Infrastructure for Clusters and Grids Darius Buntinas Argonne National Laboratory.
S. Cittolin CERN/CMS, 22/03/07 DAQ architecture. TDR-2003 DAQ evolution and upgrades DAQ upgrades at SLHC.
Profiling Grid Data Transfer Protocols and Servers George Kola, Tevfik Kosar and Miron Livny University of Wisconsin-Madison USA.
Artdaq Introduction artdaq is a toolkit for creating the event building and filtering portions of a DAQ. A set of ready-to-use components along with hooks.
A TCP/IP transport layer for the DAQ of the CMS Experiment Miklos Kozlovszky for the CMS TriDAS collaboration CERN European Organization for Nuclear Research.
Boosting Event Building Performance Using Infiniband FDR for CMS Upgrade Andrew Forrest – CERN (PH/CMD) Technology and Instrumentation in Particle Physics.
MIDeA :A Multi-Parallel Instrusion Detection Architecture Author: Giorgos Vasiliadis, Michalis Polychronakis,Sotiris Ioannidis Publisher: CCS’11, October.
The NE010 iWARP Adapter Gary Montry Senior Scientist
The Red Storm High Performance Computer March 19, 2008 Sue Kelly Sandia National Laboratories Abstract: Sandia National.
LECC2003 AmsterdamMatthias Müller A RobIn Prototype for a PCI-Bus based Atlas Readout-System B. Gorini, M. Joos, J. Petersen (CERN, Geneva) A. Kugel, R.
Towards a Homogeneous Software Environment for DAQ Applications Luciano Orsini Johannes Gutleber CERN EP/CMD.
VLVNT Amsterdam 2003 – J. Panman1 DAQ: comparison with an LHC experiment J. Panman CERN VLVNT workshop 7 Oct 2003 Use as example CMS (slides taken from.
Network Architecture for the LHCb DAQ Upgrade Guoming Liu CERN, Switzerland Upgrade DAQ Miniworkshop May 27, 2013.
Upgrade of the CMS Event Builder Andrea Petrucci - CERN (PH/CMD) on behalf of the CMS DAQ group 19 th International Conference on Computing in High Energy.
The new CMS DAQ system for LHC operation after 2014 (DAQ2) CHEP2013: Computing in High Energy Physics Oct 2013 Amsterdam Andre Holzner, University.
1 Network Performance Optimisation and Load Balancing Wulf Thannhaeuser.
DAQ2 Shift TutorialcDAQ group1 Monitoring of the DAQ2 system Remi Mommsen, FNAL.
Overview of DAQ at CERN experiments E.Radicioni, INFN MICE Daq and Controls Workshop.
Intel Research & Development ETA: Experience with an IA processor as a Packet Processing Engine HP Labs Computer Systems Colloquium August 2003 Greg Regnier.
Latest ideas in DAQ development for LHC B. Gorini - CERN 1.
LHCb DAQ system LHCb SFC review Nov. 26 th 2004 Niko Neufeld, CERN.
Sep. 17, 2002BESIII Review Meeting BESIII DAQ System BESIII Review Meeting IHEP · Beijing · China Sep , 2002.
June 17th, 2002Gustaaf Brooijmans - All Experimenter's Meeting 1 DØ DAQ Status June 17th, 2002 S. Snyder (BNL), D. Chapin, M. Clements, D. Cutts, S. Mattingly.
LNL 1 SADIRC2000 Resoconto 2000 e Richieste LNL per il 2001 L. Berti 30% M. Biasotto 100% M. Gulmini 50% G. Maron 50% N. Toniolo 30% Le percentuali sono.
The CMS Event Builder Demonstrator based on MyrinetFrans Meijers. CHEP 2000, Padova Italy, Feb The CMS Event Builder Demonstrator based on Myrinet.
Computing Issues for the ATLAS SWT2. What is SWT2? SWT2 is the U.S. ATLAS Southwestern Tier 2 Consortium UTA is lead institution, along with University.
1 Farm Issues L1&HLT Implementation Review Niko Neufeld, CERN-EP Tuesday, April 29 th.
Niko Neufeld HL-LHC Trigger, Online and Offline Computing Working Group Topical Workshop Sep 5 th 2014.
Management of the LHCb DAQ Network Guoming Liu *†, Niko Neufeld * * CERN, Switzerland † University of Ferrara, Italy.
Pierre VANDE VYVRE ALICE Online upgrade October 03, 2012 Offline Meeting, CERN.
PCIe based readout for the LHCb upgrade U. Marconi, INFN Bologna On behalf of the LHCb Online Group (Bologna-CERN-Padova) CHEP2013, Amsterdam, 15 th October.
Cluster Computers. Introduction Cluster computing –Standard PCs or workstations connected by a fast network –Good price/performance ratio –Exploit existing.
DAQ Overview + selected Topics Beat Jost Cern EP.
Introduction to DAQ Architecture Niko Neufeld CERN / IPHE Lausanne.
An Overview over Online Systems at the LHC Invited Talk at NSS-MIC 2012 Anaheim CA, 31 October 2012 Beat Jost, Cern.
Remigius K Mommsen Fermilab CMS Run 2 Event Building.
A Practical Evaluation of Hypervisor Overheads Matthew Cawood Supervised by: Dr. Simon Winberg University of Cape Town Performance Analysis of Virtualization.
The Evaluation Tool for the LHCb Event Builder Network Upgrade Guoming Liu, Niko Neufeld CERN, Switzerland 18 th Real-Time Conference June 13, 2012.
Introduction to Data Analysis with R on HPC Texas Advanced Computing Center Feb
EPS 2007 Alexander Oh, CERN 1 The DAQ and Run Control of CMS EPS 2007, Manchester Alexander Oh, CERN, PH-CMD On behalf of the CMS-CMD Group.
CHEP 2010, October 2010, Taipei, Taiwan 1 18 th International Conference on Computing in High Energy and Nuclear Physics This research project has.
CMS DAQ project at Fermilab
HTCC coffee march /03/2017 Sébastien VALAT – CERN.
Balazs Voneki CERN/EP/LHCb Online group
MPD Data Acquisition System: Architecture and Solutions
M. Bellato INFN Padova and U. Marconi INFN Bologna
LHCb and InfiniBand on FPGA
Presented by Yoon-Soo Lee
Electronics Trigger and DAQ CERN meeting summary.
RT2003, Montreal Niko Neufeld, CERN-EP & Univ. de Lausanne
CMS DAQ Event Builder Based on Gigabit Ethernet
DAQ upgrades at SLHC S. Cittolin CERN/CMS, 22/03/07
PCI BASED READ-OUT RECEIVER CARD IN THE ALICE DAQ SYSTEM
Example of DAQ Trigger issues for the SoLID experiment
John Harvey CERN EP/LBC July 24, 2001
Event Building With Smart NICs
LHCb Trigger, Online and related Electronics
Cluster Computers.
Presentation transcript:

Remigius K Mommsen Fermilab A New Event Builder for CMS Run II A New Event Builder for CMS Run II on behalf of the CMS DAQ group

Remi Mommsen – CHEP 2015 – A New Event Builder for CMS Run II Overview Overview of CMS DAQ for LHC run II Event-building protocol Performance measurements 2

Remi Mommsen – CHEP 2015 – A New Event Builder for CMS Run II CMS DAQ for LHC Run II Requirements 100 kHz level 1 trigger rate (unchanged) Event size might double to 2 MB Increase in pileup New detectors Accommodate legacy and new uTCA-based detector readouts 1-2 kB or 8 kB fragments Aging hardware Most components reached end-of-life cycle New technologies Myrinet widely used when DAQ-1 was designed Ethernet and Infiniband dominate the top-500 supercomputers today 3 Top500.org share by Interconnect family Infiniband DAQ1 TDR (2002) Myrinet 1 Gb/s Ethernet 10 Gb/s Ethernet 2014

Remi Mommsen – CHEP 2015 – A New Event Builder for CMS Run II CMS Event Builder Detector front-end (custom electronics) Front-End Readout Optical Link (FEROL) Optical 10 GbE TCP/IP Data Concentrator switches Data to Surface Aggregate into 40 GbE links 72 Readout Units (RUs) Combine FEROL fragments into super-fragment Event Builder switch Infiniband FDR 56 Gbps CLOS network 62 Builder Units (BUs) Event building Write events to transient files on RAM disk Filter Units (FUs) Run HLT selection using files from RAM disk x 10 GbE 200 m 72 x 40 GbE 72 x 64 IB 56 Gbps

Remi Mommsen – CHEP 2015 – A New Event Builder for CMS Run II EvB Protocol EVM RU 1RU 2 BU 1BU 2 FEROLs Fragment Assign event to BU1 5 Event Request Event Request Event Request Fragment Super- fragment Event Fragment

Remi Mommsen – CHEP 2015 – A New Event Builder for CMS Run II Achieving Performance Avoid high rate of small messages Request multiple events at the same time Pack data of multiple events into one message Avoid copying data Operate on pointers to data in receiving buffers Copy data directly into RDMA buffers of Infiniband NICs Stay in kernel space when writing data Parallelize the work Use multiple threads for data transmission and event handling Write events concurrently into multiple files Bind everything to CPU cores and memory (NUMA) Each thread bound to a core Memory structures allocated on pre-defined CPU Interrupts from NICs restricted to certain cores Tune Linux TCP stack for maximum performance 6

Remi Mommsen – CHEP 2015 – A New Event Builder for CMS Run II Computers Readout Unit (RU) Dell PowerEdge R620 Dual 8 core Xeon CPU E GHz 32 GB of memory Builder Unit (BU) Dell PowerEdge R720 Dual 8 core Xeon CPU E GHz GB of memory (240 GB for Ramdisk on CPU 1)

Remi Mommsen – CHEP 2015 – A New Event Builder for CMS Run II Data Network 40/56Gb NICs (Infiniband or Ethernet) Mellanox Technologies MT27500 Family [ConnectX-3] 10/40 GbE switches Mellanox SX1024 & SX1036 Infiniband switches Mellanox SX6036 Infiniband CLOS network

Remi Mommsen – CHEP 2015 – A New Event Builder for CMS Run II Performance Measurements 9

Remi Mommsen – CHEP 2015 – A New Event Builder for CMS Run II Data Concentrator 10 4 – 16 FEROLs BU 1 1 kB 1 RU 2 BUs EVM RU BU 2 256B - 16kB 1 kB1 – 256 kB

Remi Mommsen – CHEP 2015 – A New Event Builder for CMS Run II Data Concentrator 11 Legacy FEDsuTCA FEDs

Remi Mommsen – CHEP 2015 – A New Event Builder for CMS Run II 1 kB 256B - 16kB EVM Builder Unit 12 8 FEROLs per RU 44 RUs kB 1 BU kB

Remi Mommsen – CHEP 2015 – A New Event Builder for CMS Run II Builder Unit 13

Remi Mommsen – CHEP 2015 – A New Event Builder for CMS Run II 1 kB kB 1 kB 256B - 16kB Scalability 14 8 FEROLs per RU RUs 1 – 44 BUs EVM kB

Remi Mommsen – CHEP 2015 – A New Event Builder for CMS Run II Scalability 15

Remi Mommsen – CHEP 2015 – A New Event Builder for CMS Run II Summary CMS has a complete new DAQ system for LHC run II State-of-the-art technology Order of magnitude smaller than previous DAQ system Optimal use of high-end hardware New event-building protocol New software to exploit hardware capabilities A lot of fine-tuning to get full performance EvB scaling behavior not understood Study of Infiniband CLOS network under EvB traffic ongoing DAQ is ready for first physics data More work is needed to achieve the ultimate performance 16

Remi Mommsen – CHEP 2015 – A New Event Builder for CMS Run II

BU Performance vs Threads

Remi Mommsen – CHEP 2015 – A New Event Builder for CMS Run II Scalability 19