CRISP WP18, High-speed data recording Krzysztof Wrona, European XFEL PSI, 18 March 2013.

Slides:



Advertisements
Similar presentations
Operating System.
Advertisements

0 DOD/DT/CEDCV – 20 th & 21 st January Paris meeting SAGEM RTD Activities C2-Sense project Paris – 20 & 21 January 2015.
Current Testbed : 100 GE 2 sites (NERSC, ANL) with 3 nodes each. Each node with 4 x 10 GE NICs Measure various overheads from protocols and file sizes.
Remigius K Mommsen Fermilab A New Event Builder for CMS Run II A New Event Builder for CMS Run II on behalf of the CMS DAQ group.
Embedded Network Controller with Web Interface Bradley University Department of Electrical & Computer Engineering By: Ed Siok Advisor: Dr. Malinowski.
GridPP meeting Feb 03 R. Hughes-Jones Manchester WP7 Networking Richard Hughes-Jones.
Figure 1.1 Interaction between applications and the operating system.
Operating Systems.
Networking, Hardware Issues, SQL Server and Terminal Services Session VII.
I/O Systems ◦ Operating Systems ◦ CS550. Note:  Based on Operating Systems Concepts by Silberschatz, Galvin, and Gagne  Strongly recommended to read.
Why Interchange?. What is Interchange? Interchange Capabilities: Offers complete replacement of CommBridge point-to-point solution with a hub and spoke.
SRP Update Bart Van Assche,.
Performance Tradeoffs for Static Allocation of Zero-Copy Buffers Pål Halvorsen, Espen Jorde, Karl-André Skevik, Vera Goebel, and Thomas Plagemann Institute.
Operating System. Architecture of Computer System Hardware Operating System (OS) Programming Language (e.g. PASCAL) Application Programs (e.g. WORD, EXCEL)
Module 10: Monitoring ISA Server Overview Monitoring Overview Configuring Alerts Configuring Session Monitoring Configuring Logging Configuring.
100G R&D at Fermilab Gabriele Garzoglio (for the High Throughput Data Program team) Grid and Cloud Computing Department Computing Sector, Fermilab Overview.
Course Introduction Andy Wang COP 5611 Advanced Operating Systems.
Current Testbed : 100 GE 2 sites (NERSC, ANL) with 3 nodes each. Each node with 4 x 10 GE NICs Measure various overheads from protocols and file sizes.
Block1 Wrapping Your Nugget Around Distributed Processing.
Data transfer over the wide area network with a large round trip time H. Matsunaga, T. Isobe, T. Mashimo, H. Sakamoto, I. Ueda International Center for.
10GE network tests with UDP
Introduction to dCache Zhenping (Jane) Liu ATLAS Computing Facility, Physics Department Brookhaven National Lab 09/12 – 09/13, 2005 USATLAS Tier-1 & Tier-2.
© Paradigm Publishing Inc. 4-1 OPERATING SYSTEMS.
DYNES Storage Infrastructure Artur Barczyk California Institute of Technology LHCOPN Meeting Geneva, October 07, 2010.
John D. McCoy Principal Investigator Tom McKenna Project Manager UltraScienceNet Research Testbed Enabling Computational Genomics Project Overview.
Data Transport Challenges for e-VLBI Julianne S.O. Sansa* * With Arpad Szomoru, Thijs van der Hulst & Mike Garret.
WP18: High-Speed Data Recording Krzysztof Wrona, European XFEL 07 October 2011 CRISP.
Next Generation Operating Systems Zeljko Susnjar, Cisco CTG June 2015.
Test Results of the EuroStore Mass Storage System Ingo Augustin CERNIT-PDP/DM Padova.
WP19 DESY Development Plan Frank Schlünzen Jürgen Starek.
Welcome to CPS 210 Graduate Level Operating Systems –readings, discussions, and programming projects Systems Quals course –midterm and final exams Gateway.
Data Recording Model at XFEL CRISP 2 nd Annual meeting March 18-19, 2013 Djelloul Boukhelef 1Djelloul Boukhelef - XFEL.
DEPARTEMENT DE PHYSIQUE NUCLEAIRE ET CORPUSCULAIRE JRA1 Parallel - DAQ Status, Emlyn Corrin, 8 Oct 2007 EUDET Annual Meeting, Palaiseau, Paris DAQ Status.
LRPC Firefly RPC, Lightweight RPC, Winsock Direct and VIA.
The Million Point PI System – PI Server 3.4 The Million Point PI System PI Server 3.4 Jon Peterson Rulik Perla Denis Vacher.
High Speed Detectors at Diamond Nick Rees. A few words about HDF5 PSI and Dectris held a workshop in May 2012 which identified issues with HDF5: –HDF5.
Comprehensive Scientific Support Of Large Scale Parallel Computation David Skinner, NERSC.
Predrag Buncic Future IT challenges for ALICE Technical Workshop November 6, 2015.
CD FY09 Tactical Plan Status FY09 Tactical Plan Status Report for Neutrino Program (MINOS, MINERvA, General) Margaret Votava April 21, 2009 Tactical plan.
Data Transport Challenges for e-VLBI Julianne S.O. Sansa* * With Arpad Szomoru, Thijs van der Hulst & Mike Garret.
GNEW2004 CERN March 2004 R. Hughes-Jones Manchester 1 Lessons Learned in Grid Networking or How do we get end-2-end performance to Real Users ? Richard.
C LUSTER OF R ESEARCH I NFRASTRUCTURES F OR S YNERGIES IN P HYSICS Prototype for High-Speed Data Acquisition at European XFEL CRISP 3 rd Annual meeting.
File Transfer And Access (FTP, TFTP, NFS). Remote File Access, Transfer and Storage Networks For different goals variety of approaches to remote file.
Benchmarking Storage Systems How to characterize the system Storage Network Clients Specific benchmarks iozone mdtest h5perf Hdf5-aggregation (tiff2nexus)
INDIANAUNIVERSITYINDIANAUNIVERSITY Tsunami File Transfer Protocol Presentation by ANML January 2003.
1 5/4/05 Fermilab Mass Storage Enstore, dCache and SRM Michael Zalokar Fermilab.
Course 03 Basic Concepts assist. eng. Jánó Rajmond, PhD
Remigius K Mommsen Fermilab CMS Run 2 Event Building.
E-infrastructure requirements from the ESFRI Physics, Astronomy and Analytical Facilities cluster Provisional material based on outcome of workshop held.
100G R&D at Fermilab Gabriele Garzoglio (for the High Throughput Data Program team) Grid and Cloud Computing Department Computing Sector, Fermilab Overview.
Big Data over a 100G Network at Fermilab Gabriele Garzoglio Grid and Cloud Services Department Computing Sector, Fermilab CHEP 2013 – Oct 15, 2013 Overview.
HUAWEI TECHNOLOGIES CO., LTD. Huawei Storage ISM Management Pre-sales Product Training Materials Easy and Efficient WEU IT Solution Team.
Chapter 2 Operating System Overview Dave Bremer Otago Polytechnic, N.Z. ©2008, Prentice Hall Operating Systems: Internals and Design Principles, 6/E William.
DART SI-8: Pilot long-distance high speed and secure data transfer between the Repositories DART Workshop on Infrastructure Chief Investigator: Dr. Asad.
GridOS: Operating System Services for Grid Architectures
Introduction to comp. and prog. CS 101 G 964
WP18, High-speed data recording Krzysztof Wrona, European XFEL
FileCatalyst Performance
Introduction the IT and DM Topic
Andy Wang COP 5611 Advanced Operating Systems
Course Introduction Dr. Eggen COP 6611 Advanced Operating Systems
Andy Wang COP 5611 Advanced Operating Systems
WP18, High-speed data recording
Computing Infrastructure for DAQ, DM and SC
Objective Understand the concepts of modern operating systems by investigating the most popular operating system in the current and future market Provide.
Andy Wang COP 5611 Advanced Operating Systems
Wide Area Workload Management Work Package DATAGRID project
Andy Wang COP 5611 Advanced Operating Systems
Objective Understand the concepts of modern operating systems by investigating the most popular operating system in the current and future market Provide.
Andy Wang COP 5611 Advanced Operating Systems
Presentation transcript:

CRISP WP18, High-speed data recording Krzysztof Wrona, European XFEL PSI, 18 March 2013

2 High-speed Data Recording Objectives: 1.“High-speed recording of data to permanent storage and archive” 2.“Optimized and secure access to data using standard protocols” Partners: DESY, ESRF, ESS, European XFEL, GANIL, ILL, UOXF.DB, Univ. Cambridge

Task 1 3 Assembling requirements and use cases for high-speed data recording to storage systems and data archives. –Input collected as part of the IT&DM survey –Presentations by RIs, description of use cases, discussions, identification of common critical issues –Requirement document has been published (MS3) Reviewing available technologies, selecting tools, and investigating their usability for defined use cases –Survey on hardware and software technologies, exchange of experience

Identified synergies Using 10GE network for high throughput data transfers Online data processing models Writing data to storage Data archiving 4 Three categories: Relatively small data rates at ESS, ILL and SPIRAL2 (neutrons and ions oriented physics), High data rates expected at ESRF, EuroFEL and European XFEL (synchrotrons and FELs) Order of magnitude higher data rates at SKA (astrophysics) but expected few years later.

Network Tests Using 10GE network for high throughput data transfers –Results from 10GE tests presented at the October IT&DM topic meeting –Achieved: UDP and TCP protocols at wire speed Minimized UDP packet loss with advanced tuning Tuning summary –Linux kernel and driver parameters –NIC configuration –PCI bus –IRQ affinity –Disabling general system services Application tuning summary –Binding threads to cores 5

Local Buffers Implementation of local buffer at ESRF 6 Details in presentation by B. Rousselle How to saturate 10GE link through a remote filesystem Using http protocol for data transfer Interoperability between windows 32/64 and Linux OS

Online data processing Online data processing model at XFEL –Pipelining and multithreading on multi-core architecture –Multiple data channels to handle output stream from single detector –Receiving, processing, monitoring, formatting, and sending data to storage 7

Data recording Sending single channel formatted data through 10GE interface using TCP Storage performance: cached vs. direct IO Tests with two types of storage systems: –14 x 900GB 10Krpm, SAS, 2 x 6Gbps, RAID6 –12 x 3TB 7.2Krpm, NL SAS, 2 x 6Gbps, RAID6 8 Details in presentation by D. Boukhelef Results –Achieved data rate per channel: 1.1GB/s and 0.97GB/s, resp. –Direct IO improves performance and stability Currently investigated –Concurrent IO operations: write/write, write/read, file merging –Storage manager: file indexing, disk space management, dynamic data switching

Task 2 9 Collecting requirements for data protection and understanding their implications for high-speed data recording and data access. –Input collected as part of the IT&DM survay –Presentation from extended study case at ILL on October IT&DM topic meeting –Requirement document published as MS8

Data access protection 10 Highlights –Analysis of requirements for data protection Open access data policy with initial period when data access is restricted Large collaborations vs. small experimental teams Online data processing, access to data in the local buffers Long term storage of immutable data –Defining users and facility roles required for accessing and curating data –Extended study case at ILL Datasets identified by proposal id Heterogeneity of the environment, legacy protocols and applications Implementation based on filesystem ACLs –List of recommendations

Task 3 11 Defining and selecting use case applications requiring high throughput data access. Evaluating the usability of standard access protocols Defining data-access architecture Status: –Evaluation of several cluster filesystems (Fraunhofer, glustre), dCache implementation of (pNFS) NFS4.1, NetApp using NFS3 and NFS4.1 –Lesson learned so far Interplay between hardware, operating system, networks and applications Standard tools mdtest+iozone+h5perf help testing basic capabilities –Poor performance on tests -> poor performance in real applications –Good performance on tests does not imply good performance in real applications –Architecture document expected in month 24 (milestone)