OO Software and Data Handling in AMS Computing in High Energy and Nuclear Physics Beijing, September 3-7, 2001 Vitali Choutko, Alexei Klimentov MIT, ETHZ.

Slides:



Advertisements
Similar presentations
GridPP July 2003Stefan StonjekSlide 1 SAM middleware components Stefan Stonjek University of Oxford 7 th GridPP Meeting 02 nd July 2003 Oxford.
Advertisements

Database System Concepts and Architecture
March 24-28, 2003Computing for High-Energy Physics Configuration Database for BaBar On-line Rainer Bartoldus, Gregory Dubois-Felsmann, Yury Kolomensky,
The Results of Alpha Magnetic Spectrometer (AMS01) Experiment in Space Behcet Alpat I.N.F.N. Perugia TAUP 2001 Laboratori Nazionali del Gran Sasso, Italy.
Computing Strategy of the AMS-02 Experiment B. Shan 1 On behalf of AMS Collaboration 1 Beihang University, China CHEP 2015, Okinawa.
Software Summary Database Data Flow G4MICE Status & Plans Detector Reconstruction 1M.Ellis - CM24 - 3rd June 2009.
1 Databases in ALICE L.Betev LCG Database Deployment and Persistency Workshop Geneva, October 17, 2005.
AMS COMPUTING VIII International Workshop Advanced Computing And Analysis Techniques in Physics Research Moscow, June 24-29, 2002 Vitali Choutko, Alexei.
Data Acquisition System for 2D X-Ray Detector Beijing Synchrotron Radiation Facility (BSRF) located at Institute of High Energy Physics is the first synchrotron.
March 13thXXXXth RENCONTRES DE MORIOND 1 The Alpha Magnetic Spectrometer on the International Space Station Carmen Palomares CIEMAT (Madrid) On behalf.
David Adams ATLAS DIAL Distributed Interactive Analysis of Large datasets David Adams BNL March 25, 2003 CHEP 2003 Data Analysis Environment and Visualization.
ACAT 2002, Moscow June 24-28thJ. Hernández. DESY-Zeuthen1 Offline Mass Data Processing using Online Computing Resources at HERA-B José Hernández DESY-Zeuthen.
D. VITE' - AMS Genève Cartigny, AMS the Anti-Matter Spectrometer n The past:  1999 n The present:2000 n The future: and beyond… n.
March 2003 CHEP Online Monitoring Software Framework in the ATLAS Experiment Serguei Kolos CERN/PNPI On behalf of the ATLAS Trigger/DAQ Online Software.
F Fermilab Database Experience in Run II Fermilab Run II Database Requirements Online databases are maintained at each experiment and are critical for.
March 2004 At A Glance ITOS is a highly configurable low-cost control and monitoring system. Benefits Extreme low cost Database driven - ITOS software.
Peter Chochula.  DAQ architecture and databases  DCS architecture  Databases in ALICE DCS  Layout  Interface to external systems  Current status.
AMS Computing Y2001-Y2002 AMS Technical Interchange Meeting MIT Jan 22-25, 2002 Vitali Choutko, Alexei Klimentov.
Computer Systems Week 10: File Organisation Alma Whitfield.
Shuei MEG review meeting, 2 July MEG Software Status MEG Software Group Framework Large Prototype software updates Database ROME Monte Carlo.
Event Metadata Records as a Testbed for Scalable Data Mining David Malon, Peter van Gemmeren (Argonne National Laboratory) At a data rate of 200 hertz,
AMS TIM, CERN Apr 12, 2005 AMS Computing and Ground Centers Status Report Alexei Klimentov —
M.Frank LHCb/CERN - In behalf of the LHCb GAUDI team Data Persistency Solution for LHCb ã Motivation ã Data access ã Generic model ã Experience & Conclusions.
Online & Offline software at H.E.S.S.  The H.E.S.S. Experiment  Data storage & Off-line software  Acquisition software  ROOT Problems/Wishlist ROOT.
GLAST LAT ProjectDOE/NASA Baseline-Preliminary Design Review, January 8, 2002 K.Young 1 LAT Data Processing Facility Automatically process Level 0 data.
Simple Database.
Root, GLAST, and IDL Heather Kelly NASA/GSFC Emergent Corporation.
The SAMGrid Data Handling System Outline:  What Is SAMGrid?  Use Cases for SAMGrid in Run II Experiments  Current Operational Load  Stress Testing.
3 Sept 2001F HARRIS CHEP, Beijing 1 Moving the LHCb Monte Carlo production system to the GRID D.Galli,U.Marconi,V.Vagnoni INFN Bologna N Brook Bristol.
03/27/2003CHEP20031 Remote Operation of a Monte Carlo Production Farm Using Globus Dirk Hufnagel, Teela Pulliam, Thomas Allmendinger, Klaus Honscheid (Ohio.
CDF data production models 1 Data production models for the CDF experiment S. Hou for the CDF data production team.
Oracle Index study for Event TAG DB M. Boschini S. Della Torre
MWA Data Capture and Archiving Dave Pallot MWA Conference Melbourne Australia 7 th December 2011.
M.Lautenschlager (WDCC, Hamburg) / / 1 Semantic Data Management for Organising Terabyte Data Archives Michael Lautenschlager World Data Center.
AMS-02 Computing and Ground Data Handling CHEP 2004 September 29, Interlaken Alexei Klimentov — ETH Zurich and
Status of AMS Regional Data Center The Second International Workshop on HEP Data Grid CHEP, KNU G. N. Kim, J. W. Shin, N. Tasneem, M. W.
David N. Brown Lawrence Berkeley National Lab Representing the BaBar Collaboration The BaBar Mini  BaBar  BaBar’s Data Formats  Design of the Mini 
Bookkeeping Tutorial. Bookkeeping & Monitoring Tutorial2 Bookkeeping content  Contains records of all “jobs” and all “files” that are created by production.
Databases E. Leonardi, P. Valente. Conditions DB Conditions=Dynamic parameters non-event time-varying Conditions database (CondDB) General definition:
Stuart Wakefield Imperial College London Evolution of BOSS, a tool for job submission and tracking W. Bacchi, G. Codispoti, C. Grandi, INFN Bologna D.
4/5/2007Data handling and transfer in the LHCb experiment1 Data handling and transfer in the LHCb experiment RT NPSS Real Time 2007 FNAL - 4 th May 2007.
Systems in AMS02 AMS July 2003 Computing and Ground MIT Alexei Klimentov —
CHEP as an AMS Remote Data Center International HEP DataGrid Workshop CHEP, KNU G.N. Kim, H.B. Park, K.H. Cho, S. Ro, Y.D. Oh, D. Son (Kyungpook)
Clara Gaspar, March 2005 LHCb Online & the Conditions DB.
Online Software 8-July-98 Commissioning Working Group DØ Workshop S. Fuess Objective: Define for you, the customers of the Online system, the products.
The KLOE computing environment Nuclear Science Symposium Portland, Oregon, USA 20 October 2003 M. Moulson – INFN/Frascati for the KLOE Collaboration.
The Alpha Magnetic Spectrometer (AMS) on the International Space Station (ISS) Maria Ionica I.N.F.N. Perugia International School.
HIGUCHI Takeo Department of Physics, Faulty of Science, University of Tokyo Representing dBASF Development Team BELLE/CHEP20001 Distributed BELLE Analysis.
Experimental Facilities Division ORNL DAQ System Interfaces Neutron Science Software Workshop Oct 13, 2003 Rick Riedel.
Computing Strategy of the AMS-02 Experiment V. Choutko 1, B. Shan 2 A. Egorov 1, A. Eline 1 1 MIT, USA 2 Beihang University, China CHEP 2015, Okinawa.
Online Monitoring System at KLOE Alessandra Doria INFN - Napoli for the KLOE collaboration CHEP 2000 Padova, 7-11 February 2000 NAPOLI.
The ATLAS TAGs Database - Experiences and further developments Elisabeth Vinek, CERN & University of Vienna on behalf of the TAGs developers group.
M. Oldenburg GridPP Metadata Workshop — July 4–7 2006, Oxford University 1 Markus Oldenburg GridPP Metadata Workshop July 4–7 2006, Oxford University ALICE.
Scientific Computing Facilities for CMS Simulation Shams Shahid Ayub CTC-CERN Computer Lab.
Summary of User Requirements for Calibration and Alignment Database Magali Gruwé CERN PH/AIP ALICE Offline Week Alignment and Calibration Workshop February.
Computing Strategy of the AMS-02 Experiment
The DCS Databases Peter Chochula. 31/05/2005Peter Chochula 2 Outline PVSS basics (boring topic but useful if one wants to understand the DCS data flow)
POCC activities AMS02 meeting at KSC October 11 th 2010.
1 A Scalable Distributed Data Management System for ATLAS David Cameron CERN CHEP 2006 Mumbai, India.
Overview of PHENIX Muon Tracker Data Analysis PHENIX Muon Tracker Muon Tracker Software Muon Tracker Database Muon Event Display Performance Muon Reconstruction.
A complete simulation of cosmic rays access to a Space Station Davide Grandi INFN Milano, ITALY.
1 DAQ.IHEP Beijing, CAS.CHINA mail to: The Readout In BESIII DAQ Framework The BESIII DAQ system consists of the readout subsystem, the.
AMS02 Data Volume, Staging and Archiving Issues AMS Computing Meeting CERN April 8, 2002 Alexei Klimentov.
1 AMS-02 POCC & SOC MSFC, 9-Apr-2009 Mike Capell Avionics & Operations Lead Senior Research Scientist ISS: 108x80m 420T 86KW 400km AMS: 5x4x3m 7T 2.3+KW.
Jianming Qian, UM/DØ Software & Computing Where we are now Where we want to go Overview Director’s Review, June 5, 2002.
Apr. 25, 2002Why DØRAC? DØRAC FTFM, Jae Yu 1 What do we want DØ Regional Analysis Centers (DØRAC) do? Why do we need a DØRAC? What do we want a DØRAC do?
Evolution of Monitoring System for AMS Science Operation Center
Computing Infrastructure for DAQ, DM and SC
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 2 Database System Concepts and Architecture.
An Overview of the BaBar Conditions Database
Presentation transcript:

OO Software and Data Handling in AMS Computing in High Energy and Nuclear Physics Beijing, September 3-7, 2001 Vitali Choutko, Alexei Klimentov MIT, ETHZ

A.Klimentov AMS software and data handling CHEP012 Outline AMS – particle physics experiment on the international space station : Data flow and AMS ground centers Software development Conditions and Tag Database Data Processing AMS Detector STS91 precursor flight AMS ISS mission

A.Klimentov AMS software and data handling CHEP013 AMS : a particle physics experiment in space PHYSICS GOALS : Accurate, high statistics measurements of charged, cosmic ray spectra in space > 0.1GV Nuclei and e- spectra measurement The study of dark matter (90% ?) Determination of the existence or absence of antimatter in the Universe Look for negative nuclei The study of the origin and composition of cosmic rays Measure isotopes D, He, Li, Be… +

A.Klimentov AMS software and data handling CHEP014

A.Klimentov AMS software and data handling CHEP015 Magnet : Nd 2 Fe 14 B TOF : trigger, velocity and Z Si Tracker : charge sign, rigidity, Z Aerogel Threshold Cerenkov : velocity Anticounters : reject multi particle events Results :  Anti-matter search : He / He = 1.1x 10  Charged Cosmic Ray spectra Pr, D, e-, He, N  Geomagnetic effects on CR under/over geomagnetic cutoff components 10 events recorded Trigger rates 0.1-1kHz DAQ lifetime 90% 8 -6 _ Precursor flight : +

A.Klimentov AMS software and data handling CHEP016

A.Klimentov AMS software and data handling CHEP017 AMS on ISS, 3 years in space Separate e- from p,p + _ up to 300 GeV He, He, B, C… 3 4 e-,   up to 1000 GeV +

A.Klimentov AMS software and data handling CHEP018

9 Monitoring & science data Stored data Real-time Data H&S Real-time & “Dump” data Real-time, “Dump”, & White Sand’s LOR playback AMS ACOP High Rate Frame MUX White Sand, NM facility MSFC, Al Payload Data Service system Telescience centers External Communications GSE Long Term Short Term Payload Operations Control Center Science Operations Center ISS to Remote AMS Centers Data Flow ISS NASA Ground Infrastructure Remote AMS Sites H&S Monitoring Science Flight ancillary data Real-time & “dump” NearReal-time File transfer playback

A.KlimentovCHEP0110 AMS Ground Centers Science Operations Center POCC AL AMS Remote center RT data Commanding Monitoring NRT Analysis NRT Data Processing Primary storage Archiving Distribution Science Analysis MC production Data mirror archiving External Communications Science Operations Center XTerm HOSC Web Server and xterm TReK WS commands Monitoring, H&S data Flight Ancillary data AMS science data (selected) TReK WS “ voice”loop Video distribution Production Farm Analysis Facilities PC Farm Data Server Analysis Facilities GSE D S A e T r A v e r GSE Buffer data Retransmit To SOC AMS Station AMS Station AMS Station GSE MC production cmds archive AMS Data, NASA data, metadata

A.Klimentov AMS software and data handling CHEP0111 AMS SW development Been started mid 1996 : basic decisions : new code C++ only (though we had a large part of legacy SW written on Fortran) Existing libraries (CERNLIB, Geant, etc) incorporated via C/Fortran interface (R.Burow) transient and persistent classes are separated with implementing of copy member functions Decide to use Root and HBOOK for histogramming and data visualization

A.Klimentov AMS software and data handling CHEP0112 AMS SW development (cont’d) Use different persistency solutions for various type of data :  Flat files for the raw data  Ntuples and Root files for ESD  Relational Database (Oracle) tables for file catalogues  Relational Database (Oracle) [Objectivity up to Sep 1998] o Event Tags o Calibration data o Slow control data o NASA ancillary data o Various catalogues (processing history, etc )

A.Klimentov AMS software and data handling CHEP0113 Tag Storage with Oracle RDBMS Tag is an unsigned 32 bit integer containing 16, 1 to 5 bit long parameters such as charge, momentum sign, ß,… Model : Query : retrieve tags with 3 parameters satisfied to the given limits (query taken from the “real analysis chain”) Data stored on Raid array connected to AS4100 (quad-CPU rated at 600MHz, 2GB RAM) Flat files – 2400 files, one file per DAQ run, tags are stored as an array of unsigned int. RootN - 10 files, each file with ~240 trees, one tree per DAQ run with single branch (tag) per tree RootS - 10 files, each file with ~240 trees, one tree per DAQ run, having 16 branches, every parameter stored in a dedicated branch OracleN - table with 10 partitions and 1 column, mapping tag to a column OracleI - table with 10 partitions and 1 column with 16 bitmap indices, mapping tag to a column OracleS – table with 10 partitions and 16 columns, every parameter mapping to a column

A.Klimentov AMS software and data handling CHEP0114 Oracle RDBMS to store AMS tags Method Size Gbyte Query Time (sec) Total write time (sec) Record write time (  sec) Flat Files RootN RootS OracleN OracleS OracleI ) 1) 500 sec to build indices for 100M tags

A.Klimentov AMS software and data handling CHEP0115 Design of the Conditions Database  Collection of Time Dependent Values (TDVs)  Primary access keys : name, id, validity interval  Secondary key : insert time  Major Components : table of names and ids, default TDVs, TDVs  Applications : Loading data into database Fetching conditions during event reconstruction Management utilities (TDV browser) Name, id Validity begin, validity end time Insert time Array of unsigned integers (size 100 byte – 8 Mbyte)

A.Klimentov AMS software and data handling CHEP0116 AMS Conditions Database Initially Objectivity, then flat files, now Oracle Performance test for TDV Total Records Flat file size (Mbyte) Oracle table size (Mbyte) Record write time (msec) TOF Temperature Tracker pedestals (a) Tracker pedestals (b) (a) BLOB array is stored inside the table, (b) - outside  TOF temperature (many short records)  Tracker pedestals (small amount of large records)

A.Klimentov AMS software and data handling CHEP0117 Oracle RDBMS to store Tags and TDVs Currently 8 Gbyte is stored in the Conditions DB (115 different TDV types) 100 million event tags are stored in Tag DB Oracle RDBMS performance and functionality satisfy AMS requirements. Using of bitmap indices for tags improves query time dramatically. The current implementation works with distributed CORBA technology. It allowes to reduce the number of database clients and machine loading.

A.Klimentov AMS software and data handling CHEP0118 producers Raw data server Oracle RDBMS Conditions DB Tag DB Active Tables : Hosts, Interfaces, Producers, Servers Catalogues server Nominal Tables Hosts, Interfaces Producers, Servers… ESD server Raw data {I} {II} {III} {IV} {V} {VI} {I} submit 1 st server {II} “cold” start {III} read “active” tables (available hosts, number of servers, producers, jobs/host) {IV} submit servers {V} get “run”info (runs to be processed, ESD output path) {VI} submit producers (LILO, LIRO,RIRO…) Notify servers AMS Production

A.Klimentov AMS software and data handling CHEP0119 AMS Production Highlights Stable running for more than 1 month Average efficiency 95% (98% without Oracle) Processes communication and control via Corba LSF for process submission Run Oracle server on AS4100 Alpha and Oracle clients on Linux. Oracle RDBMS  Tag DB with 100M entries  Conditions DB with 100K entries  Bookkeeping  Production status  Runs history  File catalogues