Data Preparation Group Summary of the Activities

Slides:



Advertisements
Similar presentations
July 2010 D2.1 Upgrading strategy Javier Soto Catalog Release 3. Communities.
Advertisements

Validata Release Coordinator Accelerated application delivery through automated end-to-end release management.
HORIZONT 1 ProcMan ® The Handover Process Manager Product Presentation HORIZONT Software for Datacenters Garmischer Str. 8 D München Tel ++49(0)89.
Reconstruction and Analysis on Demand: A Success Story Christopher D. Jones Cornell University, USA.
Performance Testing Design By Omri Lapidot Symantec Corporation Mobile: At SIGiST Israel Meeting November 2007.
Trains status&tests M. Gheata. Train types run centrally FILTERING – Default trains for p-p and Pb-Pb, data and MC (4) Special configuration need to be.
Chiara Zampolli in collaboration with C. Cheshkov, A. Dainese ALICE Offline Week Feb 2009C. Zampolli 1.
LHC Experiment Dashboard Main areas covered by the Experiment Dashboard: Data processing monitoring (job monitoring) Data transfer monitoring Site/service.
Quality Control B. von Haller 8th June 2015 CERN.
03/27/2003CHEP20031 Remote Operation of a Monte Carlo Production Farm Using Globus Dirk Hufnagel, Teela Pulliam, Thomas Allmendinger, Klaus Honscheid (Ohio.
Plans for Trigger Software Validation During Running Trigger Data Quality Assurance Workshop May 6, 2008 Ricardo Gonçalo, David Strom.
Copyright © Yokogawa Electric Corporation Release 2.10 Functionality Overview September 2004.
Introduction Advantages/ disadvantages Code examples Speed Summary Running on the AOD Analysis Platforms 1/11/2007 Andrew Mehta.
ALICE, ATLAS, CMS & LHCb joint workshop on
Analysis infrastructure/framework A collection of questions, observations, suggestions concerning analysis infrastructure and framework Compiled by Marco.
Andreas Morsch, CERN EP/AIP CHEP 2003 Simulation in ALICE Andreas Morsch For the ALICE Offline Project 2003 Conference for Computing in High Energy and.
EGEE is a project funded by the European Union under contract IST HEP Use Cases for Grid Computing J. A. Templon Undecided (NIKHEF) Grid Tutorial,
Infrastructure for QA and automatic trending F. Bellini, M. Germain ALICE Offline Week, 19 th November 2014.
PWG3 Analysis: status, experience, requests Andrea Dainese on behalf of PWG3 ALICE Offline Week, CERN, Andrea Dainese 1.
Analysis trains – Status & experience from operation Mihaela Gheata.
ALICE Pixel Operational Experience R. Santoro On behalf of the ITS collaboration in the ALICE experiment at LHC.
September 2007CHEP 07 Conference 1 A software framework for Data Quality Monitoring in ATLAS S.Kolos, A.Corso-Radu University of California, Irvine, M.Hauschild.
1 Checks on SDD Data Piergiorgio Cerello, Francesco Prino, Melinda Siciliano.
A. Gheata, ALICE offline week March 09 Status of the analysis framework.
Pavel Nevski DDM Workshop BNL, September 27, 2006 JOB DEFINITION as a part of Production.
Predrag Buncic CERN Future of the Offline. Data Preparation Group.
Data processing Offline review Feb 2, Productions, tools and results Three basic types of processing RAW MC Trains/AODs I will go through these.
PWG3 analysis (barrel)
The DCS Databases Peter Chochula. 31/05/2005Peter Chochula 2 Outline PVSS basics (boring topic but useful if one wants to understand the DCS data flow)
1 A Scalable Distributed Data Management System for ATLAS David Cameron CERN CHEP 2006 Mumbai, India.
Analysis train M.Gheata ALICE offline week, 17 March '09.
Finding Data in ATLAS. May 22, 2009Jack Cranshaw (ANL)2 Starting Point Questions What is the latest reprocessing of cosmics? Are there are any AOD produced.
Some topics for discussion 31/03/2016 P. Hristov 1.
CDF SAM Deployment Status Doug Benjamin Duke University (for the CDF Data Handling Group)
Joe Foster 1 Two questions about datasets: –How do you find datasets with the processes, cuts, conditions you need for your analysis? –How do.
AliRoot survey: Calibration P.Hristov 11/06/2013.
1 CAA 20 th Cross Calibration Meeting, MPS, Gottingen 16th Oct 2014 CAA Cross Cal Meeting Oct 2014 Pipeline Automation Chris Perry.
7/8/2016 OAF Jean-Jacques Gras Stephen Jackson Blazej Kolad 1.
CALIBRATION: PREPARATION FOR RUN2 ALICE Offline Week, 25 June 2014 C. Zampolli.
1 MC Production and Reconstruction Summary and Plans Valeria Bartsch, Fabrizio Salvatore, A.-M. Magnan, and Nigel Watson.
Calibration: preparation for pa
Graphical Data Engineering
Jacek Otwinowski (Data Preparation Group)
Overview Modern chip designs have multiple IP components with different process, voltage, temperature sensitivities Optimizing mix to different customer.
Estimate Testing Size and Effort Using Test Case Point Analysis
Software Project Configuration Management
Database Replication and Monitoring
PLM, Document and Workflow Management
Analysis trains – Status & experience from operation
CMS High Level Trigger Configuration Management
Tracker Upgrade Simulations Software Harry Cheung (Fermilab)
Developments of the PWG3 muon analysis code
DPG Activities DPG Session, ALICE Monthly Mini Week
Chapter 18 Maintaining Information Systems
David Adams Brookhaven National Laboratory September 28, 2006
Massimo Masera INFN sezione di Torino
ALICE analysis preservation
MICE Collaboration Meeting Saturday 22nd October 2005 Malcolm Ellis
News on the CDB Framework
fields of possible improvement
Existing Perl/Oracle Pipeline
Offline shifter training tutorial
Analysis Trains - Reloaded
Jacek Otwinowski (for the DPG QA tools and WP7 groups)
Analysis framework - status
Overview of CLAS12 Calibration
QA tools – introduction and summary of activities
Customization Guidelines for BMC Remedy IT Service Management 7.5
Customization Guidelines for BMC Remedy IT Service Management 7.5
Use of Geant4 in experiment interactive frameworks AliRoot
Presentation transcript:

Data Preparation Group Summary of the Activities C. Zampolli 7 June 2016

DPG Organization 07/06/16 C. Zampolli - 07 June 2016

Analysis Objects and Tools DPG Structure DPG Coordination PROC Processing QAT QA and Tools AOT Analysis Objects and Tools 07/06/16 C. Zampolli - 07 June 2016

DPG Structure DPG Coordination PROC QAT AOT Data Monte Carlo Processing QAT QA and Tools AOT Analysis Objects and Tools Data preparation and follow-up Monte Carlo preparation and follow-up Processing execution 07/06/16 C. Zampolli - 07 June 2016

DPG Structure DPG Coordination QAT AOT Data Monte Carlo QA and Tools AOT Analysis Objects and Tools preparation and follow-up Data Monte Carlo preparation and follow-up Processing execution Quality Assurance planning and execution Tools for Quality Assurance and validation PROC Processing 07/06/16 C. Zampolli - 07 June 2016

DPG Structure DPG Coordination AOT Quality Assurance Tools Data Analysis Objects and Tools Quality Assurance planning and execution for Quality Assurance and validation Tools preparation and follow-up Data Monte Carlo preparation and follow-up Processing execution AOD definition and maintenance Event properties Global Tracks properties PROC Processing QAT QA and Tools 07/06/16 C. Zampolli - 07 June 2016

DPG Structure DPG Coordination Quality Assurance Monte Carlo planning and execution for Quality Assurance and validation Tools preparation and follow-up Data definition and maintenance AOD Global Tracks properties Monte Carlo preparation and follow-up Processing execution Event properties PROC Processing QAT QA and Tools AOT Analysis Objects and Tools 07/06/16 C. Zampolli - 07 June 2016

DPG People DPG Coordination PROC QAT AOT Processing QA and Tools Chiara Zampolli PROC Processing Roberto Preghenella Catalin-Lucian Ristea (*) Chiara Zampolli QAT QA and Tools Marie Germain Jacek Otwinowski (*) AOT Analysis Objects and Tools Catalin-Lucian Ristea (*), Constantin Loizides Andrea Dainese, Andrea Rossi (*) detectors PWGs detectors detectors PWGs (*) Institutional responsibilities Collaboration with members of old PWGPP (event selection, track selection, centrality) 07/06/16 C. Zampolli - 07 June 2016

DPG organization 2 Coordination weekly meetings, 1h long Wednesday, 13-14, dedicated to productions, to prepare for PB on Thu Friday, 13-14, general coordination meeting, to discuss DPG activities, and prepare for CB on Mon Dedicated coordination meetings to focus on specific topics organized ad-hoc General meeting during Mini-Weeks Tuesday morning (overlapping with PWG-DQ though) Agenda will include: Presentation(s) of DPG activities Topical talks Input from detectors, PWGs, … PWGPP-calibration meeting still used to discuss production-related technical issues DPG has one contact per detector (same as for Computing Board) and per PWG, plus external contacts (HLT, BTG, RC…) 07/06/16 C. Zampolli - 07 June 2016

DPG activities …in addition to continuous data production, Monte Carlo production, and QA activities… 07/06/16 C. Zampolli - 07 June 2016

Repository AliDPG repository created: https://github.com/alisw/AliDPG AOD MC QA bin Macros/scripts to process data to be added 07/06/16 C. Zampolli - 07 June 2016

Thanks to Giulio and Dario Repository AliDPG repository created: https://github.com/alisw/AliDPG Tagging strategy: Tagging for main AliRoot/AliPhysics version: E.g.: AliDPG-05-07-XX should work for all the AliPhysics/AliRoot oftype v5-07 Needs discussion within Offline AliRoot/AliPhysics tags need more stability for data production Release validation helps, but many pieces change and may interfere with data production Thanks to Giulio and Dario 07/06/16 C. Zampolli - 07 June 2016

Monte Carlo Simulations Setting up of Monte Carlo simulations in a new fashion: One steering script (dpgsim.sh) can trigger several options: OCDB snapshot creation  also for MC Simulation Reconstruction QA AOD production All the above dpgsim.sh takes as input the mode (above), generator (9 predefined + custom possible), run, energy, system, detector configuration (default or custom), B field, simulation config, reconstruction config, number of events, QA config, AOD config, UID (for seeding), bmin, bmax, … Detector configuration separated from generator configuration Custom configuration defined in separate files Roberto, Catalin 07/06/16 C. Zampolli - 07 June 2016

Monte Carlo Simulations Setting up of Monte Carlo simulations in a new fashion: One steering script (dpgsim.sh) can trigger several options: OCDB snapshot creation  also for MC Simulation Reconstruction QA AOD production All the above dpgsim.sh takes as input the mode (above), generator (9 predefined + custom possible), run, energy, system, detector configuration (default or custom), B field, simulation config, reconstruction config, number of events, QA config, AOD config, UID (for seeding), bmin, bmax, … Detector configuration separated from generator configuration Custom configuration defined in separate files Roberto, Catalin LHC16e3: first official production with AliDPG: many thanks to Costin! 07/06/16 C. Zampolli - 07 June 2016

DPG interface/database Brainstorming started on the “DPG interface” The goal is to have a tool that allows people (all ALICE users) to easily retrieve the information about the data produced and available in ALICE It will be, as first layer, an interface to several databases MonALISA, functional tests, QA… Should allow people to Relate datasets Retrieve run lists for specific analysis Verify usage of datasets Check the quality of a dataset As a whole, or in terms of single runs … The requirements are being defined, the implementation will come next Very demanding task! 07/06/16 C. Zampolli - 07 June 2016

QA tools Better interface to QA, more functionalities One single entry point for QA Correlations Comparisons Trending Visualization Easy/fast retrieval of information/data Scalability Exploration of what is on the market ElasticSearch + Kibana Jihyun Jacek Dynamicity 07/06/16 C. Zampolli - 07 June 2016

QA scheduling QA activities very demanding Marie Many productions (data, MC, reprocessing) always ongoing Might have different processing life and timings (e.g.: feedback on data might be crucial for data taking) Typically one person per detector assigned to all QA activities for that detector Effort to re-think the QA organization aiming at: More involvement of ALICE collaboration Faster feedback But: better to have good (or reliable) data slowly, than bad (or unreliable) data fast Requires: Robust and easy QA interfaces and tools, more automatization Could affect time scheduling, reshuffling of QA-ed systems For now, trying to improve what we have with as little changes as possible Changes in scheduling Active feedback from QA experts required More automatization Easier feedback mechanism (digest, table) Marie More discussions within the QA group Thanks to Costin for the technical support! 07/06/16 C. Zampolli - 07 June 2016

Physics Selection Recent changes in the Physics Selection to work also with AOD New cuts introduced for pp data Detailed presentation by Evgeny at the next Physics Forum Evgeny Constantin 07/06/16 C. Zampolli - 07 June 2016

Summary and conclusions The structure of the DPG is now defined Many activities are being followed, avoiding any rupture with PWGPP, and any hic-ups in the data and Monte Carlo processing flow Tools for ALICE users are being discussed and implemented, the challenge is in coupling simplicity (use) with complexity (content) If you want to reach us: alice-dpg-coordination@cern.ch alice-dpg-general@cern.ch alice-dpg-qa@cern.ch alice-dpg-aot-event-props@cern.ch alice-dpg-external-contacts@cern.ch alice-dpg-pwg-contacts@cern.ch 07/06/16 C. Zampolli - 07 June 2016