Analysis framework plans A.Gheata Offline week 13 July 2011.

Slides:



Advertisements
Similar presentations
Network II.5 simulator ..
Advertisements

DEV300: A Tiny CPU and OS in C# Scott Hanselman Technology Evangelist/Architect Corillian Corporation
The LEGO Train Framework
Process Description and Control Module 1.0. Major Requirements of an Operating System Interleave the execution of several processes to maximize processor.
Processes CSCI 444/544 Operating Systems Fall 2008.
Operating System Support Focus on Architecture
CS 104 Introduction to Computer Science and Graphics Problems
Home: Phones OFF Please Unix Kernel Parminder Singh Kang Home:
Efficiently Sharing Common Data HTCondor Week 2015 Zach Miller Center for High Throughput Computing Department of Computer Sciences.
Automated Tests in NICOS Nightly Control System Alexander Undrus Brookhaven National Laboratory, Upton, NY Software testing is a difficult, time-consuming.
Trains status&tests M. Gheata. Train types run centrally FILTERING – Default trains for p-p and Pb-Pb, data and MC (4) Special configuration need to be.
HP Project & Portfolio Management Resource Management July 11, 2007 CIMpleBS.com/HP-PPM/Training-Resource Management.pps By Dan Gallagher See NOTES page.
Virtual Memory.
CE Operating Systems Lecture 5 Processes. Overview of lecture In this lecture we will be looking at What is a process? Structure of a process Process.
Unit tests, Integration tests Physics tests Andrea Dotti, Gunter Folger, Pere Mato CERN – PH/SFT Geant4 workshop 2012.
FINAL MPX DELIVERABLE Due when you schedule your interview and presentation.
Design and Programming Chapter 7 Applied Software Project Management, Stellman & Greene See also:
Semantic Interoperability Berlin, 25 March 2008 Semantically Enhanced Resource Allocator Marc de Palol Jorge Ejarque, Iñigo Goiri, Ferran Julià, Jordi.
Chapter 4 Processes. Process: what is it? A program in execution A program in execution usually usually Can also have suspended or waiting processes Can.
ISM 5316 Week 3 Learning Objectives You should be able to: u Define and list issues and steps in Project Integration u List and describe the components.
Costin Grigoras ALICE Offline. In the period of steady LHC operation, The Grid usage is constant and high and, as foreseen, is used for massive RAW and.
Analysis infrastructure/framework A collection of questions, observations, suggestions concerning analysis infrastructure and framework Compiled by Marco.
Multiprogramming. Readings r Silberschatz, Galvin, Gagne, “Operating System Concepts”, 8 th edition: Chapter 3.1, 3.2.
1 Control Software (CAT) Introduction USB Interface implementation Calorimeter Electronics Upgrade Meeting Frédéric Machefert Wednesday 5 th May, 2010.
Topics Covered Phase 1: Preliminary investigation Phase 1: Preliminary investigation Phase 2: Feasibility Study Phase 2: Feasibility Study Phase 3: System.
Infrastructure for QA and automatic trending F. Bellini, M. Germain ALICE Offline Week, 19 th November 2014.
PWG3 Analysis: status, experience, requests Andrea Dainese on behalf of PWG3 ALICE Offline Week, CERN, Andrea Dainese 1.
Andrei Gheata, Mihaela Gheata, Andreas Morsch ALICE offline week, 5-9 July 2010.
Analysis trains – Status & experience from operation Mihaela Gheata.
Interrupt driven I/O. MIPS RISC Exception Mechanism The processor operates in The processor operates in user mode user mode kernel mode kernel mode Access.
5/2/  Online  Offline 5/2/20072  Online  Raw data : within the DAQ monitoring framework  Reconstructed data : with the HLT monitoring framework.
Planning and status of the Full Dress Rehearsal Latchezar Betev ALICE Offline week, Oct.12, 2007.
Silberschatz, Galvin and Gagne  Operating System Concepts UNIT II Operating System Services.
1 Computer Systems II Introduction to Processes. 2 First Two Major Computer System Evolution Steps Led to the idea of multiprogramming (multiple concurrent.
CERN – Alice Offline – Thu, 20 Mar 2008 – Marco MEONI - 1 Status of Cosmic Reconstruction Offline weekly meeting.
PHENIX and the data grid >400 collaborators 3 continents + Israel +Brazil 100’s of TB of data per year Complex data with multiple disparate physics goals.
Click to add text Systems Analysis, Prototyping and Iteration.
Copyright 2010, The World Bank Group. All Rights Reserved. Recommended Tabulations and Dissemination Section B.
Interrupt driven I/O Computer Organization and Assembly Language: Module 12.
Recent Enhancements to Quality Assurance and Case Management within the Emissions Modeling Framework Alison Eyth, R. Partheepan, Q. He Carolina Environmental.
SPI NIGHTLIES Alex Hodgkins. SPI nightlies  Build and test various software projects each night  Provide a nightlies summary page that displays all.
Update on G5 prototype Andrei Gheata Computing Upgrade Weekly Meeting 26 June 2012.
A. Gheata, ALICE offline week March 09 Status of the analysis framework.
AliRoot survey: Analysis P.Hristov 11/06/2013. Are you involved in analysis activities?(85.1% Yes, 14.9% No) 2 Involved since 4.5±2.4 years Dedicated.
1 Offline Week, October 28 th 2009 PWG3-Muon: Analysis Status From ESD to AOD:  inclusion of MC branch in the AOD  standard AOD creation for PDC09 files.
Predrag Buncic CERN ALICE Status Report LHCC Referee Meeting 01/12/2015.
Data processing Offline review Feb 2, Productions, tools and results Three basic types of processing RAW MC Trains/AODs I will go through these.
M. Gheata ALICE offline week, October Current train wagons GroupAOD producersWork on ESD input Work on AOD input PWG PWG31 (vertexing)2 (+
PWG3 Analysis: status, experience, requests Andrea Dainese on behalf of PWG3 ALICE Offline Week, CERN, Andrea Dainese 1.
PWG3 analysis (barrel)
Analysis train M.Gheata ALICE offline week, 17 March '09.
M. Gheata ALICE offline week, 24 June  A new analysis train macro was designed for production  /ANALYSIS/macros/AnalysisTrainNew.C /ANALYSIS/macros/AnalysisTrainNew.C.
Framework support for Accelerators Sami Kama. Introduction Current Status Future Accelerator use modes Symmetric resource Asymmetric resource 09/11/2015.
David Adams ATLAS ADA: ATLAS Distributed Analysis David Adams BNL December 15, 2003 PPDG Collaboration Meeting LBL.
Active-HDL Server Farm Course 11. All materials updated on: September 30, 2004 Outline 1.Introduction 2.Advantages 3.Requirements 4.Installation 5.Architecture.
Some topics for discussion 31/03/2016 P. Hristov 1.
Cofax Scalability Document Version Scaling Cofax in General The scalability of Cofax is directly related to the system software, hardware and network.
1 Module 3: Processes Reading: Chapter Next Module: –Inter-process Communication –Process Scheduling –Reading: Chapter 4.5, 6.1 – 6.3.
ANALYSIS TRAIN ON THE GRID Mihaela Gheata. AOD production train ◦ AOD production will be organized in a ‘train’ of tasks ◦ To maximize efficiency of full.
MANAGEMENT INFORMATION SYSTEM
Multiprogramming. Readings r Chapter 2.1 of the textbook.
Jan Fiete Grosse-Oetringhaus
Process Management Process Concept Why only the global variables?
Analysis trains – Status & experience from operation
ALICE analysis preservation
Analysis Trains - Reloaded
Systems Analysis and Design
Analysis framework - status
System calls….. C-program->POSIX call
Mr. M. D. Jamadar Assistant Professor
Presentation transcript:

Analysis framework plans A.Gheata Offline week 13 July 2011

Some facts The number of analysis tasks registered in the system constantly increase with time, resources stay mostly constant. Existing analysis code growing fat acording to: « Make it work then add on top » Old data need to be reprocessed regularly while new data filling the pipeline. Most analysis code not written by expert C++ developers. The framework was developed to give freedom, not put limits, but the free space has been exhausted and it becomes more difficult to face contingencies…

Action list 1.Assess correctly the current status in terms of CPU share and disk space: – Ratio between organized/chaotic analysis – Central productions: automatic/on demand – Estimate train costs before launching (!) 2.Rationalize production requests and central trains – Assess priorities and run a train queue – Implement more train management tools to be used by experts 3.Provide more support and encourage organized analysis – Build tools to facilitate assembling trains while factorizing the train management effort – Give regular slots in the production queue to PWG analysis 4.Implement tools for getting insight over what is done wrong in analysis – Automatic diagnosis of memory, CPU, histogram usage, common errors (mostly implemented) – Generation of test setups for individual tasks via the plug-in Leak detector, performance maps, output analyzer

Support for task metadata What for: Support for describing how to run a given task – Location of AddTask macro, libraries needed, handler type, utility tasks (e.g. physics selection, centrality, or tender)… – Embed this info in a class (e.g. AliAnalysisTaskConfig) Able to read/write from root or text file Use case: The plug-in could read one or more such modules and generate the train steering macro – This may give more flexibility and modularity in future This approach is based on some recent discussions for automating the train procedure that is being looked into

Task configuration snippet AliAnalysisTaskConfig myConfig("AliAnalysisTaskQAsym"); myConfig.SetLibs("libPWG1"); myConfig.SetMacro("$ALICE_ROOT/PWG1/PilotAnalysis/AddTaskQAsim.C"); myConfig.SetArguments(""); myConfig.SetDataType("ESD AOD MC"); myConfig.SetDependencies("AliPhysicsSelectionTask") plugin->AddModule(&myConfig);... plugin->GenerateTrain("full") The configuration can be preloaded from a root file or text produced via web form, then altered if needed (add dependencies, run parameters,...) Many of the configuration needed for existing will probably stay unchanged and persistent (OADB) Trains can be configured and assembled on the fly by PWG train managers (see Jan-Fiete's proposal) The skeleton of such generated train already discussed, waiting for implementation

Analysis trains operated by PWG Purpose: test and operate physics trains according a central schedule – To change the balance between chaotic and organized analysis – To allow planning according priorities and share the production-related responsibilities with PWG PWG minimal trains to enter soon production phase – Tests already done, trains already prepared – Procedure for ~automatic testing implemented for PWG1 QA train (see Mihaela’s talk), to be done for PWGn once in production New exercise that will steam-up gradually

Analysis practice sniffer How many of your histograms actually get filled ? Do you know that the choice of your histogram types (e.g. TH2F vs. TH2S) highly impacts on the memory size and is rarely appropriate ? Do you know that for unoccupied un weighted large histograms you can get profit in using a histogram buffer ? Most of this can be automatically spotted and advertised by the framework in Terminate Some changes in ROOT would help – Lazy histogram array allocation, auto-buffering