Preservation of LEP Data There is still hope Is there? Marcello Maggi, Ulrich Schwickerath, Matthias Schröder, 22.3.13, DPHEP7 1.

Slides:



Advertisements
Similar presentations
Replicating Data from the Large Electron Positron (LEP) collider at CERN (Aleph Experiment) Under the DPHEP umbrella Marcello Maggi/INFN –Bari Tommaso.
Advertisements

MEG-Review Feb MEG Software Group MEG Software Status Framework for MC, Schedule, DC reconstruction update and Discussion on a ROOT-based offline.
Simulation / Reconstruction Working group Toby Burnett University of Washington Jan 2000 T.
Simulation / Reconstruction Working group Toby Burnett University of Washington 11 Jan 2000 T.
DATA PRESERVATION IN ALICE FEDERICO CARMINATI. MOTIVATION ALICE is a 150 M CHF investment by a large scientific community The ALICE data is unique and.
The D0 Monte Carlo Challenge Gregory E. Graham University of Maryland (for the D0 Collaboration) February 8, 2000 CHEP 2000.
Status of Hall C 6 GeV Analysis Software Robust Fortran/CERNLIB code, “ENGINE”, for analysis of HMS/SOS coincidence and single arm experiments that has.
Shuei MEG review meeting, 2 July MEG Software Status MEG Software Group Framework Large Prototype software updates Database ROME Monte Carlo.
Remote Production and Regional Analysis Centers Iain Bertram 24 May 2002 Draft 1 Lancaster University.
Status of Hall C 6 GeV Analysis Software Robust Fortran/CERNLIB code, “ENGINE”, for analysis of HMS/SOS coincidence and single arm experiments that has.
03/27/2003CHEP20031 Remote Operation of a Monte Carlo Production Farm Using Globus Dirk Hufnagel, Teela Pulliam, Thomas Allmendinger, Klaus Honscheid (Ohio.
HERA/LHC Workshop, MC Tools working group, HzTool, JetWeb and CEDAR Tools for validating and tuning MC models Ben Waugh, UCL Workshop on.
As of 28 Juni 2005Getting Starged with GEM - Shuei Yamada 1 Getting Started with GEM Shuei YAMADA ICEPP, University of Tokyo What is GEM? Before you start.
Updating JUPITER framework using XML interface Kobe University Susumu Kishimoto.
ATLAS and GridPP GridPP Collaboration Meeting, Edinburgh, 5 th November 2001 RWL Jones, Lancaster University.
PHENIX Simulation System 1 December 7, 1999 Simulation: Status and Milestones Tarun Ghosh, Indrani Ojha, Charles Vanderbilt University.
CDF Offline Production Farms Stephen Wolbers for the CDF Production Farms Group May 30, 2001.
5 May 98 1 Jürgen Knobloch Computing Planning for ATLAS ATLAS Software Week 5 May 1998 Jürgen Knobloch Slides also on:
ALICE Simulation Framework Ivana Hrivnacova 1 and Andreas Morsch 2 1 NPI ASCR, Rez, Czech Republic 2 CERN, Geneva, Switzerland For the ALICE Collaboration.
Status report from T2K-SK group Task list of this group discussion about NEUT Kaneyuki, Walter, Konaka We have just started the discussion.
MINER A Software The Goals Software being developed have to be portable maintainable over the expected lifetime of the experiment extensible accessible.
ATLAS Data Challenges US ATLAS Physics & Computing ANL October 30th 2001 Gilbert Poulard CERN EP-ATC.
Update on MC-04(05) –New interaction region –Attenuation lengths vs endcap module numbers –Regeneration and nuclear interaction –EMC time resolution –
Long Term Data Preservation Frank Berghaus On Behalf of the DPHEP Collaboration 06/03/15 Data Preservation - Frank Berghaus1.
Marco Cattaneo, Aleph plenary, 23rd April Long term archive of LEP data  LEPC working group report Purpose Assumptions Conclusions  Physics goals.
GDB Meeting - 10 June 2003 ATLAS Offline Software David R. Quarrie Lawrence Berkeley National Laboratory
AliRoot survey P.Hristov 11/06/2013. Offline framework  AliRoot in development since 1998  Directly based on ROOT  Used since the detector TDR’s for.
Status of OPAL data Matthias Schröder
Introduction What is detector simulation? A detector simulation program must provide the possibility of describing accurately an experimental setup (both.
CERN SRM Development Benjamin Coutourier Shaun de Witt CHEP06 - Mumbai.
2-Dec Offline Report Matthias Schröder Topics: Scientific Linux Fatmen Monte Carlo Production.
Simulation Commissioning, Validation, Data Quality A brain dump to prompt discussion Many points applicable to any of LHCb software but some simulation.
CBM ECAL simulation status Prokudin Mikhail ITEP.
GLAST LAT Offline SoftwareCore review, Jan. 17, 2001 Review of the “Core” software: Introduction Environment: THB, Thomas, Ian, Heather Geometry: Joanne.
SoLID simulation with GEMC Zhiwen Zhao 2015/03/26.
Status of the LAr OO Reconstruction Srini Rajagopalan ATLAS Larg Week December 7, 1999.
Why A Software Review? Now have experience of real data and first major analysis results –What have we learned? –How should that change what we do next.
Computing R&D and Milestones LHCb Plenary June 18th, 1998 These slides are on WWW at:
5 June 2003Alan Norton / Focus / EP Topics1 Other EP Topics Some 2003 Running Experiments - NA48/2 (Flavio Marchetto) - Compass (Benigno Gobbo) - NA60.
Upgrade Software University and INFN Catania Upgrade Software Alessia Tricomi University and INFN Catania CMS Trigger Workshop CERN, 23 July 2009.
T. Lari – INFN Milan Status of ATLAS Pixel Test beam simulation Status of the validation studies with test-beam data of the Geant4 simulation and Pixel.
Andrea Valassi (CERN IT-DB)CHEP 2004 Poster Session (Thursday, 30 September 2004) 1 HARP DATA AND SOFTWARE MIGRATION FROM TO ORACLE Authors: A.Valassi,
Remote Institute Tasks Frank Filthaut 11 February 2002  Monte Carlo production  Algorithm development  Alignment, calibration  Data analysis  Data.
Computing Issues for the ATLAS SWT2. What is SWT2? SWT2 is the U.S. ATLAS Southwestern Tier 2 Consortium UTA is lead institution, along with University.
LHCb Data Challenge in 2002 A.Tsaregorodtsev, CPPM, Marseille DataGRID France meeting, Lyon, 18 April 2002.
ATLAS Distributed Computing perspectives for Run-2 Simone Campana CERN-IT/SDC on behalf of ADC.
The MEG Offline Project General Architecture Offline Organization Responsibilities Milestones PSI 2/7/2004Corrado Gatto INFN.
ATLAS Data preservation April 2015 Roger Jones for the ATLAS Collaboration.
Status of Hall C 6 GeV Analysis Software Robust Fortran/CERNLIB code, “ENGINE”, for analysis of HMS/SOS coincidence and single arm experiments that has.
R 3 B Calorimeter Simulation H. Alvarez Pol – R 3 B Calorimeter Simulation NUSTAR Calorimeter WG – Valencia 17/06/05 H. Alvarez Pol, D. Cortina, I. Durán.
Lcsim software: status and future plans ECFA-LC DESY May 30, 2013 Norman Graf (for the sim/reco group)
VI/ CERN Dec 4 CMS Software Architecture vs Hybrid Store Vincenzo Innocente CMS Week CERN, Dec
2-December Offline Report Matthias Schröder Topics: Monte Carlo Production New Linux Version Tape Handling Desktop Computers.
PHENIX Simulation System 1 September 8, 1999 Simulation Work-in-Progress: ROOT-in-PISA Indrani Ojha Banaras Hindu University and Vanderbilt.
Usecases: 1.ISIS Neutron Source 2.DP for HEP Matthew Viljoen STFC, UK APARSEN-EGI workshop: preserving big data for research Amsterdam Science Park 4-6.
Status of the ALCPG simulation & reconstruction Norman Graf (SLAC) TILC09, Tsukuba April 19, 2009.
1 Calice TB Review DESY 15/6/06D.R. Ward David Ward Post mortem on May’06 DESY running. What’s still needed for DESY analysis? What’s needed for CERN data.
LEP DATA PRESERVATION 11 years of data taking 4 Experiments Large Luminosity ~1200 Scientific Papers ALEPH Raw data 5 Terabytes DST 800 Gigabytes Mini.
4 Dec., 2001 Software Week Data flow in the LArG Reconstruction software chain Updated status for various reconstruction algorithm LAr Converters and miscellaneous.
1 GlueX Software Oct. 21, 2004 D. Lawrence, JLab.
LHCb Software Week 25/11/99 Gonzalo Gracia Abril 1 r Status of Geant4 in LHCb. r Ideas on how to populate the LHCb Detector Description Data Base (LHCb.
LHCb Computing 2015 Q3 Report Stefan Roiser LHCC Referees Meeting 1 December 2015.
BESIII data processing
CMS High Level Trigger Configuration Management
ALICE analysis preservation
US ATLAS Physics & Computing
OffLine Physics Computing
ATLAS DC2 & Continuous production
HEC Beam Test Software schematic view T D S MC events ASCII-TDS
Presentation transcript:

Preservation of LEP Data There is still hope Is there? Marcello Maggi, Ulrich Schwickerath, Matthias Schröder, , DPHEP7 1

LEP? Large Electron Positron Collider Four general purpose experiments – ALEPH, DELPHI, L3, OPAL – Varying detector layouts – Varying data structures – Varying software approaches Optimization Personal preferences Data taking: 1989 – 2000 – ‘89 to ‘95: at and around Z-peak (91 GeV) – ‘95 to ‘00 increase in various steps up to 209 GeV Marcello Maggi, Ulrich Schwickerath, Matthias Schröder, , DPHEP7 2

Publications Individual publications – Several hundred per collaboration – Rate decreased ~2004 Now in the (long?) tail Combined publications – Dedicated topics – WG ‘s with experts from all collaborations – Based of analysis results, not common analysis Topic specific, in general (hbook) ntuples Traces on HepData and Rivet Marcello Maggi, Ulrich Schwickerath, Matthias Schröder, , DPHEP7 3

Motivation for DP Unique data sets – Some ‘overlap’ with SLD data Profit from improved theoretical models Profit from new results at other facilities Allow combining data sets Open Access Education Use case determines preservation needs Marcello Maggi, Ulrich Schwickerath, Matthias Schröder, , DPHEP7 4

New Theoretical Models Require full Monte Carlo production chain – New four-vectors Standard format – Full detector simulation Geometry (Geant3) Digitization – Full detector reconstruction – Old MC samples for consistency / validation check – Analysis Marcello Maggi, Ulrich Schwickerath, Matthias Schröder, , DPHEP7 5

New Results at other Facilities Sufficient to access reconstructed objects? – Probably, but why limit yourself Marcello Maggi, Ulrich Schwickerath, Matthias Schröder, , DPHEP7 6

Combining Data Sets Requires common object definition Requires common data format Translate assumptions coded in framework Only useful at analysis level – Or rewrite all simulation & reconstruction code Marcello Maggi, Ulrich Schwickerath, Matthias Schröder, , DPHEP7 7

Open Access Policies: – df df – Archive/Delphi-access-FINALrules pdf Archive/Delphi-access-FINALrules pdf – Archive/Opal-futureUseOfData.pdf Archive/Opal-futureUseOfData.pdf For education: – Often done by former collaborators – Using (hbook) ntuples or DSTs Marcello Maggi, Ulrich Schwickerath, Matthias Schröder, , DPHEP7 8

Data RAW data Processed data – Various levels of ‘abstraction’ per experiment Selections – Tagging – Streaming DSTs File catalogs ~100 TB per experiment Stored at CERN (castor, dual copies) Outside copies desirable Marcello Maggi, Ulrich Schwickerath, Matthias Schröder, , DPHEP7 9

Software Mature (dated?) Fortran – Memory management + IO BOS ZEBRA – Little (but some  ) commercial libraries – HEPDB Calibration Status updates – HBOOK Portable – Proven through lifetime of experiments IBM mainframe, Cray, DEC (VMS & Ultrix), Apollo, HPUX, SGI, Linux Maintainable? – So far non-disruptive(?) – For how much longer? Marcello Maggi, Ulrich Schwickerath, Matthias Schröder, , DPHEP7 10

Tools Wrappers Scripts – Data management – MC production chain Aleph & Delphi still exercising it – One with one w/o book-keeping OPAL’s died with fatmen  – Job control Environment variables – Paths, version numbers, … Marcello Maggi, Ulrich Schwickerath, Matthias Schröder, , DPHEP7 11

Monte Carlo Generators Detector Geometry & Simulation – Geant3 – Detector specific tuning 4-vector files Digitized samples (?) Processed events Production chains Marcello Maggi, Ulrich Schwickerath, Matthias Schröder, , DPHEP7 12

Documentation Mainly targeting newcomers & collaborators – (Very?) steep learning curve – Rarely sufficient w/o experiment expert And their memories also have a half-life Format – Latex – Html – PS – ? Accessible? – review documentation considered ‘internal’ Searchable? Probably un-versioned Required: Documentation for external dependencies (CERNLIB) Marcello Maggi, Ulrich Schwickerath, Matthias Schröder, , DPHEP7 13

Current Status (Very) Few people still doing analysis – Using RAW, DSTs and (hbook) ntuples? Most experts busy with new experiments – Or retired, or… No resources – Rely on individual contributions – No room for experiment specific effort Trying to raise awareness for DP… – New data is so much more tempting Marcello Maggi, Ulrich Schwickerath, Matthias Schröder, , DPHEP7 14

DP Approaches Freeze current environment in VM – Until when will hypervisors run them? – Have to make all data ‘local’? Keep alive by continued porting – External dependencies CERNLIB BOS ? – When will compilers refuse to accept the code – Date formats? – Delphi: all sources put on standalone CD Migrate to more modern format – Information loss – Rewrite all software Play safe – don’t put all your data in one tier – In every aspect! Marcello Maggi, Ulrich Schwickerath, Matthias Schröder, , DPHEP7 15