LEP DATA PRESERVATION 11 years of data taking 4 Experiments Large Luminosity ~1200 Scientific Papers ALEPH Raw data 5 Terabytes DST 800 Gigabytes Mini.

Slides:



Advertisements
Similar presentations
Rare B Decays Chris Parkes SUPA Postgraduate Lectures Introduction Radiative decays b  s  b  d  Electroweak Penguin b  sl + l - Rate Forward backward.
Advertisements

Jos Engelen CERN HEP and its data What is the problem? A possible way forward Permanent Access to the Records of Science Brussels - November 15 th 2007.
Resources for the ATLAS Offline Computing Basis for the Estimates ATLAS Distributed Computing Model Cost Estimates Present Status Sharing of Resources.
Electroweak b physics at LEP V. Ciulli INFN Firenze.
What have we done? What are we doing? What can we do? Travis Brooks (SLAC) Zaven Akopov (DESY)
Information Systems and Data Acquisition for ATLAS What was achievedWhat is proposedTasks Database Access DCS TDAQ Athena ConditionsDB Time varying data.
Recent Electroweak Results from the Tevatron Weak Interactions and Neutrinos Workshop Delphi, Greece, 6-11 June, 2005 Dhiman Chakraborty Northern Illinois.
ATLAS Analysis Model. Introduction On Feb 11, 2008 the Analysis Model Forum published a report (D. Costanzo, I. Hinchliffe, S. Menke, ATL- GEN-INT )
Top Physics at the Tevatron Mike Arov (Louisiana Tech University) for D0 and CDF Collaborations 1.
ATLAS Authorship Policy R. Voss Physics Department, CERN IUPAP C11 ICHEP’04, Beijing, China, August 18, 2004.
Top Quark Physics: An Overview Young Scientists’ Workshop, Ringberg castle, July 21 st 2006 Andrea Bangert.
Patrick Janot Introduction  TLEP / FCC-ee u Physics, Experiments, Detectors break-out session l Conveners: Alain Blondel, Patrick Janot è Follows six.
DATA PRESERVATION IN ALICE FEDERICO CARMINATI. MOTIVATION ALICE is a 150 M CHF investment by a large scientific community The ALICE data is unique and.
EPLC Deliverables Sherry Brown-Scoggins & Wanda Hall
International collaboration in high energy physics experiments  All large high energy physics experiments today are strongly international.  A necessary.
European Organization for Nuclear Research Organisation Européenne pour la Recherche Nucléaire CDS Invenio CERN’s open source digital library information.
JINR DOCUMENT SERVER: Current Status and Future Plans I. Filozova 1, S. Kuniaev 2, G. Musulmanbekov 1, R. Semenov 1, G. Shestakova 1, P. Ustenko 2, T.Zaikina.
 ATLAS Data Preservation and Access Roger Jones.
Institute for Anything of the University of Everything Claudia-Elisabeth Wulz New Physics at the LHC C.-E. Wulz (Institute of High Energy Physics) 1 Institute.
Human Resource Management Lecture 27 MGT 350. Last Lecture What is change. why do we require change. You have to be comfortable with the change before.
Evaluation of software engineering. Software engineering research : Research in SE aims to achieve two main goals: 1) To increase the knowledge about.
HERA/LHC Workshop, MC Tools working group, HzTool, JetWeb and CEDAR Tools for validating and tuning MC models Ben Waugh, UCL Workshop on.
Software Engineering Saeed Akhtar The University of Lahore Lecture 8 Originally shared for: mashhoood.webs.com.
Analysis Plans for Jets + EtMiss Signatures Pierre Savard ATLAS Toronto Group Meeting January
OARE Module 5A: Scopus (Elsevier). Table of Contents About Scopus (Elsevier) Using Scopus Search Page Results/Refine Search Pages Download, PDF, Export,
P. Schirmbacher Humboldt-Universität zu Berlin The Changing Process of Scholarly Publishing or the Necessity of a New Culture of Electronic.
Topic Rathachai Chawuthai Information Management CSIM / AIT Review Draft/Issued document 0.1.
1 C.Diaconu, DPHEP3, CERN, December 7-9, 2009 Blueprint Start the production of a detailed document on data preservation – Gets in details of the individual.
Scenarios for long term analysis (Summary) Stephen Wolbers Fermilab Workshop on Data Preservation and Long Term Analysis in HEP DESY, January 26-28, 2009.
European Organization for Nuclear Research Organisation Européenne pour la Recherche Nucléaire High-Energy Physics Data Delivering Data in Science ICSTI.
Lecture # 3 & 4 Chapter # 2 Database System Concepts and Architecture Muhammad Emran Database Systems 1.
WGISS /09/2015 DATA PRESERVATION – CNES APPROACH B. Chausserie-Laprée.
Selection Strategies for Digital Institutional Repositories Kent Woynowski 30 September 2004.
Possibility of tan  measurement with in CMS Majid Hashemi CERN, CMS IPM,Tehran,Iran QCD and Hadronic Interactions, March 2005, La Thuile, Italy.
Nature Reviews/2012. Next-Generation Sequencing (NGS): Data Generation NGS will generate more broadly applicable data for various novel functional assays.
Peter Granda Archival Assistant Director / Data Archives and Data Producers: A Cooperative Partnership.
Marco Cattaneo, Aleph plenary, 23rd April Long term archive of LEP data  LEPC working group report Purpose Assumptions Conclusions  Physics goals.
OAIS Rathachai Chawuthai Information Management CSIM / AIT Issued document 1.0.
Examples for Open Access Scholar Electronic Repository by New Bulgarian University IP LibCMASS Sofia 2011 Contract № 2011-ERA-IP-7 Sofia, September,
Intellectual Works and their Manifestations Representation of Information Objects IR Systems & Information objects Spring January, 2006 Bharat.
Status of OPAL data Matthias Schröder
Open Archive Workshop, CERN th March 2001 Peer Review - the HEP View Mick Draper, CERN ETT Division
Oct 12-14, 2003NSDL Challenges in Building Federation Services over Harvested Metadata Kurt Maly, Michael Nelson, Mohammad Zubair Digital Library.
DOE Data Management Plan Requirements
1 Pioneer Investments Legal and Compliance System Assessment Weekly Status Update June 23, 2005.
LHC Computing, CERN, & Federated Identities
18/12/2002 Status of L3 - Salvatore Mele Status of L3 Organization in 2002 Activity in 2002 Organization in 2003 Outlook Latest physics results in:
Physics Results from CDF and Prospects for a FY 2011 Run Kevin Pitts / University of Illinois DOE S&T Review of Scientific User Facilities June 30 – July.
Charged Higgs boson at the LHC 이강영 ( 건국대학교 연세대학교
RECFA September Kenneth Österberg, University of Helsinki Experimental e + e - physics – LEP&LC DELPHI Major finnish LEP contribution analysis.
LHCbComputing Computing for the LHCb Upgrade. 2 LHCb Upgrade: goal and timescale m LHCb upgrade will be operational after LS2 (~2020) m Increase significantly.
Preservation of LEP Data There is still hope Is there? Marcello Maggi, Ulrich Schwickerath, Matthias Schröder, , DPHEP7 1.
ATLAS Data preservation April 2015 Roger Jones for the ATLAS Collaboration.
International Collaboration for Data Preservation and Long Term Analysis in High Energy Physics RECODE - Final Workshop - January.
SciencePAD Open Software for Open Science Alberto Di Meglio – CERN.
F. Richard ECFA Study June 2008 A 4th generation scenario F. Richard LAL/Orsay Beyond the 3SM generation at the LHC era.
CERN Document Server 19 tth January 2006 CERN Document Server Jean-Yves Le Meur 19 th January 2006.
Future Colliders Gordon Watts University of Washington/Seattle APS NW Meeting May 12-14, 2016.
Chapter 1 Overview of Databases and Transaction Processing.
Physics activities toward the TDR will be coordinated by A. Bevan, D. Brown, M. Ciuchini and A. Stocchi A document, also in Italian is needed by the end.
Recent results on non-DDbar decays of  (3770) at BES HaiLong Ma [For BES Collaboration] The IVIIth Rencontres de Moriond session devoted to QCD AND HIGH.
ATLAS – statements of interest (1) A degree of hierarchy between the different computing facilities, with distinct roles at each level –Event filter Online.
HEP LTDP Use Case & EOSC Pilot
TechStambha PMP Certification Training
Tim Smith CERN Geneva, Switzerland
WW CROSS SECTIONS AND |Vcs| On behalf the LEP collaborations
Data Management: Documentation & Metadata
Archiving and Disseminating Historic Land Use Information
University of Tsukuba, Japan Particle Physics Phenomenology,
Building an open library without walls : Archiving of particle physics data and results for long-term access and use Joanne Yeomans CERN Scientific Information.
Presentation transcript:

LEP DATA PRESERVATION 11 years of data taking 4 Experiments Large Luminosity ~1200 Scientific Papers ALEPH Raw data 5 Terabytes DST 800 Gigabytes Mini 80 Gigabytes MC Files 16 T + 9 T Similar for other experiments

 Interest triggered by CERN directorate in year  Formal agreement between LEP experiments and IT department in  Working group active until 2004 with partial success History of Lep data archiving (1)

 Development by IT of a "museum computing system", based and frozen on existing lxplus technology/software, with access possibilities to (at present CASTOR) mass storage where all data are stored. These activities were started by Andreas Pfeiffer and Tony Cass. History of Lep data archiving (2)

 the safeguarding of 'standard' analysis framework software and of mini-data on a number of PC’s  the development of a modern C++ analysis framework (in some cases)  the establishment of rules for access to data by non-members of the Collaboration. History of Lep data archiving (3)

History of Lep data archiving- Aleph Statement (1) The data collected by the Aleph experiment in the years have been archived to allow their use for physics analyses after the closure of the Collaboration. The archiving includes the last set of simulated events and the most updated version of the analysis software. Limitations. The available information is not sufficient to repeat all analyses, particularly when systematic effects play an important role as, for instance, for precision measurements in the electroweak sector. Examples of physics analyses that cannot be repeated on archived data are  The measurement of the Z lineshape  The measurement of the W mass  The measurement of the tau polarization  The measurement of lepton and quark forward-backward asymmetries  Most heavy flavour measurements, such as the measurement of Rb, of the CKM matrix elements, of Bd and Bs oscillations  The searches for the Higgs boson  Many searches in the Susy sector

History of Lep data archiving- Aleph Statement (2) Authorized Users. The use of archived Aleph data is authorized to former members of the Aleph Collaboration and their collaborators. The use of a subset of data for teaching and pedagogical purposes, under the guidance of former members of the Collaboration, is allowed. Authorship. The publication of results based on archived Aleph data is not allowed until 1 year after the official termination of the Collaboration, foreseen for the end of The authors of the analysis take full responsibility for the publication. Any figure, plot or table using Aleph data should contain the label “ALEPH Archived Data”. A reference to the present document “Statement on the use of Aleph data for long-term analyses” must be present in the publication.

Special Case : ALEPH QCD archive 

THE PROBLEM of HEP data preservation  The HEP data model is a highly complex data model (from the start difficult to export to OA a` la astronomy)  Raw data -> calibrated data -> skimmed data -> high-level objects  Final results depend on all the grey-literature on constants, human knowledge, algorithms which are needed for each pass  Experiment lifetimes > computing environment lifetimes. Many migrations within the lifetime or an experiment (in this sense preservation is not an issue !)

Lesson learned from LEP  Apart from publication of numbers or tables, no real OA Either little useful or little usable (with small exceptions): continuous need for additional knowledge, difficult to encode and store.  Regardless of community openness in pre-printing, wide-spreading of preliminary results at conferences and insider information, little priority on OA bringing to partial failures of LEP data archiving for the "general" public.  Need force-majeure (Discovery at LHC of something we should have seen at LEP?) to access data again.  Final results (containing additional unpublished information) but also high-level objects have been already combined (LEP Electroweak vs LEP Higgs)

The "Parallel way" to archiving and publishing data  In addition to internal data models, elaborate a parallel format for useful and usable high-level objects  Publish high-level objects behind each scientific paper (after a time lapse?)  Publish all high-level objects after end of collaboration  Address issues of accountability, reproducibility of results, "careless discovers", "careless measurements"

A possible R&D program  Use LEP as a case study for information retrieval to better assess the different methods  Define some high-level object to make a OA-based analysis possible for an "external" but "motivated" researcher of the field  Propose strategies to define "parallel" high-level objects to be included in the LHC data model, that is not post-mortem but aim to make it part of the data-model designing process. This is very timely.  Imagine solutions to expand digital-library records of experimental results to include the OA data behind the results  Initiate a discussion on priority issues and time-delays in making these "parallel" high-level objects available. This is very timely. Credit: to Salvatore Mele for many of the ideas in these slides