ATLAS Data Preservation and Access, Roger Jones


Data Preservation & Access
- Opening data access
  - Preparatory discussions with “management”, the CB chair, and the authorship and Pubcom chairs
  - Has clear implications for authorship/membership rules
  - Needs CB-level discussion
  - Past experience says these topics provoke long discussion in the CB!
- Common principles proposed by the LHC experiment Data Policy Harmonization Group (straw man)
  - This has been reviewed by the SIPB and taken to CERN Council to become a “policy suggestion”
- A draft policy is with the management for discussion and has been seen by the ICB

ATLAS DMP Organization
- Data Preservation now included as part of the upgrade activity planning
  - May increase the funding options; some evidence already
  - Data Management Planning is now required by some funders for upgrade grants
- Looking at the cost/benefit of various strategies
- Resource tensioning with other upgrade activities

Principles for preservation & access
- General agreement that RAW data is preserved for the experiment and for the future. Open access to it is not usually possible even for collaboration members, and is not proposed for general use (Level 4 data)
- Full reconstruction outputs for analysis might be made available after an embargo period, still to be decided but clearly of several years. The resource implications of making this useful are high (Level 3 data)
- We support limited access to samples in simple formats for outreach and teaching (Level 2 data), but these are best integrated into our presenter tools
- Techniques like Recast may make data (information) usefully available, although they do not meet all the open access criteria for Levels 2 & 3
- We already make data from papers and supporting information available through HepData/INSPIRE (Level 1 data)
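The four levels on this slide can be kept in a small lookup table. What follows is only a sketch in Python: the class name, field names and the `describe` helper are illustrative assumptions, and the access descriptions paraphrase the slide rather than quote any ATLAS policy text.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DataLevel:
    """One preservation/access level as summarised on the slide (hypothetical structure)."""
    level: int
    content: str
    access: str


# Paraphrased from the slide; the wording of each entry is an assumption.
LEVELS = {
    1: DataLevel(1, "data from papers and supporting information",
                 "already public via HepData/INSPIRE"),
    2: DataLevel(2, "samples in simple formats for outreach and teaching",
                 "limited access supported"),
    3: DataLevel(3, "full reconstruction outputs for analysis",
                 "possibly after an embargo period, still to be decided"),
    4: DataLevel(4, "RAW data",
                 "preserved, not proposed for general use"),
}


def describe(level: int) -> str:
    """Return a one-line description of the given level."""
    d = LEVELS[level]
    return f"Level {d.level}: {d.content} ({d.access})"
```

Calling `describe(4)` returns a line beginning "Level 4: RAW data".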

Data Preservation Policies
- Data Preservation
  - There are DP policies implied in the Computing TDRs
  - Conserve all raw data during the lifetime of the experiment
  - All formats & code used for paper analyses to be archived
  - Tier 0/1s responsible for the physical preservation
- Some tacit belief that older sets may be ‘retired’
  - Retired data no longer to be on disk or under active analysis
  - This may need to be revised, e.g. if external access is then granted
  - Obvious resource implications
- First priority is to preserve data for active use by the collaboration

ATLAS DP Practical Steps
- Making sure raw data can be reprocessed long-term (Level 4)
  - Identifying key datasets for ‘unique data’ preservation
  - Setting up regular reprocessing and validation
  - This has been underway as a test case for the 2009 data, but progress is slow
  - Forward/backward compatibility issues illustrated in John Chapman’s talk on simulation release plans (14/3/13)
  - Ensure the capability to run old trigger selections offline
- AODfixing will help (reprocessing at analysis format level)
  - This means Level 4 operations can be applied to the Level 3 AOD format

Digesting validation results
- Must display the results of the validation in a comprehensible way: web-based interface
- The test must determine the nature of the results
  - Could be simple yes/no, plots, ROOT files, text files with keywords or length, ...
- Need for semi-automated, detailed physics validation
- David South is on ATLAS and was central to the DESY SP and DPHEP activities
  - Identify the useful common components
  - Identify the ATLAS-specific elements
  - Set up a CERN-based instance for ATLAS (and others?)
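As a rough illustration of reducing different kinds of test output to something a web page could display, here is a minimal sketch in Python. It is not the DESY SP or ATLAS validation code: the `Verdict` type, the function names and the keyword/length checks are assumptions chosen to mirror the "yes/no" and "text files with keywords or length" cases listed above.

```python
from dataclasses import dataclass
from typing import Iterable, List


@dataclass
class Verdict:
    """Digested outcome of one validation test (hypothetical structure)."""
    test: str
    passed: bool
    detail: str


def check_flag(test: str, ok: bool) -> Verdict:
    """Simple yes/no test."""
    return Verdict(test, ok, "yes" if ok else "no")


def check_text(test: str, text: str, required_keywords: Iterable[str],
               min_lines: int = 1) -> Verdict:
    """Text-file test: all keywords must be present and the file long enough."""
    lines = text.splitlines()
    missing = [k for k in required_keywords if k not in text]
    if missing:
        return Verdict(test, False, "missing keywords: " + ", ".join(missing))
    if len(lines) < min_lines:
        return Verdict(test, False,
                       f"only {len(lines)} lines, expected at least {min_lines}")
    return Verdict(test, True, f"{len(lines)} lines, all keywords found")


def summarise(verdicts: Iterable[Verdict]) -> str:
    """One line per test, followed by a pass count."""
    items: List[Verdict] = list(verdicts)
    rows = [f"{'PASS' if v.passed else 'FAIL'}  {v.test}: {v.detail}" for v in items]
    n_ok = sum(v.passed for v in items)
    rows.append(f"{n_ok}/{len(items)} tests passed")
    return "\n".join(rows)
```

Plots and ROOT files would need their own comparison step (for example against reference histograms), which this sketch leaves out.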

Existing open datasets
- The CB has authorized various datasets in (Level 2) outreach formats for open use in education/outreach
  - Event displays for interactive analysis (MINERVA/HYPATIA/LPPP/CAMELIA)
  - JIVE-XML and ROOT-format data
- Absolutely not intended for any serious analysis, but illustrative

ATLAS Zpath
- Master the invariant mass technique
  - to study and measure the (Z, J/ψ, Υ) decaying to l⁺l⁻
  - to search for new physics (Z′)
  - and the Higgs boson in γγ and l⁺l⁻l⁺l⁻
- HYPATIA using the ATLANTIS event display
- Data from 2011
  - Events: ~2.5 GB (password protected, 100 open)
  - 13 data groups/directories, 20 subgroups (A-T), and 50 events per mixed sample per 2 students
  - 50% Z, 30% γγ, 10% (J/ψ, Υ), 5% Z′, 5% l⁺l⁻l⁺l⁻
  - Higgs candidate events: 1 fb⁻¹ and cuts according to ATLAS publication
  - 125 GeV Higgs MC signals ready to upload (1 fb⁻¹, 10 fb⁻¹, 25 fb⁻¹)
- M(γγ) = 125 GeV; M(eeμμ) = 123 GeV
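The invariant mass technique can be illustrated with a short calculation. This is a minimal sketch in Python, not code from HYPATIA or ATLANTIS: the function names are hypothetical, leptons and photons are treated as massless unless a mass is given, and the kinematic values in the usage note are invented for illustration. It uses M² = (ΣE)² − |Σp|² in natural units (GeV).

```python
import math


def four_vector(pt, eta, phi, mass=0.0):
    """Build (E, px, py, pz) in GeV from transverse momentum, pseudorapidity and azimuth."""
    px = pt * math.cos(phi)
    py = pt * math.sin(phi)
    pz = pt * math.sinh(eta)
    energy = math.sqrt(px * px + py * py + pz * pz + mass * mass)
    return (energy, px, py, pz)


def invariant_mass(particles):
    """Invariant mass of a list of (E, px, py, pz) four-vectors."""
    energy = sum(p[0] for p in particles)
    px = sum(p[1] for p in particles)
    py = sum(p[2] for p in particles)
    pz = sum(p[3] for p in particles)
    m2 = energy * energy - px * px - py * py - pz * pz
    # Guard against tiny negative values caused by floating-point rounding.
    return math.sqrt(max(m2, 0.0))
```

For example, two back-to-back massless leptons with pT = 45.6 GeV at eta = 0, `invariant_mass([four_vector(45.6, 0.0, 0.0), four_vector(45.6, 0.0, math.pi)])`, give about 91.2 GeV, close to the Z mass. The same function applies to two photons or to four leptons.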

ATLAS Zpath tests
- OPloT:
  - M(ll) and/or M(γγ) and/or M(llll) to be discussed locally
  - Moderator: 1 slide with 3 invariant masses; invariant mass as a tool to identify particles, to discover new particles, and to search for exotic particles
- Web pages updated and measurement ready
  - org/exercises/ATLAS-2013/en/zpath.htm
  - Introduced Higgs
  - Described new measurements
  - Prepared material for instructors, moderators, for discussions, …

OPloT Tests 2013
- Higgs comments
  - 4l provided without requiring 2l from Z, with lower cut on other pair
  - γγ: provide MC with 125 GeV Higgs and background
  - Upload 125 GeV Higgs MC ((1) & 10 & 25 fb⁻¹)

ATLAS W-path with real WW (+H) events
- Measurements
  - W → lν
  - W⁺/W⁻ ratio
  - Angular distribution between leptons in WW events
- MINERVA program using the ATLANTIS event display
- 2011 real data: 693 WW/Higgs candidates (from released 1 fb⁻¹) mixed with 5307 W and other background events
- Histogram tool
  - Spreadsheet and histogram websites connected with database
- New measurement tested

ATLAS W-path
- Data from 2011, 1.1 fb⁻¹
  - … should be WW (w/o Higgs)
  - 160 should be ttbar or single top
  - 120 should be Z+jets
  - 50 should be W+jets
  - 15 should be from H → WW
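Two of the W-path numbers lend themselves to a quick calculation: the share of WW/Higgs candidates in the mixed sample, and a W⁺/W⁻ ratio with its statistical uncertainty. The sketch below is in Python and is not the MINERVA histogram tool. The function names are hypothetical, the counts 693 and 5307 are the ones quoted for the W-path sample, the W⁺ and W⁻ counts in the usage note are invented, and the error formula assumes independent Poisson counts.

```python
import math


def candidate_fraction(n_candidates, n_background):
    """Fraction of candidate events in the mixed sample."""
    return n_candidates / (n_candidates + n_background)


def charge_ratio(n_plus, n_minus):
    """Return (R, sigma_R) for R = N+/N-, assuming independent Poisson counts."""
    if n_plus <= 0 or n_minus <= 0:
        raise ValueError("both counts must be positive")
    ratio = n_plus / n_minus
    # Relative errors of the two counts add in quadrature.
    sigma = ratio * math.sqrt(1.0 / n_plus + 1.0 / n_minus)
    return ratio, sigma
```

With the quoted sample, `candidate_fraction(693, 5307)` is 693/6000, about 11.6%. With invented counts of 600 W⁺ and 400 W⁻, `charge_ratio(600, 400)` gives a ratio of 1.5 with an uncertainty of about 0.097.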

γγ or e⁺e⁻?
- Left: pT > 1 GeV; right: pT > 5 GeV
- 2 apparent tracks pointing to 2 calorimeter objects
- Zoom reveals 2 e⁺e⁻ pairs; … information
- The conclusion is that the 2 calorimeter objects correspond to 2 photons, which have converted and led to 4 tracks; the tracks from one pair had fewer than 3 pixel hits
- So, to be classified and entered as γγ

Level-2 observations
- The applications are all trying to illustrate the analyses and physics in the true context of a detector
- They use ATLANTIS as a presenter in most cases, which defines the natural common format
- Other formats would require an additional interface, to what benefit?
- Use case and resource justification for a common format not clear

ATLAS Analysis, Roger Jones

The Generic Analysis Flow

Level 3
- ATLAS has no approved Level 3 formats for external use, and any such release will require approval
- We are concerned that anything released be useful and not consume large amounts of collaboration effort (both in production and response)
- As such, tools like Recast are more attractive
  - The information incorporates the efficiency, acceptances and corrections, so is robust
  - It also helps meet the internal requirement of full documentation of analyses

Analysis Practical steps - RECAST
- Framework developed to extend impact of existing analyses
- Candidate for within-experiment and long-term analysis archival, encapsulating the full trigger & event selection, data, backgrounds, systematics (arXiv:)
- Allow an existing analysis to be reinterpreted under an alternate model hypothesis
- Complete information from original analysis, including the tacit information, contained in the data
- Not optimized for the new model, but more reliable than a naïve reanalysis?
- Recast seen as a very promising solution for preserving analyses and for useful, cost-effective preservation of information; addresses levels ~1 to ~3
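The idea of reinterpreting an existing analysis under a new model can be sketched as a counting exercise. This is a toy in Python and not the RECAST framework or its interface: the function names are hypothetical, all numbers in the usage note are invented, and the acceptance times efficiency for the new model is assumed to come from rerunning the archived selection on simulated signal events.

```python
def expected_signal(cross_section_fb, luminosity_fb_inv, acceptance_times_efficiency):
    """Expected signal events: N = sigma * L * (A * epsilon)."""
    return cross_section_fb * luminosity_fb_inv * acceptance_times_efficiency


def is_excluded(n_signal, n_upper_limit):
    """True if the new model predicts more events than the archived upper limit allows."""
    return n_signal > n_upper_limit
```

With an invented cross section of 100 fb, 1 fb⁻¹ of data and an acceptance times efficiency of 0.05, `expected_signal(100.0, 1.0, 0.05)` gives 5 events. If the original analysis had set an upper limit of 3 signal events, `is_excluded(5.0, 3.0)` returns `True`. A real reinterpretation would also carry the backgrounds and systematics of the original analysis.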