Realistic, large-scale MC production M. Moulson, 20 December 2002 Summary presentation for KLOE General Meeting Outline: Production proposal Refinements.

Slides:



Advertisements
Similar presentations
Status reports/Discussion items S. DellAgnello DC geometry review DC wire sag measurements P. de SimoneDC s-t relations with new sag model S. Miscetti.
Advertisements

Status reports Trigger simulationM. Palutan, B. Sciascia DC GeometryA. Antonelli, S. Dell’Agnello DC wire sag measurementS. Dell’Agnello, C. Forti DC s-t.
CBM Calorimeter System CBM collaboration meeting, October 2008 I.Korolko(ITEP, Moscow)
VETO Analysis Update Michael Wood University of Massachusetts, Amherst Outline Introduction and basics Reconstruction packages Efficiencies Simulation.
Recent Results on Radiative Kaon decays from NA48 and NA48/2. Silvia Goy López (for the NA48 and NA48/2 collaborations) Universitá degli Studi di Torino.
Status report on EMC simulation... Part 3 The plan is to:  SURVEY the existing situation over the runs - Validation of plots using  and.
Measurement of the absolute BR(K  +  -  + ) : an update Patrizia de Simone KLOE Kaon meeting – 21 May 2009.
Work finished M. Antonelli ISR and f decay simulation M. MoulsonBank-reduction code for DST’s M. Palutan et al.Trigger simulation parameters S. GiovannellaGEANFI.
Off-axis Simulations Peter Litchfield, Minnesota  What has been simulated?  Will the experiment work?  Can we choose a technology based on simulations?
Status of  b Scan Jianchun Wang Syracuse University Representing L b scanners CLEO Meeting 05/11/02.
MC Study on B°  J/  ° With J/      °     Jianchun Wang Syracuse University BTeV meeting 03/04/01.
GRD - Collimation Simulation with SIXTRACK - MIB WG - October 2005 LHC COLLIMATION SYSTEM STUDIES USING SIXTRACK Ralph Assmann, Stefano Redaelli, Guillaume.
PPR meeting - January 23, 2003 Andrea Dainese 1 TPC tracking parameterization: a useful tool for simulation studies with large statistics Motivation Implementation.
The Transverse detector is made of an array of 256 scintillating fibers coupled to Avalanche PhotoDiodes (APD). The small size of the fibers (5X5mm) results.
KLOE Offline Status Report Data Reconstruction MonteCarlo updates MonteCarlo production Computing Farm C.Bloise – May 28th 2003.
Outline: (1) The data sample (2) Some news on the analysis method (3) Efficiency revised (4) Background revised (5) Data: spectrum + “phi-curve”
Status reports M. Antonelli ISR and f decay simulation S. Dell’AgnelloDC geometry review P. de SimoneDC s-t relations with new sag model A. AntonelliDC.
Thomas Jefferson National Accelerator Facility Page 1 EC / PCAL ENERGY CALIBRATION Cole Smith UVA PCAL EC Outline Why 2 calorimeters? Requirements Using.
MC/Offline Planning M. Moulson Frascati, 21 March 2005 MC status Reconstruction coverage and data quality Running time estimates MC & reprocessing production.
LAV Software Status Emanuele Leonardi – Tommaso Spadaro Photon Veto WG meeting – 2015/03/24.
Progress on BR(K L  p + p - ) using a double-tag method A. Antonelli, M. Antonelli, M. Dreucci, M. Moulson CP working group meeting, 25 Feb 2003.
STATUS OF KLOE F. Bossi Capri, May KLOE SHUTDOWN ACTIVITIES  New interaction region  QCAL upgrades  New computing resources  Monte Carlo.
Kalanand Mishra April 27, Branching Ratio Measurements of Decays D 0  π - π + π 0, D 0  K - K + π 0 Relative to D 0  K - π + π 0 Giampiero Mancinelli,
E. De LuciaNeutral and Charged Kaon Meeting – 7 May 2007 Updates on BR(K +  π + π 0 ) E. De Lucia.
Track extrapolation to TOF with Kalman filter F. Pierella for the TOF-Offline Group INFN & Bologna University PPR Meeting, January 2003.
K  group activities K + K - retracking features New vs Old : resolution future planning. Conclusion and outlook K + K - retracking features New vs Old.
V.Patera – KLOE GM – Otranto – 10 June 2002 K  reconstruction status K + K - retracking features New vs Old : resolution New vs Old : efficiencies Conclusion.
Update on MC-04(05) –New interaction region –Attenuation lengths vs endcap module numbers –Regeneration and nuclear interaction –EMC time resolution –
Status of the KLOE experiment M. Moulson, for the KLOE collaboration LNF Scientific Committee 23 May 2002 Data taking in 2001 and 2002 Hardware and startup.
Status of K S   e analysis C. GattiT. Spadaro Selection on data Selection on MC Efficiencies from data, K L   e, K S    ,      , bhabha.
The KLOE computing environment Nuclear Science Symposium Portland, Oregon, USA 20 October 2003 M. Moulson – INFN/Frascati for the KLOE Collaboration.
Offline meeting Outcome of Scientific Committee Status of MC04 Proposal for a minimum bias EvCl sample Other offline activies: –Reprocessing –Dead wires.
 0  5  Outline Event selection & analysis Background rejection Efficiencies Mass spectrum Comparison data-MC Branching ratio evaluation Systematics.
2004 Fall JPS meeting (English version) K.Okada1 Measurement of prompt photon in sqrt(s)=200GeV pp collisions Kensuke Okada (RIKEN-BNL research center)
A proposed plan for fine tuning of EMC MC The plan is to :  SURVEY the existing situation over the runs - Validation of plots using  and.
Preliminary results for the BR(K S  M. Martini and S. Miscetti.
HEP Tel Aviv University LumiCal (pads design) Simulation Ronen Ingbir FCAL Simulation meeting, Zeuthen Tel Aviv University HEP experimental Group Collaboration.
Offline Status Report A. Antonelli Summary presentation for KLOE General Meeting Outline: Reprocessing status DST production Data Quality MC production.
Status of the hadronic cross section (small angle) Federico Nguyen February 22 nd 2005  the 2002 data sample and available MC sets  trigger efficiency.
00 Cooler CSB Direct or Extra Photons in d+d  0 Andrew Bacher for the CSB Cooler Collaboration ECT Trento, June 2005.
Preliminary Measurement of the Ke3 Form Factor f + (t) M. Antonelli, M. Dreucci, C. Gatti Introduction: Form Factor Parametrization Fitting Function and.
STAR Collaboration Meeting, BNL – march 2003 Alexandre A. P. Suaide Wayne State University Slide 1 EMC Update Update on EMC –Hardware installed and current.
STAR Analysis Meeting, BNL – oct 2002 Alexandre A. P. Suaide Wayne State University Slide 1 EMC update Status of EMC analysis –Calibration –Transverse.
Work finished M. Antonelli ISR and f decay simulation M. MoulsonBank-reduction code for DST’s.
Search for High-Mass Resonances in e + e - Jia Liu Madelyne Greene, Lana Muniz, Jane Nachtman Goal for the summer Searching for new particle Z’ --- a massive.
MC sign-off and running plans M. Moulson, 1 March 2004 Offline discussion Outline: Results of tests on new MC files Sign-off on production version What.
Jet + Isolated Photon Triple Differential Cross Section Nikolay Skachkov: “Photon2007”, Paris, 9-13 July 2007 DO Measurement of Triple Differential Photon.
The K L   0  0 decays in KLOE Introduction  pairing Discriminant variables K L   0  0 counting : 3,4,5  ’s samples Control samples for efficiencies.
Tuning of trigger simulation parameters (Bossi, Moulson, Palutan, Sciascia) Calorimeter trigger thresholds calibration using prompt photons from K S 
Status report on analysis of BR(K S  p + p - p 0 ) A. Antonelli, M. Moulson, Second KLOE Physics Workshop, Otranto, June 2002.
1 Reminder: Why reprocess? data Reconstruction: DBV DSTs: DBV data DBV-19 for reconstruction & DSTs, Run < DBV-20 for.
Paolo Massarotti Kaon meeting March 2007  ±  X    X  Time measurement use neutral vertex only in order to obtain a completely independent.
Tau31 Tracking Efficiency at BaBar Ian Nugent UNIVERSITY OF VICTORIA Sept 2005 Outline Introduction  Decays Efficiency Charge Asymmetry Pt Dependence.
DST data structures: Status report M. Moulson, CP working group 25 October 2001.
M. Martemianov, ITEP, October 2003 Analysis of ratio BR(K     0 )/BR(K    ) M. Martemianov V. Kulikov Motivation Selection and cuts Trigger efficiency.
Extrapolation Techniques  Four different techniques have been used to extrapolate near detector data to the far detector to predict the neutrino energy.
Status Report on Data Reconstruction May 2002 C.Bloise Results of the study of the reconstructed events in year 2001 Data Reprocessing in Y2002 DST production.
Status of the measurement of K L lifetime - Data sample (old): ~ 440 pb -1 ( ) - MC sample: ~125 pb -1 ( mk0 stream ) Selection: standard tag (|
June 4, 2009 STAR TPC review Estimation of TPC Aging Based on dE/dx Measurements Yuri Fisyak.
Update on the false-dead DC wire problem
Search for A’ from p0  A’ g
analisys: Systematics checks
MC ’04-’05 objectives                                                       
Large scale MC production II
Tracking results from Au+Au test Beam
Charles F. Maguire Vanderbilt University
TPC status - Offline Q&A
KLOE General Meeting - LNF March,
p0 life time analysis: general method, updates and preliminary result
° status report analysis details: overview; “where we are”; plans: before finalizing result.. I.Larin 02/13/2009.
Presentation transcript:

Realistic, large-scale MC production M. Moulson, 20 December 2002 Summary presentation for KLOE General Meeting Outline: Production proposal Refinements to GEANFI Background insertion MC DST’s Production logistics

General proposal for MC production 500 pb -1 of K S   all, K L  all (about 500M events) Can generated and reconstructed in under 2 months if done efficiently on IBM farm Useful for background studies for all K S K L analyses and for K S K L - contributed background in all other analyses Prototype for future production campaigns: K + K -, f  all Goal: Best possible reproduction of time-variable conditions State-of-the-art simulation of the detector Both in generation and reconstruction Run-variable input for  s, p f, x f, trigger thresholds, dead channels, etc. Realistic machine background, obtained from data run-by-run Output in the form of MC DST’s

New f -decay generator Improved radiator function with explicit calculation of matrix elements for all processes with 1 or 2 radiated photons Photons from both beams ISR photon tracked by GEANT Sampling of  s now precedes choice of f decay mode Probabilities for different final states now depend on  s Improved f cross sections: Correct treatment of phase space w and r terms in 3 p cross section s( e + e -  p + p - p 0 ) dN/dM( p + p - p 0 );  s = m f w f

Simulation of EmC trigger Careful tuning of effective thresholds in TSKT (cluster energy needed to fire a sector) Data MC z position on barrel (cm) Effective threshold (MeV) g ’s from K S  p 0 p 0 on barrel Photons from K S  p 0 p 0 Variation of effective threshold with position well within 10% all over detector Somewhat better on barrel Pions from K S  p + p - Variation at same level as for g ’s Effective threshold for MC few % less than for data Implications for EmC trigger efficiency estimates from MC: D E th (overall) = 5% means K S  p 0 p 0 De = 0.5% K S  p + p - De = 1%

Simulation of DC trigger T2D threshold in TSKT tuned based on effective number of reconstructed DC hits For MC: background from gg events corresponding to data-taking periods added using INSERT RMS variation of effective threshold over 10 intervals (1 pb -1 ) from : K + K reconstructed hits K S  p + p reconstructed hits Implications for DC trigger efficiency estimates from MC: K S  p + p - ( De)/e = 0.7% K S K L ( De)/e = 1.3% K + K - ( De)/e = 2.4% p + p - p 0 ( De)/e = 3.2% DC hits T2D efficiency DC hits + Data + MC K+K-K+K- KS  p+p-KS  p+p- early 2002

Survey MC/data differences in runs Make diagnostic plots using p + p - p 0 and e + e -  e + e - g samples: Energy response and resolution, as function of position Timing response and resolution Efficiencies Splitting and shower fragments Implement tools to adjust MC response Probably at reconstruction level: Adjustment of energy response in CLUFIXENE Threshold simulation Simulate “holes” in EmC response Minimum effort for maximum result: Start with large effects, see how far we get.... Planned work on EmC response in MC

Comprehensive DC geometry review Beam pipe Inner wall Correctly simulated according to nominal understanding Continue to study with multiple scattering, dE/dx (conversions?) Inner wall treated as equivalent thickness of CF in reconstruction Outer wall Originally simulated as 0.4 cm CF 2 × 20 m m Al plating and CF struts recently added Endplates Geometry correctly simulated Adding 50 m m Cu plating to simulate FEE Other material IR support “legs” added to simulation Material at borders of endplates not added: Ti screws, gas feedthroughs, DC “feet” Not limiting factor in understanding EmC response at endcap/barrel interface Global shift IP/DC/EmC currently simulated as coaxial DC shifted by D y = - 1 cm in real life EmC shifted by D y = to –0.7 cm in real life Studying feasibility of including these offsets

MC representation of wire sag  W wires Al wires Layer (stringing convention) Wire sag ( m m) Wire sag on most internal layers of DC much larger (400 m m) than on other layers (250 m m) In generation: Constant sag of 250 m m assumed s-t relations from GARFIELD In reconstruction: Sag not taken into account at all s-t relations for data reflect crosstalk between bins in f s-t relations for MC from GARFIELD Causes characteristic distortion of track momenta: want to reproduce in MC Generation: Wire sag adjusted to measured value on each layer s-t relations from GARFIELD Reconstruction: Ignore sag, just as in data reconstruction s-t relations calibrated using MC cosmic rays Wire sag will be taken into account in future reconstruction

Wire sag and momentum reconstruction f (deg) e - from Bhabha events, q < 40° e + from Bhabha events, q > 140° Generated with 250 m m sag, all layers Generated with zero sag, all layers p (MeV)

Background insertion: principles Previously existing tools for inserting background: ACCELE: EmC clusters MBCKADD: DC hits Both feature selection and insertion phases (modules) Objectives: Complete simulation of background in physical event Interplay of hit blocking, t 0 corruption, etc. Insert background for both EmC and DC simultaneously Unified selection of background events Single output file in standard format: compressed YBOS

A/C module for background insertion New A/C module: INSERT Opens background file Reads events from the file into secondary YBOS array, reusing events appropriately Decompresses (unzips) events in secondary array Gracefully handles EOF of the secondary file Next step: open/read from secondary file using KID Straightforward, just drop-in subroutine replacements. Handled by INSERT at present: LRID copied from “background” file into BRID in the “MC” file DC hits in DTCE in “background” file inserted into “MC” event

Background event selection Background obtained from recognized physics events and inserted in simulated physics events  Sampled evenly as a function of integrated luminosity Event type should be: Relatively abundant Easily identified Separable from background in DC and EmC Isolation easier in EmC: prefer neutral events Use gg events (solution common to ACCELE and MBCKADD) Event selection and cluster isolation are closely related problems

Two clusters in barrel: D t < min(5 s t, 1 ns) E cl > 450 MeV E tot > 900 MeV | Df | > 179° | S z| < 10 cm Additionally: R 12 > 100 cm R 12 < 400 cm (eliminates splits) Selection criteria for background events D R vs D t, all cluster pairs Reflection/splits ggg Splits Fragments

Isolation of background clusters Sideband 1 Center Close 1 Close 2 Sideband 2 T – R/c (ns) N acc = 1 N acc > 1 Excess counts near D t = 0 confined to case with only 1 accidental cluster Previous studies have shown that activity in sidebands 1, 2 has same distributions of E, q, etc. Use sidebands to study excess near D t = 0 Normalize to width of D t interval

Analysis of “in-time” clusters dN/dE N acc = 1 MeV dN/dE N acc > 1 dN/d(cos q ) N acc = 1 dN/d(cos q ) N acc > 1

Reproduction of background distributions Excess clusters at D t  0 for N acc = 1 presumably from ggg and cluster fragments: Obtain ratio of counts in central region/sidebands as function of cos q, E Downscale selection of events with N acc = 1 according to this ratio Accidental cluster multiplicity reproduced fairly well after this correction dN/dE, N acc = 1dN/d(cos q), N acc = 1Multiplicity (N acc ) MeV

Event selection/EmC background: status Currently have: Stable event selection criteria Statistical separation of accidental clusters/clusters correlated with gg Need to implement: Event selection module featuring removal of gg clusters on output Output of DC hits requires no additional work Cluster superposition code in INSERT CELE times and energies need to be adjusted for differences between v fib, L att in MC and data

DC background insertion 1.Read non t 0 -subtracted, non hot-suppressed DTCE on secondary array 2.Read T0GL on secondary array 3.Perform t 0 subtraction, keeping hits with negative times 4.Intercalate MC and background hits: Keep hit with earlier time when two hits overlap 5.Suppress negative hits only at end Sign of drift distance unusable: SMEAR_T0 and DCONVR assume sign carries L/R info Negative times not allowed to fluctuate positive because of TSKT Treatment of hot/dead channels needs refinement: Suppress dead channels in INSERT: TSKT shouldn’t see them Suppress hot channels in separate A/C module after TSKT: TSKT sees everything and applies its own hot-channel vetoes

Raw s-t relations for MC and data Raw s-t relations for MC/data differ by m m at 1-2 cm from wire Background hits not reconstructed with same radii when inserted Implications for reconstruction efficiency/quality for inserted tracks? Probable solution: Adjust “data” times to “MC” times in INSERT Problem: can be done for either raw or fine s-t relations, but not both Only substantial issue remaining to be addressed for DC insertion

Effect of different s-t relations MC events: KS, KL neutrals All hit banks dropped “Background” events: Events in bha stream At least 2 tracks with >20 hits each Study reproducibility of track reconstruction when inserting tracks into MC events without hits by visual scan of 100 events (200 tracks): Intact reconstruction Perfect164 Split reproduced9 Different reconstruction New split16 Split recovered5 Badly reconstructed1 Lost1 About 90% reproducible, with few % excess of split tracks and small losses (allowing cancellation of new/recovered splits)

MC DST’s: development principles MC DST’s produced from.mcr output Reconstruction bankset same as for data DST’s MC-truth bankset highly compressed Most variables for PROD2NTU structures precalculated/stored Most MC hit banks and related link banks dropped Code for creating new MC DST banks in MCT library New KLOE offline library; also contains insert module Banks defined with header files and descriptions in $K_IMCT Bank structures must accommodate presence of background hits Existing code in TLS (PROD2NTU) must work out of the box

MC DST’s: banks currently present Headers, etc.LRID HEAD EVCL BRIN RUNG MC truthKINE VERT t 0 -relatedT0MC T0GL TriggerTDST CTRG EmC recon.CLPS CLLS CSPS EmC truthCFHI CEKA CEKE QCALQCAE QWRK QCKA DC recon. DTFS DVFS DC truthMDKI MDTF MDCN TCATCLO Event class.ECLS ECLO VNVO INVO KNVO

MC DST banks for tracking MKIN: MC details for KINE tracks (20 words per KINE) Number of DHIT hits and layer crossings; inner/outermost layer x, p at first/last DHIT hits Path length and TOF MDTF: MC truth for DTFS tracks (28 words per DTFS) Indices of 3 main KINE contributors; number of hits contributed Index of KINE at first/last DTFS hit; layer, x, p for first/last hit Layer, x, p for first/last hit from majority KINE contributor MDCN: MC DC hit count summary bank (10 words/event) Substitutes DCNH for MC DST’s Counters for DHIT and DHRE hits, hits used by PR/TF Itemized by small/big cell; generated/background hits

MC DST banks for trigger and EmC CEKACELE Number of KINE contributors Total energy of KINE contributors KINE contrib #1Energy from KINE #1 KINE contrib #2Energy from KINE #2 …… Number of KINE contributors Total energy of KINE contributors EmC: MC information relatively compact; only discussion is over fate of CHIT CEKE bank created as possible alternative: Composition and weight of KINE contributions to cluster elements DC banks give similar composition for tracks Trigger: Format of TDST bank same for MC/data DST’s TORTA word: L1 type (EmC/DC/both), LET/cosmic multiplicities E/B/W, cosmic veto flag T1C, T1D, and T2D times Number of L2 DC hits Injection and fiducial clock signals not filled in MC DST’s

MC DST’s: status and size estimate Output size estimate: 1000 K S  all, K L  all events Generated and reconstructed on AIX w/ standard path.mcr23.9 MB (i.e., KB/evt).dst4.1 MB Very close to a final figure, to compare with 6 KB/evt pessimistically estimated last time 500 pb -1 = 500 M evts  2 TB Variations: Standard w/CHIT instead of CEKE4.7 MB (KB/evt) Standard + QIHI4.4 MB Standard + QIHI and CEKE  CHIT4.9 MB

Proposal for production scheme Production scheme must satisfy two important criteria: 1.Run-variable conditions must be correctly time-averaged over data set 2.MC output must be able to be divided up while maintaining relevance to a particular set or runs (or standard data set must be defined) Both satisfied by generating MC files corresponding to actual runs  s, p f, dead channels, etc. known by run number: Presumed to vary slowly with time Run-by-run generation handles averaging over data set: Background levels highly variable within any given run Important to time-average background correctly when inserting

Background sampling Physics events generated with definite cross section f  K S K L events, s  1050 nb Background also taken from events with definite cross section Recognized gg events, s  30 nb Insert each background event into fixed number of MC events If gg selection efficiency not significantly dependent on background, obtain temporal profile for background that matches data Need to simulate chunks of runs for which  L dt available f  K S K L events : max MC file size evts  25 nb - 1 Run size in data 20  200 nb - 1 Raw file size: 2-6 nb -1 in 2001, 6-15 nb -1 in 2002 (have  L dt!) Generate MC files corresponding to raw files?

Production scheme: examples KSKLKSKL f  all KS  p+p-KL  p+p-KS  p+p-KL  p+p- s (nb) Max evts/1GB file Max L per file (nb -1 ) Files/200 nb -1 run8 full length20 full length (1 file ~320 evts) If 15 raw files of ~13 nb -1 each… OK 15 files ~55% full Must split MC files 30 files ~50% full 15 files ~20 evts each Must group raws Background reuse factor  50 (raw files used twice) To split MC files across raw files: background from entire raw file used for each corresponding MC file; reuse factor adjusted accordingly

Requirements on DB2 Background can be treated as a datarec stream New DB requirements for MC runs/files: Runs are generated for each raw file in data set Additional complications from grouping raw files/splitting MC files New DB2 tables in logger schema for official MC production Link MC runs with background files used for reconstruction New tables only supplement existing tables  Fully backward compatible Note: MC run number will not correspond to simulated physical run number Correspondence will be available from database

Database modifications logger.mc_runs One entry per MC run logger.mc_runs_raws One entry per MC run and background file MCCard_ID, MCRun_Nr Primary keys identifying MC run Bkg_Run_Nr, Bkg_Version, Bkg_Offline_ID, Bkg_Datarec_Nr, Bkg_Stream_ID, Bkg_GB_Nr Primary keys of associated background, can be used to index: logger.datarec_logger logger.datarec_raws  logger.raw_logger Example of new table to extend information in logger.mc_runs: Create views e.g., to allow MC files to be selected by physical run number

Complete production flowchart gg selector bha bkg p f, x f,  s,  L.mcr.mco s-t N(bkg) write.uicDB access GEANFI INSERT datarec.dst cards Loop over good raw files in DB

Combine or separate neutral kaon runs? Combined production K S  all, K L  all Separate production K S  p 0 p 0, K L  all K S  p + p -, K L  all Differentiated by K L decay in DC? Combined generation K S  all, K L  all Streaming to dst by MC truth Simpler to produceSimpler to analyze Simple to produce and analyze (if no reprocessing) Fewer files (if file length unsaturated) Smaller files Less disk turnover? (if people cooperate) Less disk turnover? (if event subset dominates interest) Less disk turnover? (if event subset dominates interest) Lighter disk access (if event subset dominates interest) Lighter disk access (if event subset dominates interest) No need to prioritizePossible to prioritizeNo need to prioritize Naturally treats rare channels Rare channels treated well in generation Problems with zero-length files Well-suited for background studies (rare K S decays, non- K S K L physics) Acceptable compromise for background studies (mechanically more running, total volume and content of data set unchanged)