Web Data Quality Analysis 1 WEB DATA QUALITY ANALYSIS Jonathan Levell (Summer Student) Working with: Eric van Herwijnen (EvH) Agnieszka Jacholkowska (AJ)
Web Data Quality Analysis 2 TALK OUTLINE Introduction How is the Data Quality Checking done? What is SICBCHK? What is WDQA? Results Typical Data CERN/MAP Summary
Web Data Quality Analysis 3 Check Data Quality (DQ): During MC and DST production of test data After each detector modification Produce a standalone DQ program which calls checking routines supplied by subdetector groups Provide WDQA package to display and compare sets of histograms for each subdetector
Web Data Quality Analysis 4 SICBMC SICBDST SICBCHK WDQA RAWH DST1/DST2 PostScriptGIFS NTUPLE HTML
Web Data Quality Analysis 5 Fortran Program Separate from SICBMC/SICBDST (package called: SICBCHK) Split into branches for each subdetector Branches maintained individually Latest version installed in September
Web Data Quality Analysis 6 * = JL leaves on 15th September
Web Data Quality Analysis 7 SICBCHK.F UGINIT.FCHKRUN.FUGLAST.F CHKTRIG.F SUANAL.F For each event Initialisation Tidy Up SUCHECK.F Subdetector Routines Miscellaneous Checks
Web Data Quality Analysis 8 Package controlled by.lmac scripts similar to PAW’s.kumac scripts Small Interpreter written in C++ Based on ROOT Converts ntuple files to ROOT files LMAC scripts can call ROOT macros Easy to use Manual currently being written Eventually details in Computing meeting
Web Data Quality Analysis 9 // Conversion to ROOT files. hbookin ~/work/data/ >.hbook file 1 hbookin ~/work/data2/ >.hbook opt nowait outputdir velo ps velo.ps outputdir velo/gifs settitle Event Type: > (Vertex) zone 1 2 set style default plot 1001 set ymax 250 set fillcolor 5 file 1 overlay 1001 // Call standard ROOT macro rootpad fit2gauss.c gifs page1 wait ps close close file 1 close An Example.LMAC script
Web Data Quality Analysis 10
Web Data Quality Analysis 11
Web Data Quality Analysis 12
Web Data Quality Analysis 13
Web Data Quality Analysis 14
Web Data Quality Analysis 15
Web Data Quality Analysis 16
Web Data Quality Analysis 17
Web Data Quality Analysis 18
Web Data Quality Analysis 19
Web Data Quality Analysis 20
Web Data Quality Analysis 21
Web Data Quality Analysis 22
Web Data Quality Analysis 23
Web Data Quality Analysis 24 Are the differences significant?
Web Data Quality Analysis 25 Bigger sample required before real conclusions can be drawn. Differences currently at 3 level Larger sample (500 events) will be requested, if possible with same random seeds. SICBCHK/WDQA still require more work but a new version will be released in mid-September