I/O and Metadata Jack Cranshaw Argonne National Laboratory November 9, 2015 1 ATLAS Core Software TIM.

Slides:



Advertisements
Similar presentations
P5, M1, D1.
Advertisements

Query Execution, Concluded Zachary G. Ives University of Pennsylvania CIS 550 – Database & Information Systems November 18, 2003 Some slide content may.
ITIL: Service Transition
Software Configuration Management Donna Albino LIS489, December 3, 2014.
File Management Chapter 12. File Management File management system is considered part of the operating system Input to applications is by means of a file.
D. Düllmann - IT/DB LCG - POOL Project1 POOL Release Plan for 2003 Dirk Düllmann LCG Application Area Meeting, 5 th March 2003.
Reconstruction and Analysis on Demand: A Success Story Christopher D. Jones Cornell University, USA.
Components and Architecture CS 543 – Data Warehousing.
Usage of the Python Programming Language in the CMS Experiment Rick Wilkinson (Caltech), Benedikt Hegner (CERN) On behalf of CMS Offline & Computing 1.
Implementation/Acceptance Testing / 1 Implementation and Acceptance Testing Physical Implementation Criteria: 1. Data availability 2. Data reliability.
Software Testing Prasad G.
Automated Tests in NICOS Nightly Control System Alexander Undrus Brookhaven National Laboratory, Upton, NY Software testing is a difficult, time-consuming.
Configuration Management Process and Environment MACS Review 1 February 5th, 2010 Roland Moser PR a-RMO, February 5 th, 2010 R. Moser 1 R. Gutleber.
Introduction to Systems Analysis and Design Trisha Cummings.
MC, REPROCESSING, TRAINS EXPERIENCE FROM DATA PROCESSING.
Module 3: Table Selection
XAOD persistence considerations: size, versioning, schema evolution Joint US ATLAS Software and Computing / Physics Support Meeting Peter van Gemmeren.
Wahid Bhimji University of Edinburgh J. Cranshaw, P. van Gemmeren, D. Malon, R. D. Schaffer, and I. Vukotic On behalf of the ATLAS collaboration CHEP 2012.
15-740/ Oct. 17, 2012 Stefan Muller.  Problem: Software is buggy!  More specific problem: Want to make sure software doesn’t have bad property.
Tracking The Problem  By Aaron Jackson. What’s a Problem?  A suspicious or unwanted behavior in a program  Not all problems are errors as some perceived.
The Pipeline Processing Framework LSST Applications Meeting IPAC Feb. 19, 2008 Raymond Plante National Center for Supercomputing Applications.
Event Data History David Adams BNL Atlas Software Week December 2001.
ATLAS, U.S. ATLAS, and Databases David Malon Argonne National Laboratory DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National Laboratory.
Recent Software Issues L3 Review of SM Software, 28 Oct Recent Software Issues Occasional runs had large numbers of single-event files. INIT message.
Pre-OTS Testing in Penticton Sonja Vrcic Socorro, December 11, 2007.
File Management Chapter 12. File Management File management system is considered part of the operating system Input to applications is by means of a file.
Introduction Advantages/ disadvantages Code examples Speed Summary Running on the AOD Analysis Platforms 1/11/2007 Andrew Mehta.
ATLAS Data Challenges US ATLAS Physics & Computing ANL October 30th 2001 Gilbert Poulard CERN EP-ATC.
Toward a Next-generation I/O Framework David Malon, Jack Cranshaw, Peter van Gemmeren, Marcin Nowak, Alexandre Vaniachine US ATLAS Technical Planning Meeting.
QA and Testing. QA Activity Processes monitoring Standards compliance monitoring Software testing Infrastructure testing Documentation testing Usability.
AMB HW LOW LEVEL SIMULATION VS HW OUTPUT G. Volpi, INFN Pisa.
Retail Transaction Processing Year End Review and Recent Issues RMS January 2007.
Debugging of # P. Hristov 04/03/2013. Introduction Difficult problem – The behavior is “random” and depends on the “history” – The debugger doesn’t.
David Adams ATLAS DIAL: Distributed Interactive Analysis of Large datasets David Adams BNL August 5, 2002 BNL OMEGA talk.
I/O Infrastructure Support and Development David Malon ATLAS Software Technical Interchange Meeting 9 November 2015.
CMap Version 0.16 Ben Faga. CMap CMap Version 0.16 Bug fixes and code optimizations More intuitive menu system Asynchronous loading of comparative map.
LCIO A persistency framework and data model for the linear collider CHEP 04, Interlaken Core Software, Wednesday Frank Gaede, DESY -IT-
Integration of the ATLAS Tag Database with Data Management and Analysis Components Caitriana Nicholson University of Glasgow 3 rd September 2007 CHEP,
MVC WITH CODEIGNITER Presented By Bhanu Priya.
1 SICBDST and Brunel Migration status and plans. 2 Migration Step 1: SICBMC/SICBDST split  Last LHCb week: Split done but not tested  Software week.
ATA/SE-SQM/OM-4/Workshop 1.PPT/V3.1 Slide 1 of 105 © 2010 NUS. All rights reserved. Master of Technology in Software Engineering Workshop 1: Content Management.
Peter W. PhillipsATLAS SCT Week, CERN, September/October 2002 Electrical Tests of SCT modules using RODs Peter W Phillips Rutherford Appleton Laboratory.
Grid Technology CERN IT Department CH-1211 Geneva 23 Switzerland t DBCF GT Upcoming Features and Roadmap Ricardo Rocha ( on behalf of the.
Andrea Valassi (CERN IT-DB)CHEP 2004 Poster Session (Thursday, 30 September 2004) 1 HARP DATA AND SOFTWARE MIGRATION FROM TO ORACLE Authors: A.Valassi,
I/O Strategies for Multicore Processing in ATLAS P van Gemmeren 1, S Binet 2, P Calafiura 3, W Lavrijsen 3, D Malon 1 and V Tsulaia 3 on behalf of the.
TAGS in the Analysis Model Jack Cranshaw, Argonne National Lab September 10, 2009.
ATLAS Distributed Computing perspectives for Run-2 Simone Campana CERN-IT/SDC on behalf of ADC.
&. & DNS and IPv6 IPv6 Summit, Canberra 31st October & 1 st November 2005 Chris Wright, Chief Technology Officer &
ELSSISuite Services QIZHI ZHANG Argonne National Laboratory on behalf of the TAG developers group ATLAS Software and Computing Week, 4~8 April, 2011.
Bogor-Java Environment for Eclipse MSE Presentation III Yong Peng.
Event Management. EMU Graham Heyes April Overview Background Requirements Solution Status.
Finding Data in ATLAS. May 22, 2009Jack Cranshaw (ANL)2 Starting Point Questions What is the latest reprocessing of cosmics? Are there are any AOD produced.
Next-Generation Navigational Infrastructure and the ATLAS Event Store Abstract: The ATLAS event store employs a persistence framework with extensive navigational.
IceCube DAQ Mtg. 10,28-30 IceCube DAQ: Implementation Plan.
I/O aspects for parallel event processing frameworks Workshop on Concurrency in the many-Cores Era Peter van Gemmeren (Argonne/ATLAS)
11th September 2002Tim Adye1 BaBar Experience Tim Adye Rutherford Appleton Laboratory PPNCG Meeting Brighton 11 th September 2002.
111 State Management Beginning ASP.NET in C# and VB Chapter 4 Pages
Open Science Grid Configuring RSV OSG Resource & Service Validation Thomas Wang Grid Operations Center (OSG-GOC) Indiana University.
Mini-Workshop on multi-core joint project Peter van Gemmeren (ANL) I/O challenges for HEP applications on multi-core processors An ATLAS Perspective.
Multi Process I/O Peter Van Gemmeren (Argonne National Laboratory (US))
PerfSONAR Schema and Topology Martin Swany. Schema Key Goals: Extensibility, Normalization, Readability Break representation of performance measurements.
Athena I/O Component Refactorization - Overview Random notes about todays agenda. Peter Van Gemmeren (Argonne National Laboratory (US))
SOFTWARE TESTING TRAINING TOOLS SUPPORT FOR SOFTWARE TESTING Chapter 6 immaculateres 1.
ITIL: Service Transition
Root I/O and the Gaudi Framework
Elizabeth Gallas - Oxford ADC Weekly September 13, 2011
Metadata Status and future
Detailed Release Note Change Re-baseline UMIG XSDs
Detailed Release Note Change Re-baseline UMIG XSDs
Planning next release of GAUDI
Presentation transcript:

I/O and Metadata Jack Cranshaw Argonne National Laboratory November 9, ATLAS Core Software TIM

Outline Review Bookkeeping Issues Event Service issues Analysis support issues Merging issues Comments and consequences for future frameworks November 9, ATLAS Core Software TIM

Metadata Infrastructure Metadata is processed in athena but in addition to the framework state transitions it listens to Incidents fired by the I/O framework such as file boundaries, etc. This infrastructure uses Tools which listen to these Incidents and prepare output metadata in the output metadata store. A similar system of stores and incidents exists in some analysis frameworks allowing us to build dual-use tools. November 9, ATLAS Core Software TIM

Incident-driven metadata infrastructure ATLAS Core Software TIM 4 November 9, 2015 Input File 1Input File N Output File 1 Input Store MetaData Tools - Summarize metadata MetaData Service - Handle stores - Setup Tools retrieve Metadata Store record/ retrieve Input File 2 flush load metadata proxies … write 1) BeginInputFile incident 2) EndInputFile incident Event Selector CHEP2015 talk

In-file Metadata Serves as a cache and a propagation tool. Stored in a MetaDataTree in the file. There is a MetaData DataHeader which sets up proxies and loads addresses. Metadata Itemlist Cache IOV conditions Trigger menus Configuration parameters Propagation Luminosity bookkeeping Event bookkeeping November 9, 2015ATLAS Core Software TIM 5

Problems with Lumi Accounting Updated xAOD version in production since Spring. After data taking started, it was noticed that all lumiblocks were being labeled as Suspect. This was fixed in the associated metadata tool in July. Problem of Suspect lumiblocks resurfaces in October due to AthenaMP. Bug in incident firing confused tool. Fixed. The tool was designed to read the number of input events from the input file. In a multi-process environment this leads to multiplication of the number during merging and consequently the lumiblocks are again labeled all as Suspect. Detailed in ATEAM-235 Scope still under investigation. Can probably be fixed in tool, but points at some possible architectural issues (FF).ATEAM-235 November 9, ATLAS Core Software TIM

CutBookkeepers Issues There was a problem that was spotted in September where CutBookkeepers were double counting events in derivations. This was a bug in the way input streams and cycles were used, not a duplicate events issue. Fix detailed in ATLASG- 242.ATLASG- 242 Because CutBookkeepers count events in each worker, the counting problem with AthenaMP does not affect them AFAICT. I am migrating functionality from CutFlowSvc to a tool to both fit the Athena metadata I/O better and to have shared dual-use tools with the analysis frameworks. More in Analysis slide. November 9, ATLAS Core Software TIM

Metadata for Analysis I operate as the interface between Core and Train Coordination. Many of the issues have been metadata related. Previously mentioned Bookkeeper issues. New metadata needed by analysis Root-readable versions of some IOV metadata Simple name value pairs, standing jira ticket (ATLASG-173) for requests.ATLASG-173 New uses for CutBookkeepers: PDFSumOfWeights Tracked in ATLASG-308ATLASG-308 Time critical for December reprocessing Workaround in place, developing longer term solution, e.g. multiple cutbookkeeper tools. Extensibility is becoming more of an issue. Physicists like to store simple Root objects and just use hadd for merging. November 9, ATLAS Core Software TIM

Merging Our default for merging is full athena merge. Metadata is merged using metadata tools attached to the MetaDataSvc and listening to Incidents. General problem: Metadata can be dropped during merging. Depends on metadata item list from release not from input. ATEAM-145ATEAM-145 Tools may not recognize all input metadata. Generally a silent error. No validation. Root-level (fast) merging Metadata merging is not normally just a stacking process. Nevertheless, in Run 1, we were able to use a hybrid merge where the event data was simply stacked using Root while the metadata was merged using athena. Note: This is available for Run 2, but the event data output would not be optimized for reading in the same way as the direct athena output. If we deferred the actual union of the metadata, then we could stack both event and metadata when merging, and delay the union/merge to when the data is read. Could be two stages of merge as well. Should we have tools doing the merging or should the objects know how to merge themselves? Tools are easier to evolve and specialize. November 9, ATLAS Core Software TIM

Pointing to Conditions Data As noted on the Analysis slide, physicists like to add simple metadata, histograms, etc. which is root native and could by itself be ‘hadded’ but does not necessarily play well with athena. Two differences between analysis and core show up. One of them is that in analysis the events themselves can act as a file-level IOV database using pointers in the events, whereas in athena conditions we keep a separate IOVDb. Both have pro’s. Events are more independent with pointers. IOV means not checking every event. No planned changes. Make IOV content root-readable. One could think of sharing data in these frameworks if the payload in the IOV containers was root readable and addressable. Look at doing this for attribute list first. November 9, ATLAS Core Software TIM

Incidents We have many incidents and many classes which fire incidents. The IncidentSvc doesn’t report who fires, who listens, or whether an incident was missed or out of order. Discussion. November 9, ATLAS Core Software TIM