SAM Overview (training session) for CDF Users Doug Benjamin Duke University Krzysztof Genser Fermilab/CD.



12/16/2005, SAM Overview for CDF Users

Why SAM?
- Why should CDF change its data handling model and methods? There is no real choice.
- CDF has lost, and is still losing, the human resources required to maintain its own data handling system (the Data File Catalog, DFC).
- The Computing Division is combining efforts to support the Tevatron experiments (CDF/D0).

Some SAM History
- SAM was designed and written as a joint project between D0 and the Computing Division.
- D0 has been using it for many years, for:
  - Monte Carlo submission and scheduling (not a feature used in CDF)
  - Data handling
    - Metadata catalogue (where to find the data)
    - Data delivery (SAM delivers files to users)
    - Monitoring/tracking the use of files through projects
- CDF started work on SAM some time ago.
- Several offsite locations have been using it for a few years now.

CDF
- CDF personnel who have worked on SAM integration and deployment: Valeria Bartsch, Doug Benjamin, Morag Burgon-Lyon, Gabriele Compostella, Armando Fella, Krzysztof Genser, Randolph Herber, Suen Hou, Tsan Hsieh, Shih-Chieh Hsu, Elliot Lipeles, Donatella Lucchesi, Ulrich Kerzel, Art Kreymer, Thomas Kuhr, Matt Norman, Fedor Ratnikov, Aidan Robson, Igor Sfiligoi, Stefan Stonjek, Alan Sill, Rick St. Denis, Bernd Stelzer, … (please let us know if we missed anyone)
- The SAM Users Committee: David Dagenhart (editor), Ray Culbertson, Kenichi Hatakeyama, Daniel Whiteson, Konstantin Anikeev, Matt Herndon
- The SAM Team has done a lot of work to accommodate CDF requests.

CDF Support Model
- Each physics group has a power user who will help the members of the group.
- A wiki (Pasha's tiki) is used for internal documentation. Power users are able to make changes…
- Rick St. Denis assists in answering the questions that the power users can't answer.
- Doug Benjamin and Krzysztof Genser act as contacts between CDF and the CD SAM team (as per the agreement with the Computing Division).

SAM power users
- B group: Konstantin Anikeev and Matt Herndon
- Electroweak: David Dagenhart
- Exotics: Ray Culbertson
- QCD: Kenichi Hatakeyama
- Top: Daniel Whiteson

CDF (Now)
- All DFC metadata has been replicated automatically in SAM.
- Production farm output (since June 2005, e.g. the current 0h and 0i) is stored only in SAM (not in the DFC).
  - Soon to be applied to raw data as well.
- Calibrations for the Farm Production Exe and DQM use SAM to read the data.
- Stn and Top ntuples are being produced by accessing data through SAM.

Going from DFC to SAM
- The diskcache_i interface to SAM and the CAF make using SAM relatively transparent (though not pain-free).
- Some things change:
  - SAM delivers data as files (not filesets), specified by user-defined datasets.
  - SAM optimization is not the same as in the DFC.
  - Files are NOT delivered to job sections in a fixed order (each time a job runs, its sections can receive them in a different order).
  - SAM provides good reporting for files read by jobs.
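The point about delivery order can be illustrated with a small toy model. This is only a sketch under stated assumptions, not SAM code: the function names (run_sections, consumed_report) and the file names are hypothetical, and the shuffle merely stands in for whatever order the data handling system chooses. It shows why job logic should depend on the set of files consumed rather than on which section received which file.

```python
import random


def run_sections(dataset_files, n_sections, seed=None):
    """Toy model: hand each file of a dataset to one of n_sections.

    The shuffle stands in for a delivery order that is not fixed
    between runs (hypothetical model, not the real scheduler).
    """
    rng = random.Random(seed)
    files = list(dataset_files)
    rng.shuffle(files)
    sections = [[] for _ in range(n_sections)]
    for i, name in enumerate(files):
        sections[i % n_sections].append(name)
    return sections


def consumed_report(sections):
    """Order-independent bookkeeping: the sorted list of files read by any section."""
    return sorted(name for section in sections for name in section)


# Hypothetical dataset of six files, processed twice by three job sections.
dataset = ["file_%02d.root" % i for i in range(6)]
run_a = run_sections(dataset, 3, seed=1)
run_b = run_sections(dataset, 3, seed=2)

# The per-section assignment may differ between runs,
# but the overall consumption report is the same.
print(consumed_report(run_a) == consumed_report(run_b))
```

Comparing such a report against the dataset definition is one simple way to spot files that a job never read.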

CDF (soon)
- Gen 6 MC will be uploaded into SAM only.
- The tools exist (sam_upload).
  - They were developed and tested for B physics data skimming.
  - They were tested for the upload of MC data from Toronto last week.

(future enhancements)
- More advanced handshaking between the CAF and SAM to facilitate more robust bookkeeping and error recovery (by the recovery projects).
- Improved interface between SAM and dCache.

(more use cases) Ntuples
- Ntuples can be loaded into SAM.
- Ntuple files need to be sufficiently large (> 1 GB); small files are bad for pnfs/dCache/Enstore.
- Metadata must be associated with the ntuples.
- A ROOT macro example using SAM:
  - Needs further testing/development.
- SAM must be used when running on ntuples in dCache on the CAFs; using the old methods is no longer an option due to administrative and scalability issues.
- Help is needed from the collaboration to solve this problem...
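The file-size requirement lends itself to a simple pre-upload check. The following is a minimal sketch, not part of any SAM tool: the helper split_by_size, the file names, and the sizes are hypothetical, and "1 GB" is read here as 1 GiB, which is an assumption.

```python
# Assumption: the "> 1 GB" guideline is interpreted as more than 1 GiB.
MIN_NTUPLE_BYTES = 1024 ** 3


def split_by_size(sizes, threshold=MIN_NTUPLE_BYTES):
    """Split a {file name: size in bytes} mapping into (large, small) name lists.

    Files strictly larger than the threshold count as large enough;
    the rest are candidates for merging before they are stored.
    """
    large = sorted(name for name, size in sizes.items() if size > threshold)
    small = sorted(name for name, size in sizes.items() if size <= threshold)
    return large, small


# Hypothetical ntuple files and sizes, for illustration only.
example_sizes = {
    "ntuple_a.root": 3 * 1024 ** 3,    # 3 GiB
    "ntuple_b.root": 200 * 1024 ** 2,  # 200 MiB
    "ntuple_c.root": 1024 ** 3 + 1,    # just over 1 GiB
}
large, small = split_by_size(example_sizes)
print("large enough:", large)
print("too small, consider merging:", small)
```

In practice the sizes would come from the file system (for example via os.path.getsize) rather than from a hard-coded mapping.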

(for offsite use and installation)
- Please look at the following documentation
- SAM can be used to fetch data from FNAL to offsite locations and handle it there.
- Datasets are available at remote CAFs/SAM stations.

SAM & Data Servers (used in data reading)
[Diagram; components and labels as listed below]
- User Job / SAM Client: starts/stops Project/Consumer/Consumer Processes; requests/releases files
- SAM Station: initiates file transfers/purging; delivers filenames
- SAM Project Masters: processes spawned by the station; keep project context; track file consumption; do bookkeeping
- SAM Stager: downloads/purges files to/from the cache (not used in the SAM/dCache interface implementation)
- SAM Data Base Servers: list of files (Snapshot)
- Offline Data Base
- Enstore/dCache: file read (or copy)
- Information exchanged between components: Snapshot; Project name; Project/Dataset name (constraints); file location info; file path & name; file release status; file delivery/consumption info; snapshot/project info
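The request/release cycle between a consumer and a project can be sketched as a toy model. This is an illustration under assumptions, not the SAM client API: the class ToyProject, its methods, and the status strings are all hypothetical. It only mirrors the roles named above: a project works from a fixed snapshot (list of files), hands out file names on request, and records a release status for each file.

```python
class ToyProject:
    """Hypothetical stand-in for a project: a fixed snapshot plus per-file bookkeeping."""

    def __init__(self, name, snapshot):
        self.name = name
        self.pending = list(snapshot)  # files not yet delivered
        self.status = {}               # file name -> "delivered", "consumed" or "failed"

    def request_file(self):
        """Return the next file name, or None when the snapshot is exhausted."""
        if not self.pending:
            return None
        name = self.pending.pop(0)
        self.status[name] = "delivered"
        return name

    def release_file(self, name, ok=True):
        """Record the release status of a file that was previously delivered."""
        if self.status.get(name) != "delivered":
            raise ValueError("file was not delivered: %s" % name)
        self.status[name] = "consumed" if ok else "failed"

    def summary(self):
        """Count files per status, as a minimal consumption report."""
        counts = {}
        for state in self.status.values():
            counts[state] = counts.get(state, 0) + 1
        return counts


# A consumer loop: request a file, process it, release it with a status.
project = ToyProject("toy_project", ["a.root", "b.root", "c.root"])
while True:
    current = project.request_file()
    if current is None:
        break
    # Pretend that processing b.root fails.
    project.release_file(current, ok=(current != "b.root"))

result = project.summary()
print(result)
```

Keeping the per-file status in one place is what makes later reporting and recovery of failed files possible in this toy model.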

[Figures: tests on cdf-sam; reconstruction started on the SAM-based farm; an example of remote use]

Conclusions
- SAM usage at CDF is increasing.
- The AC++/SAM/CAF interfaces are maturing.
- Help is needed from the collaboration, especially with the ntuple use case.