ATLAS on Grid3/OSG R. Gardner December 16, 2004.

Slides:



Advertisements
Similar presentations
Role Based VO Authorization Services Ian Fisk Gabriele Carcassi July 20, 2005.
Advertisements

4/2/2002HEP Globus Testing Request - Jae Yu x Participating in Globus Test-bed Activity for DØGrid UTA HEP group is playing a leading role in establishing.
1 User Analysis Workgroup Update  All four experiments gave input by mid December  ALICE by document and links  Very independent.
S. Gadomski, "ATLAS computing in Geneva", journee de reflexion, 14 Sept ATLAS computing in Geneva Szymon Gadomski description of the hardware the.
The Panda System Mark Sosebee (for K. De) University of Texas at Arlington dosar workshop March 30, 2006.
1 ATLAS DC2 Production …on Grid3 M. Mambelli, University of Chicago for the US ATLAS DC2 team September 28, 2004 CHEP04.
Alexandre A. P. Suaide VI DOSAR workshop, São Paulo, 2005 STAR grid activities and São Paulo experience.
CHEP'07 September D0 data reprocessing on OSG Authors Andrew Baranovski (Fermilab) for B. Abbot, M. Diesburg, G. Garzoglio, T. Kurca, P. Mhashilkar.
ATLAS DC2 seen from Prague Tier2 center - some remarks Atlas sw workshop September 2004.
DDM-Panda Issues Kaushik De University of Texas At Arlington DDM Workshop, BNL September 29, 2006.
F. Fassi, S. Cabrera, R. Vives, S. González de la Hoz, Á. Fernández, J. Sánchez, L. March, J. Salt, A. Lamas IFIC-CSIC-UV, Valencia, Spain Third EELA conference,
WORD JUMBLE. Months of the year Word in jumbled form e r r f b u y a Word in jumbled form e r r f b u y a february Click for the answer Next Question.
BNL Tier 1 Service Planning & Monitoring Bruce G. Gibbard GDB 5-6 August 2006.
Review of Condor,SGE,LSF,PBS
1 User Analysis Workgroup Discussion  Understand and document analysis models  Best in a way that allows to compare them easily.
Role Based VO Authorization Services Ian Fisk Gabriele Carcassi July 20, 2005.
6/23/2005 R. GARDNER OSG Baseline Services 1 OSG Baseline Services In my talk I’d like to discuss two questions:  What capabilities are we aiming for.
ATLAS WAN Requirements at BNL Slides Extracted From Presentation Given By Bruce G. Gibbard 13 December 2004.
Post-DC2/Rome Production Kaushik De, Mark Sosebee University of Texas at Arlington U.S. Grid Phone Meeting July 13, 2005.
EGEE-III INFSO-RI Enabling Grids for E-sciencE Ricardo Rocha CERN (IT/GS) EGEE’08, September 2008, Istanbul, TURKEY Experiment.
The ATLAS Cloud Model Simone Campana. LCG sites and ATLAS sites LCG counts almost 200 sites. –Almost all of them support the ATLAS VO. –The ATLAS production.
2011 Calendar Important Dates/Events/Homework. SunSatFriThursWedTuesMon January
Auditing Project Architecture VERY HIGH LEVEL Tanya Levshina.
ATLAS Grid Computing Rob Gardner University of Chicago ICFA Workshop on HEP Networking, Grid, and Digital Divide Issues for Global e-Science THE CENTER.
LCG Accounting Update John Gordon, CCLRC-RAL WLCG Workshop, CERN 24/1/2007 LCG.
OPERATIONS REPORT JUNE – SEPTEMBER 2015 Stefan Roiser CERN.
Pavel Nevski DDM Workshop BNL, September 27, 2006 JOB DEFINITION as a part of Production.
EGEE is a project funded by the European Union under contract INFSO-RI Grid accounting with GridICE Sergio Fantinel, INFN LNL/PD LCG Workshop November.
1 The Capone Workflow Manager M. Mambelli, University of Chicago R. Gardner, University of Chicago J. Gieraltowsky, Argonne National Laboratory 14 th February.
The ATLAS Strategy for Distributed Analysis on several Grid Infrastructures D. Liko, IT/PSS for the ATLAS Distributed Analysis Community.
1 A Scalable Distributed Data Management System for ATLAS David Cameron CERN CHEP 2006 Mumbai, India.
VO VOCE - Availability and Stability of Resources Enabling Grids for E-sciencE VO VOCE - Availability and Stability of Resources.
Distributed Physics Analysis Past, Present, and Future Kaushik De University of Texas at Arlington (ATLAS & D0 Collaborations) ICHEP’06, Moscow July 29,
D.Spiga, L.Servoli, L.Faina INFN & University of Perugia CRAB WorkFlow : CRAB: CMS Remote Analysis Builder A CMS specific tool written in python and developed.
Western Tier 2 Site at SLAC Wei Yang US ATLAS Tier 2 Workshop Harvard University August 17-18, 2006.
WMS baseline issues in Atlas Miguel Branco Alessandro De Salvo Outline  The Atlas Production System  WMS baseline issues in Atlas.
LCG Accounting Update John Gordon, CCLRC-RAL 10/1/2007.
LCG Introduction John Gordon, STFC-RAL GDB June 11 th, 2008.
LCG Introduction John Gordon, STFC-RAL GDB November 7th, 2007.
Job submission overview Marco Mambelli – August OSG Summer Workshop TTU - Lubbock, TX THE UNIVERSITY OF CHICAGO.
VO Experiences with Open Science Grid Storage OSG Storage Forum | Wednesday September 22, 2010 (10:30am)
ATLAS Computing: Experience from first data processing and analysis Workshop TYL’10.
Enabling Grids for E-sciencE Claudio Cherubino INFN DGAS (Distributed Grid Accounting System)
Panda Monitoring, Job Information, Performance Collection Kaushik De (UT Arlington), Torre Wenaus (BNL) OSG All Hands Consortium Meeting March 3, 2008.
LHCb Computing 2015 Q3 Report Stefan Roiser LHCC Referees Meeting 1 December 2015.
Condor DAGMan: Managing Job Dependencies with Condor
Xiaomei Zhang CMS IHEP Group Meeting December
U.S. ATLAS Grid Production Experience
Belle II Physics Analysis Center at TIFR
Data Challenge with the Grid in ATLAS
Joint JRA1/JRA3/NA4 session
INFN-GRID Workshop Bari, October, 26, 2004
The LHCb Software and Computing NSS/IEEE workshop Ph. Charpentier, CERN B00le.
LHCb Computing Model and Data Handling Angelo Carbone 5° workshop italiano sulla fisica p-p ad LHC 31st January 2008.
Philippe Charpentier CERN – LHCb On behalf of the LHCb Computing Group
Readiness of ATLAS Computing - A personal view
ATLAS DC2 ISGC-2005 Taipei 27th April 2005
ATLAS Sites Jamboree, CERN January, 2017
JRA2 Pisa, Tuesday, 25 October 2005
Grid Deployment Board meeting, 8 November 2006, CERN
Job workflow Pre production operations:
Summary from last MB “The MB agreed that a detailed deployment plan and a realistic time scale are required for deploying glexec with setuid mode at WLCG.
Simulation use cases for T2 in ALICE
US ATLAS Physics & Computing
Tropical cyclones movement
ATLAS DC2 & Continuous production
Habitat Changes and Fish Migration
2015 January February March April May June July August September
Habitat Changes and Fish Migration
The LHCb Computing Data Challenge DC06
Presentation transcript:

ATLAS on Grid3/OSG R. Gardner December 16, 2004

ATLAS Applications Pythia Generation Geant4 simulation Pileup Digitization Reconstruction

ATLAS Users DC2 production team User production Managed production High priority 7 users User production Opportunistic production and reconstruction 3 users growing

ATLAS DC2 on Grid3 Production statistics on Grid3 (End of November 2004) Overall “success” rate: 74% Through September: 66% During last 2 months: finished: 53163 failed:14353  success rate: 78%. We improved our results since (September) Only 2-3 submit-clients now (10-20 in September ) # Job status Capone Total 1 failed 33165 2 finished 90534 3 running 101 4 submitted 42  

Job Success Rate on GRID3 Passed Failed Success Rate July 8799 6676 57% August 17083 9448 64% September 17283 7717 69% October 26600 5186 84% November 21869 5038 81% Key factors in improved success rate: Experienced team using common submit hosts Quicker response to large scale site/network/hardware failures Can we improve more? Some shifts >95% success, others <50% Automatic throttle for failures? But still lose all running jobs Do we care? K. De + improvements to Capone/GCE

ATLAS ProdDB 1 BU_ATLAS_Tier2 19395 16349 3046 84.29 2 UTA_dpcc 19214 # CE Gatekeeper Finished+Failed Jobs Finished Jobs Failed Success Rate (%) 1 BU_ATLAS_Tier2 19395 16349 3046 84.29 2 UTA_dpcc 19214 14634 4580 76.16 3 UC_ATLAS_Tier2 13285 11196 2089 84.28 4 BNL_ATLAS 11261 8993 2268 79.86 5 IU_ATLAS_Tier2 10528 8403 2125 79.82 6 UM_ATLAS 9434 6054 3380 64.17 7 BNL_ATLAS_BAK 6061 4578 1483 75.53 8 UBuffalo_CCR 4654 3992 662 85.78 9 PDSF 5075 3590 1485 70.74 10 FNAL_CMS 3857 2222 1635 57.61 11 CalTech_PG 3136 2178 958 69.45 12 UCSanDiego_PG 2828 2101 727 74.29 13 FNAL_CMS2 2157 1506 651 69.82 14 SMU_Physics_Cluster 1462 969 493 66.28 15 BU_AGT_Tier2 975 820 155 84.10 16 PSU_Grid3 769 583 186 75.81 17 OU_OSCER 843 575 268 68.21 18 UFlorida_PG 946 451 495 47.67 19 Rice_Grid3 569 370 199 65.03 20 UWMadison 803 363 440 45.21 21 UNM_HPC 502 347 69.12 22 OU_OSCER_LSF 412 251 161 60.92 ATLAS ProdDB

Detailed Job Failures (un-normalized) Total, till Nov. Total, till Sep. Last 2 months Submission 894 472 422 Execution 428 Post Run 10131 1147 8984 Stage-Out 10833 8037 2796 RLS 1065 989 76 Capone 3975 2725 1250 Windmill 564 57 507 Other 5225 5139 86 TOTAL 33165 19303 13862

Status of GRID3 Jobs evgen simul digi pile-up Done % dc2.003003.B1_jets_180 100 100% 19998 11899 60% 14833 74% dc2.003028.A9_susy 400 11409 71% 7992 50% dc2.003034.J1_Pt_17_35 2 dc2.003035.J2_Pt_35_70 dc2.003036.J3_Pt_70_140 dc2.003037.J4_Pt_140_280 dc2.003038.J5_Pt_280_560 dc2.003039.J6_Pt_560_1120 dc2.003040.J7_Pt_1120_2240 1 200 dc2.003041.J8_Pt_2240 dc2.003043.B2_gamjet 4000 3990 dc2.003054.B3_Bmumu 4300 86% 0% dc2.003080.B4_jets17 9606 96% To Do – extra A9 simulation, some digitization and some B1 pile-up Note – also waiting for some B3 and B4 input evgen files from LCG K. De

ATLAS historical use ACDC archive

ATLAS Jobs by site ACDC archive

Grid3OSG Resource Availability ATLAS expects to be running continuous production starting now throughout 2005 This activity consists of: Completion of DC2 Production for the Rome physics workshop in June User production via Capone clients Distributed analysis via ADA Expect trend towards resource saturation to continue as more users are equipped with job submission tools

Some OSG Issues Managed storage is now the biggest problem facing continued DC2 production for both access and space management Authorization role based, access rights, queue priorities policy infrastructure, publication Accounting service user-level what resources have been used cpu, storage over an arbitrary time period Operations – extend operations protocol between BNL Tier1 and iGOC/OSG operations activity