Efi.uchicago.edu ci.uchicago.edu FAX status report Ilija Vukotic on behalf of the atlas-adc-federated-xrootd working group Computation and Enrico Fermi.

Slides:



Advertisements
Similar presentations
SkimSlimService ENABLING NEW WAYS. Problems of Current Analysis Model 2/18/13ILIJA VUKOTIC 2 Unsustainable in the long run (higher luminosity, no faster.
Advertisements

FAX status. Overview Status of endpoints and redirectors Monitoring Failover Overflow.
Outline Network related issues and thinking for FAX Cost among sites, who has problems Analytics of FAX meta data, what are the problems  The main object.
Efi.uchicago.edu ci.uchicago.edu FAX update Rob Gardner Computation and Enrico Fermi Institutes University of Chicago Sep 9, 2013.
Efi.uchicago.edu ci.uchicago.edu FAX status report Ilija Vukotic Computation and Enrico Fermi Institutes University of Chicago US ATLAS Computing Integration.
ATLAS federated xrootd monitoring requirements Rob Gardner July 26, 2012.
YAN, Tian On behalf of distributed computing group Institute of High Energy Physics (IHEP), CAS, China CHEP-2015, Apr th, OIST, Okinawa.
News from the HEPiX IPv6 Working Group David Kelsey (STFC-RAL) GridPP35, Liverpool 11 Sep 2015.
Storage Wahid Bhimji DPM Collaboration : Tasks. Xrootd: Status; Using for Tier2 reading from “Tier3”; Server data mining.
FAX UPDATE 1 ST JULY Discussion points: FAX failover summary and issues Mailing issues Panda re-brokering to sites using FAX cost and access Issue.
FAX UPDATE 26 TH AUGUST Running issues FAX failover Moving to new AMQ server Informing on endpoint status Monitoring developments Monitoring validation.
Efi.uchicago.edu ci.uchicago.edu ATLAS Experiment Status Run2 Plans Federation Requirements Ilija Vukotic XRootD UCSD San Diego 27 January,
Xrootd Monitoring for the CMS Experiment Abstract: During spring and summer 2011 CMS deployed Xrootd front- end servers on all US T1 and T2 sites. This.
Efi.uchicago.edu ci.uchicago.edu Towards FAX usability Rob Gardner, Ilija Vukotic Computation and Enrico Fermi Institutes University of Chicago US ATLAS.
Efi.uchicago.edu ci.uchicago.edu FAX meeting intro and news Rob Gardner Computation and Enrico Fermi Institutes University of Chicago ATLAS Federated Xrootd.
Wahid, Sam, Alastair. Now installed on production storage Edinburgh: srm.glite.ecdf.ed.ac.uk  Local and global redir work (port open) e.g. root://srm.glite.ecdf.ed.ac.uk//atlas/dq2/mc12_8TeV/NTUP_SMWZ/e1242_a159_a165_r3549_p1067/mc1.
MW Readiness Verification Status Andrea Manzi IT/SDC 21/01/ /01/15 2.
Efi.uchicago.edu ci.uchicago.edu FAX Dress Rehearsal Status Report Ilija Vukotic on behalf of the atlas-adc-federated-xrootd working group Computation.
Efi.uchicago.edu ci.uchicago.edu Using FAX to test intra-US links Ilija Vukotic on behalf of the atlas-adc-federated-xrootd working group Computing Integration.
Efi.uchicago.edu ci.uchicago.edu FAX status developments performance future Rob Gardner Yang Wei Andrew Hanushevsky Ilija Vukotic.
Factors affecting ANALY_MWT2 performance MWT2 team August 28, 2012.
Storage Federations and FAX (the ATLAS Federation) Wahid Bhimji University of Edinburgh.
Marco Cattaneo LHCb computing status for LHCC referees meeting 14 th June
Efi.uchicago.edu ci.uchicago.edu Status of the FAX federation Rob Gardner Computation and Enrico Fermi Institutes University of Chicago ATLAS Tier 1 /
SLACFederated Storage Workshop Summary For pre-GDB (Data Access) Meeting 5/13/14 Andrew Hanushevsky SLAC National Accelerator Laboratory.
ATLAS XRootd Demonstrator Doug Benjamin Duke University On behalf of ATLAS.
Efi.uchicago.edu ci.uchicago.edu FAX status report Ilija Vukotic on behalf of the atlas-adc-federated-xrootd working group S&C week Jun 2, 2014.
The ATLAS Cloud Model Simone Campana. LCG sites and ATLAS sites LCG counts almost 200 sites. –Almost all of them support the ATLAS VO. –The ATLAS production.
FAX PERFORMANCE TIM, Tokyo May PERFORMANCE TIM, TOKYO, MAY 2013ILIJA VUKOTIC 2  Metrics  Data Coverage  Number of users.
PERFORMANCE AND ANALYSIS WORKFLOW ISSUES US ATLAS Distributed Facility Workshop November 2012, Santa Cruz.
1 Configuring Sites Configuring Site Settings Configuring Inter-Site Replication Troubleshooting Replication Maintaining Server Settings.
FAX UPDATE 12 TH AUGUST Discussion points: Developments FAX failover monitoring and issues SSB Mailing issues Panda re-brokering to FAX Monitoring.
Efi.uchicago.edu ci.uchicago.edu Data Federation Strategies for ATLAS using XRootD Ilija Vukotic On behalf of the ATLAS Collaboration Computation and Enrico.
Efi.uchicago.edu ci.uchicago.edu Ramping up FAX and WAN direct access Rob Gardner on behalf of the atlas-adc-federated-xrootd working group Computation.
Efi.uchicago.edu ci.uchicago.edu Storage federations, caches & WMS Rob Gardner Computation and Enrico Fermi Institutes University of Chicago BigPanDA Workshop.
Distributed Analysis Tutorial Dietrich Liko. Overview  Three grid flavors in ATLAS EGEE OSG Nordugrid  Distributed Analysis Activities GANGA/LCG PANDA/OSG.
CMS: T1 Disk/Tape separation Nicolò Magini, CERN IT/SDC Oliver Gutsche, FNAL November 11 th 2013.
Data Analysis w ith PROOF, PQ2, Condor Data Analysis w ith PROOF, PQ2, Condor Neng Xu, Wen Guan, Sau Lan Wu University of Wisconsin-Madison 30-October-09.
EU privacy issue Ilija Vukotic 6 th October 2014.
News from the HEPiX IPv6 Working Group David Kelsey (STFC-RAL) HEPIX, BNL 13 Oct 2015.
The HEPiX IPv6 Working Group David Kelsey (STFC-RAL) EGI OMB 19 Dec 2013.
PanDA Configurator and Network Aware Brokerage Fernando Barreiro Megino, Kaushik De, Tadashi Maeno 14 March 2015, US ATLAS Distributed Facilities Meeting,
Data Distribution Performance Hironori Ito Brookhaven National Laboratory.
Efi.uchicago.edu ci.uchicago.edu FAX splinter session Rob Gardner Computation and Enrico Fermi Institutes University of Chicago ATLAS Tier 1 / Tier 2 /
Efi.uchicago.edu ci.uchicago.edu Federating ATLAS storage using XrootD (FAX) Rob Gardner on behalf of the atlas-adc-federated-xrootd working group Computation.
Efi.uchicago.edu ci.uchicago.edu Sharing Network Resources Ilija Vukotic Computation and Enrico Fermi Institutes University of Chicago Federated Storage.
ATLAS Computing: Experience from first data processing and analysis Workshop TYL’10.
HEPiX IPv6 Working Group David Kelsey (STFC-RAL) GridPP33 Ambleside 22 Aug 2014.
CERN IT Department CH-1211 Genève 23 Switzerland t EGEE09 Barcelona ATLAS Distributed Data Management Fernando H. Barreiro Megino on behalf.
Efi.uchicago.edu ci.uchicago.edu Caching FAX accesses Ilija Vukotic ADC TIM - Chicago October 28, 2014.
Storage discovery in AliEn
Efi.uchicago.edu ci.uchicago.edu FAX splinter session Rob Gardner Computation and Enrico Fermi Institutes University of Chicago ATLAS Tier 1 / Tier 2 /
Valencia Cluster status Valencia Cluster status —— Gang Qin Nov
WLCG IPv6 deployment strategy
LHCOPN/LHCONE status report pre-GDB on Networking CERN, Switzerland 10th January 2017
LHCOPN update Brookhaven, 4th of April 2017
BNL Tier1 Report Worker nodes Tier 1: added 88 Dell R430 nodes
Future of WAN Access in ATLAS
ATLAS Cloud Operations
Outline Benchmarking in ATLAS Performance scaling
Jan 12, 2005 Improving CMS data transfers among its distributed Computing Facilities N. Magini CERN IT-ES-VOS, Geneva, Switzerland J. Flix Port d'Informació.
A full demonstration based on a “real” analysis scenario
Active Directory Administration
Data Federation with Xrootd Wei Yang US ATLAS Computing Facility meeting Southern Methodist University, Oct 11-12, 2011.
ATLAS Sites Jamboree, CERN January, 2017
FDR readiness & testing plan
Brookhaven National Laboratory Storage service Group Hironori Ito
An introduction to the ATLAS Computing Model Alessandro De Salvo
DiFX Python Interface John Spitzak (USNO).
IPv6 update Duncan Rand Imperial College London
Presentation transcript:

efi.uchicago.edu ci.uchicago.edu FAX status report Ilija Vukotic on behalf of the atlas-adc-federated-xrootd working group Computation and Enrico Fermi Institutes University of Chicago ADC Development Meeting March 17, 2014

efi.uchicago.edu ci.uchicago.edu 2 Content Deployment Status Tests Developments – Failover to FAX – Job overflow – User tools

efi.uchicago.edu ci.uchicago.edu 3 Deployment Cloud T1sT2DsT2 DoneTotalDoneTotalDoneTotal CA US FR DE IT ES NL UK TW ND CERN RU Total

efi.uchicago.edu ci.uchicago.edu 4 Deployment CA cloud – Main contact Asoka D S – Start with TRIUMF o Connected to ORAN o Has xrootd door o Not open for external traffic o Being tested NL cloud – No word yet from Sara and Nikhef – 3 sites in Israel o Storm o Have 10Gbps connection ES cloud – PIC will rejoin – Has to stay with dCache 2.2 New redirectors – Will reorganize North America redirection tree o West (TRIUMF, SLAC, McGill, Scinet) o Central (MWT2, AGLT2, OU, SWT2) o East (BNL, BU, Victoria,SFU) – Add NL for NL T1s and the Scandinavian T2s – Add IL for Israeli sites – The rest of ND cloud distributed over existing redirectors according to their geographical location

efi.uchicago.edu ci.uchicago.edu 5 Status Most sites running stably All monitoring tools running stably.

efi.uchicago.edu ci.uchicago.edu 6 Tests Will try to saturate a test 100Gbps link MWT2-BNL – Analysis jobs – High IO direct access jobs – Simple xrdcp jobs Wahid will test running UCL as a diskless T2 with redirection to QMUL Two additional stress test in US cloud – Collects all the FTS transfers done in the cloud – Stressing the doors – simply looks up the files – Stressing the links – repeats the transfers but discards the files transferred

efi.uchicago.edu ci.uchicago.edu 7 Developments Failover to FAX – Turned on for all of the queues o Sites belonging to a cloud with it’s own redirector redirect to it. o NL, ND sites redirect to EU (root://atlas-xrd-eu.cern.ch/) o CA to global (root://glrd.usatlas.org/) – Developed a code that can send mails in case of a lot of jobs failing over o This mail would go directly to site admins o That should help them find why jobs were unable to get the data in a “normal” way. o More than 100 failed over jobs in any one queue in last 6 hours will generate the mail.

efi.uchicago.edu ci.uchicago.edu 8 Developments

efi.uchicago.edu ci.uchicago.edu 9 Developments Overflow to FAX – Need to propagate additional information to schedconfigDB and then JEDI scheduler and pilot o Data source FAX endpoint address – will be used by pilot to get the data from the optimal source o wanlimitsource, wanlimitsink – Variables set for each queue separately – Unit is Gbps (integer) o Still not clear how scheduler will enforce the limits – Need changes in pilot.

efi.uchicago.edu ci.uchicago.edu 10 User tools Currently in version 16 of localSetupFAX Added FAX-get – equivalent of dq2-get but getting data from FAX – a number of options still to come All the commands renamed from FAX-xxx to fax-xxx