2012-05-10Minos Computing Infrastructure Arthur Kreymer 1 ● Young Minos issues: – /local/stage1/minosgli temporary reprieve – need grid shared area – Retiring.

Slides:



Advertisements
Similar presentations
GridPP July 2003Stefan StonjekSlide 1 SAM middleware components Stefan Stonjek University of Oxford 7 th GridPP Meeting 02 nd July 2003 Oxford.
Advertisements

2 Copyright © 2005, Oracle. All rights reserved. Installing the Oracle Database Software.
Grid and CDB Janusz Martyniak, Imperial College London MICE CM37 Analysis, Software and Reconstruction.
Oxford Jan 2005 RAL Computing 1 RAL Computing Implementing the computing model: SAM and the Grid Nick West.
Minerva Infrastructure Meeting – October 04, 2011.
Jun 29, 20101/25 Storage Evaluation on FG, FC, and GPCF Jun 29, 2010 Gabriele Garzoglio Computing Division, Fermilab Overview Introduction Lustre Evaluation:
Hands-On Microsoft Windows Server 2008 Chapter 1 Introduction to Windows Server 2008.
The SAM-Grid Fabric Services Gabriele Garzoglio (for the SAM-Grid team) Computing Division Fermilab.
UCL Site Report Ben Waugh HepSysMan, 22 May 2007.
CC - IN2P3 Site Report Hepix Fall meeting 2009 – Berkeley
The SAMGrid Data Handling System Outline:  What Is SAMGrid?  Use Cases for SAMGrid in Run II Experiments  Current Operational Load  Stress Testing.
BaBar MC production BaBar MC production software VU (Amsterdam University) A lot of computers EDG testbed (NIKHEF) Jobs Results The simple question:
03/27/2003CHEP20031 Remote Operation of a Monte Carlo Production Farm Using Globus Dirk Hufnagel, Teela Pulliam, Thomas Allmendinger, Klaus Honscheid (Ohio.
CDF data production models 1 Data production models for the CDF experiment S. Hou for the CDF data production team.
3rd June 2004 CDF Grid SAM:Metadata and Middleware Components Mòrag Burgon-Lyon University of Glasgow.
Jean-Yves Nief CC-IN2P3, Lyon HEPiX-HEPNT, Fermilab October 22nd – 25th, 2002.
MiniBooNE Computing Description: Support MiniBooNE online and offline computing by coordinating the use of, and occasionally managing, CD resources. Participants:
Overview of day-to-day operations Suzanne Poulat.
1 st December 2003 JIM for CDF 1 JIM and SAMGrid for CDF Mòrag Burgon-Lyon University of Glasgow.
Jefferson Lab Site Report Kelvin Edwards Thomas Jefferson National Accelerator Facility HEPiX – Fall, 2005.
SAMGrid as a Stakeholder of FermiGrid Valeria Bartsch Computing Division Fermilab.
Manchester HEP Desktop/ Laptop 30 Desktop running RH Laptop Windows XP & RH OS X Home server AFS using openafs 3 DB servers Kerberos 4 we will move.
SAM Installation Lauri Loebel Carpenter and the SAM Team February
Workshop on Computing for Neutrino Experiments - Summary April 24, 2009 Lee Lueking, Heidi Schellman NOvA Collaboration Meeting.
Remote Control Room and SAM DH Shifts at KISTI for CDF Experiment 김현우, 조기현, 정민호 (KISTI), 김동희, 양유철, 서준석, 공대정, 김지은, 장성현, 칸 아딜 ( 경북대 ), 김수봉, 이재승, 이영장, 문창성,
Fermilab Site Report Spring 2012 HEPiX Keith Chadwick Fermilab Work supported by the U.S. Department of Energy under contract No. DE-AC02-07CH11359.
What is SAM-Grid? Job Handling Data Handling Monitoring and Information.
Outline: Tasks and Goals The analysis (physics) Resources Needed (Tier1) A. Sidoti INFN Pisa.
Scientific Storage at FNAL Gerard Bernabeu Altayo Dmitry Litvintsev Gene Oleynik 14/10/2015.
Outline: Status: Report after one month of Plans for the future (Preparing Summer -Fall 2003) (CNAF): Update A. Sidoti, INFN Pisa and.
CD FY09 Tactical Plan Status FY09 Tactical Plan Status Report for Neutrino Program (MINOS, MINERvA, General) Margaret Votava April 21, 2009 Tactical plan.
RHIC/US ATLAS Tier 1 Computing Facility Site Report Christopher Hollowell Physics Department Brookhaven National Laboratory HEPiX Upton,
D0 File Replication PPDG SLAC File replication workshop 9/20/00 Vicky White.
1 5/4/05 Fermilab Mass Storage Enstore, dCache and SRM Michael Zalokar Fermilab.
Nov 18, 20091/15 Grid Storage on FermiGrid and GPCF – Activity Discussion Grid Storage on FermiGrid and GPCF Activity Discussion Nov 18, 2009 Gabriele.
Patrick Gartung 1 CMS 101 Mar 2007 Introduction to the User Analysis Facility (UAF) Patrick Gartung - Fermilab.
CD FY10 Budget and Tactical Plan Review FY10 Tactical Plans for Scientific Computing Facilities / General Physics Computing Facility (GPCF) Stu Fuess 06-Oct-2009.
CD FY10 Budget and Tactical Plan Review FY10 Tactical Plan for SCF / System Administration DocDB #3389 Jason Allen 10/06/2009.
GPCF* Update Present status as a series of questions / answers related to decisions made / yet to be made * General Physics Computing Facility (GPCF) is.
SAM projects status Robert Illingworth 29 August 2012.
Computing Infrastructure – Minos 2009/06 ● MySQL and CVS upgrades ● Hardware deployment ● Condor / Grid status ● /minos/data file server ● Parrot status.
Computing Infrastructure – Minos 2009/12 ● Downtime schedule – 3 rd Thur monthly ● Dcache upgrades ● Condor / Grid status ● Bluearc performance – cpn lock.
Computing Infrastructure Arthur Kreymer 1 ● Power status in FCC (UPS1) ● Bluearc disk purchase – coming soon ● Planned downtimes – none ! ● Minos.
Minos Computing Infrastructure Arthur Kreymer 1 ● Young Minos issues: – /local/stage1/minosgli temporary reprieve – need grid shared area – Retiring.
Minos Computing Infrastructure Arthur Kreymer 1 ● Grid – Going to SLF 5, doubled capacity in GPFarm ● Bluearc - performance good, expanding.
Computing Infrastructure Arthur Kreymer 1 ● Power status in FCC (UPS1) ● Bluearc disk purchase – still coming soon ● Planned downtimes – none.
Alexander Moibenko File Aggregation in Enstore (Small Files) Status report 2
Status report NIKHEF Willem van Leeuwen February 11, 2002 DØRACE.
WP18, High-speed data recording Krzysztof Wrona, European XFEL
The New APEL Client Will Rogers, STFC.
Dag Toppe Larsen UiB/CERN CERN,
U.S. ATLAS Grid Production Experience
Archiver.ias.ethz.ch easy to use solution for end user to store data on LTS strongbox. requirements: apache web-server to host end user frontend (php,html,jquery)
Dag Toppe Larsen UiB/CERN CERN,
Grid status ALICE Offline week Nov 3, Maarten Litmaath CERN-IT v1.0
SAM at CCIN2P3 configuration issues
Luca dell’Agnello INFN-CNAF
Artem Trunov and EKP team EPK – Uni Karlsruhe
Storage elements discovery
TYPES OF SERVER. TYPES OF SERVER What is a server.
Welcome to our Nuclear Physics Computing System
A Web-Based Data Grid Chip Watson, Ian Bird, Jie Chen,
Welcome to our Nuclear Physics Computing System
Status report NIKHEF Willem van Leeuwen February 11, 2002 DØRACE.
Distributing META-pipe on ELIXIR compute resources
Lee Lueking D0RACE January 17, 2002
INFNGRID Workshop – Bari, Italy, October 2004
Status and plans for bookkeeping system and production tools
Linux Cluster Tools Development
Presentation transcript:

Minos Computing Infrastructure Arthur Kreymer 1 ● Young Minos issues: – /local/stage1/minosgli temporary reprieve – need grid shared area – Retiring most AFS data areas, due to old hardware ● FermiMail (IMAP) migration is complete ● SLF5 migration is complete – have minos-slf4 short term ● Planned outages – Apr 19 - Dcache Chimera upgrade – NFS 4.1 access with SLF 6.2 – May 1-2 Bluearc firmware upgrade completed – May 17 – AFS upgrade to support Fermilab tokens ● Control Room - System Management by FEF coming - CD DocDB 4430 ● Home areas may move from AFS to Bluearc – starting planning now EXECUTIVE SUMMARY

Minos Computing Infrastructure Arthur Kreymer 2 Computing Infrastructure – Minos 2009/12 ● CPN million locks served to Minos ( 2.3 M NovA, 7.7 M MNV ) ● Still preparing to use shared jobsub scripts – /grid/fermiapp/common/condor – Will give us access to about 100 GPCF batch slots ● Should start using the Bluearc GridFTP server for MC import. – UT Austin tested this ● GridFTP servers available to let you own your Grid job output files ● Local shared file support – OSG_WN_TMP was converted to per-job temporary areas, as originally planned – This removed /local/stage1 – We had been using /local/stage1/minosgi for 3 GB shared files, for efficiency – Temporarily restored on GPFarm nodes, discussing long term solutions GRID

Minos Computing Infrastructure Arthur Kreymer 3 DATA handling ● SAM Projects have been run this year only to test the SAM Station. – Will continue to prototype lightweight script to deliver files in a fashion similar to what was done by a SAM Project. ● New SAM-WEB interface is being deployed in NOvA and Minerva – Intended to replace the old C++ library – We should move to this as soon as convenient ● Investigating Open storage solutions for disk – Cost claimed under $100/TB vs $300/TB Bluearc – Good match to SAM Cache and Dcache – Could use to replace tape in Enstore –

Minos Computing Infrastructure Arthur Kreymer 4 Enstore/DCache ● DCache 'Chimera' database migration Feb 23 – about 18 hours, starting around 00:00 – Requires client use of encp v3_10e ( deployed for Minos ) – We see the correct file sizes for files over 2 GB – With SLF 6.2, will be able to read/write files via NFS 4.1 ● SFA – Small Files Aggregation deployed Apr 19 – Automatically bundles and tar's small files written by ENCP – Dcache can handle these transparently – No Dcache writing yet. – Requires encp v3_11

Minos Computing Infrastructure Arthur Kreymer 5 NUCOMP ● Still need to test using shared 'jobsub' script ● FTP servers allow files to be owned by user instead of minosana ● IF Beam monitoring deployed for testing (NovA era beam monitoring) – Still need to compare to our beam data logging – ● About 750 TB Bluearc disk purchased this spring – Allocation yet to be determined ● REX took over Condor support from Ryan Patterson July 26 ● We should test using shared 'jobsub' script ● REX working to make output files owned by user, not minosana – Have deployed FTP servers, we should test this ● Still need improved management of condor-exec and condor-tmp ● IF Beam monitoring deployed for testing (NovA era beam monitoring) – Running in parallel – Big Green Button prototype implemented ●

Minos Computing Infrastructure Arthur Kreymer 6 FERMIMAIL ● ● Exchange 2010 IMAP server transition is complete ● good browser and imap client compatibility ● Claimed excellent webmail interface - .fnal.gov ● A few early glitches (false reports of mail size mismatcher), mostly OK now ● Uses IMAP port 993, not old default 143. ● My alpine configuration : – I nbox Path ={ .fnal.gov:993/ssl/novalidate-cert/user=kreymer}INBOX – Menu / Setup / Configure ● + expose-hidden-config ● + disable-these-authenticators – A GSSAPI – A PLAIN

Minos Computing Infrastructure Arthur Kreymer 7 GPCF ● Moved to FCC May 1,2 ● Using KVM now, not OracleVM ● minos25 now running as a GPCF VM ● minos-slf4 = minosgpvm02 runs SLF bit kernel – Staying through May 2012, unless a security issue emerges – Option to extend to Aug 2012 if needed – 'Just In Case' we need to test SLF 4 compability

Minos Computing Infrastructure Arthur Kreymer 8 Minos Servers ● New IF Mysql replica servers being deployed – Much faster than minos-mysql1, which we have shut down – Will abandon dbmauto for farm replica, use standard Mysql replication ● SLF5 migration is complete – minos27 (code build) was upgrade Jan 19, all builds now at SLF5. – minos25 (condor master) scheduled Feb 23, moving to a VM – minos-slf4 available for fallback and testing for 3 to 6 months

Minos Computing Infrastructure Arthur Kreymer 9 Control Room ● System Management – to be assumed by FEF in place of John Urish – Lenovo demo model tested, looks fine. – FEF will take over management when new systems are bought ● ReMote Shift (RMS) – Installed in our control room, same as remote stations – Installed on the minossrv-nd system at the ND – Being used to run everything but EVD ● Need to move to SLF 6, local and RMS. – Consider converting EVD and RC to a Web service – Consider moving DQM to offline systems