2012-02-16 Minos Computing Infrastructure Arthur Kreymer

Presentation transcript:

Minos Computing Infrastructure Arthur Kreymer 1
EXECUTIVE SUMMARY
● Young Minos issues:
  – /local/stage1/minosgli temporary reprieve
  – need grid shared area
  – Retiring most AFS data areas, due to old hardware
● FermiMail (IMAP) migration underway Jan thru March 2012
● SLF5 migration – minos27 (builds) done Jan 19, minos25 (Condor) last
● /minos/data 93% full, should clean up or expand
● Planned outages
  – Feb 16 (Fermigrid)
  – Feb 23 (minos25/condor, Dcache Chimera upgrade)
  – TBD – AFS upgrade to support Fermilab tokens
  – TBD – Bluearc firmware upgrade
● Control Room - System Management by FEF coming - CD DocDB 4430
● Home areas may move from AFS to Bluearc – starting planning now
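The "/minos/data 93% full" item suggests a simple fill-level check. The following is a minimal sketch, not the monitoring actually used for Minos: the function names and the 90% threshold are assumptions chosen for illustration, and the percentage can differ slightly from what `df` reports because of reserved blocks.

```python
import shutil


def percent_used(path: str) -> float:
    """Return the percentage of the file system holding `path` that is in use."""
    usage = shutil.disk_usage(path)
    if usage.total == 0:
        # Some pseudo file systems report zero size; treat them as empty.
        return 0.0
    return 100.0 * usage.used / usage.total


def needs_cleanup(path: str, threshold: float = 90.0) -> bool:
    """True when the fill level is at or above `threshold` percent (assumed value)."""
    return percent_used(path) >= threshold


if __name__ == "__main__":
    # "/minos/data" is the area named on the slide; "." is used here so the
    # sketch runs on any machine.
    print(f"{percent_used('.'):.1f}% used, cleanup needed: {needs_cleanup('.')}")
```

Running this from cron against the real mount point would give an early warning before the area fills up.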

Minos Computing Infrastructure Arthur Kreymer 2 Computing Infrastructure – Minos 2009/12
GRID
● CPN million locks served to Minos ( 1.6 M NOvA, 5.3 M MNV )
● Still preparing to use shared jobsub scripts
  – /grid/fermiapp/common/condor
  – Will give us access to about 100 GPCF batch slots
● Should start using the Bluearc GridFTP server for MC import.
  – UT Austin tested this
● GridFTP servers available to let you own your Grid job output files
● Local shared file support
  – OSG_WN_TMP was converted to per-job temporary areas, as originally planned
  – This removed /local/stage1
  – We had been using /local/stage1/minosgi for 3 GB shared files, for efficiency
  – Temporarily restored on GPFarm nodes, discussing long term solutions
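With OSG_WN_TMP now pointing at a per-job temporary area, a job can no longer rely on files left behind by earlier jobs on the same node. A minimal sketch of how a job wrapper might use the area follows. The `job_scratch` helper and its prefix are hypothetical; only the `OSG_WN_TMP` environment variable comes from the slide, and the fallback to the system temp directory is an assumption for running outside the grid.

```python
import os
import shutil
import tempfile
from contextlib import contextmanager


@contextmanager
def job_scratch(prefix: str = "minos_job_"):
    """Create a private working directory under OSG_WN_TMP and remove it on exit."""
    # Assumption: fall back to the system temp directory when not on a worker node.
    base = os.environ.get("OSG_WN_TMP") or tempfile.gettempdir()
    path = tempfile.mkdtemp(prefix=prefix, dir=base)
    try:
        yield path
    finally:
        # Nothing persists between jobs, so clean up unconditionally.
        shutil.rmtree(path, ignore_errors=True)


if __name__ == "__main__":
    with job_scratch() as workdir:
        print("working in", workdir)
```

Because the directory disappears with the job, large shared inputs such as the 3 GB files mentioned above would have to be fetched again by every job, which is the efficiency concern raised on the slide.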

Minos Computing Infrastructure Arthur Kreymer 3
DATA handling
● SAM Projects have been run this year only to test the SAM Station.
  – Will continue to prototype lightweight script to deliver files in a fashion similar to what was done by a SAM Project.
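The slide does not describe the prototype script, so the following is only one way such a lightweight delivery loop could look: files are fetched one at a time, handed to the consumer, and released afterwards. `deliver_files` and the `fetch` callback are hypothetical names; a real `fetch` would wrap whatever copy tool the site provides.

```python
import os
import shutil
from typing import Callable, Iterable, Iterator


def deliver_files(
    names: Iterable[str],
    fetch: Callable[[str, str], str],
    workdir: str,
) -> Iterator[str]:
    """Yield local paths one at a time, deleting each file after it is consumed.

    `fetch(name, workdir)` must copy the named file into `workdir` and return
    the local path. It is supplied by the caller (hypothetical interface).
    """
    for name in names:
        local = fetch(name, workdir)
        try:
            yield local
        finally:
            # Release the file so the scratch area holds one file at a time.
            if os.path.exists(local):
                os.remove(local)


if __name__ == "__main__":
    import tempfile

    source = tempfile.mkdtemp()
    scratch = tempfile.mkdtemp()
    for n in ("run1.root", "run2.root"):
        open(os.path.join(source, n), "w").close()

    def local_copy(name: str, workdir: str) -> str:
        # Stand-in for a real transfer command.
        return shutil.copy(os.path.join(source, name), workdir)

    for path in deliver_files(["run1.root", "run2.root"], local_copy, scratch):
        print("processing", path)
```

Keeping the transfer behind a callback means the same loop can be tested with local copies, as here, before being pointed at real storage.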

Minos Computing Infrastructure Arthur Kreymer 4
Enstore/DCache
● DAQ pools rebalancing was completed, all files are on disk
● We corrected file families of several files on Feb 7
  – Required use of encp v3_10e
  – 9 months of fardet_data incorrectly in family
  – 3K dogwood5 daikon_07 files were incorrectly in fardet_data
● DCache 'Chimera' database migration Feb 23
  – Will take about 18 hours, starting around 00:00
  – Requires client use of encp v3_10e ( deployed for Minos )
  – No change to DCache interface is required.
  – We will see the correct file sizes for files over 2 GB
  – With SLF 6.2, will be able to read/write files via NFS 4.1

Minos Computing Infrastructure Arthur Kreymer 5
NUCOMP
● REX took over Condor support from Ryan Patterson July 26
● Still need to test using shared 'jobsub' script
● REX working to make output files owned by user, not minosana
  – Have deployed FTP servers, we should test this
● Still need improved management of condor-exec and condor-tmp
● IF Beam monitoring deployed for testing (NOvA era beam monitoring)
  – Running in parallel
  – Still need to compare to our beam data logging
  – Big Green Button prototype implemented

Minos Computing Infrastructure Arthur Kreymer 6
FERMIMAIL
● Exchange 2010 IMAP server ( not Exchange 2007 )
● good browser and IMAP client compatibility
● Claimed excellent webmail interface
● Exchange 2007 migration delayed July → Oct ( from .fnal.gov )
● Imap1/2/3 will follow

Minos Computing Infrastructure Arthur Kreymer 7
GPCF
● Still moving to FCC, pending resolution of OracleVM 3 issues.
● minos-slf4 = minosgpvm02 runs SLF bit kernel
  – Staying through May 2012, unless a security issue emerges
  – Option to extend to Aug 2012 if needed
  – 'Just In Case' we need to test SLF 4 compatibility

Minos Computing Infrastructure Arthur Kreymer 8
Minos Servers
● Waiting for new IF MySQL replica server ( pending disk installs )
  – Much faster than minos-mysql1, which we have shut down
  – Will abandon dbmauto for farm replica, use standard MySQL replication
● SLF5 migration
  – minos27 (code build) was upgraded Jan 19, all builds now at SLF5.
  – minos25 (condor master) scheduled Feb 23, moving to a VM
  – minos-slf4 available for fallback and testing for 3 to 6 months

Minos Computing Infrastructure Arthur Kreymer 9
Control Room
● System Management – to be assumed by FEF in place of John Urish
  – Evaluating multi-monitor capable workstations
  – Will move to newly installed hosts one at a time
● ReMote Shift (RMS)
  – Installed in our control room, same as remote stations
  – Installed on the minossrv-nd system at the ND
  – Being used to run everything but EVD

Minos Computing Infrastructure Arthur Kreymer 10
Near Detector Task Force
● Minerva fully engaged in hardware, not so much in software
  – Calibration
  – Batch processing
  – Data Quality
  – Data Handling
  – Simulation
  – Code maintenance
  – and more – we are just getting started