YuChul Yang Oct KPS 2006 가을 EXCO, 대구 The Current Status of KorCAF and CDF Grid 양유철, 장성현, 미안 사비르 아메드, 칸 아딜, 모하메드 아즈말, 공대정, 김지은, 서준석, 김동희 ( 경북대학교, 물리학과 ) 이영장, 정지은, 문창성, 김현수, 전은주, 주경광, 김수봉 ( 서울대학교, 물리학과 ) 고정환, 이재승, 유인태 ( 성균관대학교, 물리학과 ) 조기현 ( 슈퍼컴퓨팅 센터, KISTI)
YuChul Yang Oct KPS 2006 가을 EXCO, 대구 Introduction to CDF Computing Developed in to respond to experiments greatly increased need for computational and data handling resources to deal with RunII One of the first large-scale cluster approaches to user computing for general analysis. Greatly increased CPU power & data to physicists. CDF Grid via CAF, DCAF, SAM and SAMGrid ☞ DCAF(DeCentralized Analysis Farm) ☞ SAM (Sequential Access through Metadata) – Real data Handling System ☞ SAMGrid – combination of SAM and JIM (Job Information Management) system
YuChul Yang Oct KPS 2006 가을 EXCO, 대구 Outline CAF Central Analysis Farm : A large central computing resource based on Linux cluster farms with a simple job management scheme at Fermilab. DCAF Decentralized CDF Analysis Farm : We extended the above model, including its command line interface and GUI, to manage and work with remote resources Grid We are now in the process of adapting and converting out work flow to the Grid
YuChul Yang Oct KPS 2006 가을 EXCO, 대구 Environment on CAF All basic CDF software pre-installed on CAF Authentication via Kerberos ☞ Jobs are run via mapped accounts with authentication of actual user through special principal ☞ Database, data handling remote usres ID passed on through lookup of actual user via special principal User’s analysis environment comes over in tarball - no need to pre-register or submit only certain jobs. Job returns results to user via secure ftp/rcp controlled by user script and principal
YuChul Yang Oct KPS 2006 가을 EXCO, 대구 In 2006, about 50% of analysis farm outside of FNAL Distributed clusters in Korea, Taiwan, Japan, Italy, Germany, Spain, UK, USA and Canada
YuChul Yang Oct KPS 2006 가을 EXCO, 대구 Current DCAF approach Cluster technology (CAF = “Central analysis farm”) extended to remote site (DCAFs = Decentralized CDF analysis Farm) Multiple batch systems supported : converting from FBSNG system to Condor on all DCAFs SAM data handling system required for offsite DCAFs
YuChul Yang Oct KPS 2006 가을 EXCO, 대구 (2006/Aug) Current CDF Dedicated Resources
YuChul Yang Oct KPS 2006 가을 EXCO, 대구 TYPECPURAMHDDNO head Node cluster46.knu.ac.kr AMD MP2000 * 22G80G1 sam station cluster67.knu.ac.kr Pentium 4 2.4G1G80G1 submission node cluster52.knu.ac.kr Pentium 4 2.4G1G80G1 worker node cluster39~cluster73(21) cluster102~cluster114(13) Cluster122~cluster130(9) Cluster137~cluster139(3) (updated 2006) AMD MP2000 * 22G80G4 AMD MP2200 * 21G80G2 AMD MP2800 * 22G80G11 AMD MP2800 * 22G250G2 Pentium 4 2.4G1G80G15 Xeon 3.G * 22G80G9 Xeon 3.G * 22G80G3 Total 81 CPU (179.9GHz)79G4260G49 Detail of KorCAF resources
YuChul Yang Oct KPS 2006 가을 EXCO, 대구 Storage status CPURAMHDDNO Current0.6TB Opteron dual2G4TB1 Zeon dual1G1TB1 Total5.6TB2 Working on CondorCAF batch system cdfsoft Installed products : , , 4.8.4, 4.9.1, 4.9.1hpt3, 5.2.0, 5.3.0, 5.3.1, 5.3.3, 5.3.3_nt, 5.3.4, development Installed binary products: , 5.3.1, 5.3.3, 5.3.3_nt, 5.3.4
YuChul Yang Oct KPS 2006 가을 EXCO, 대구 CAF gui & Monitoring System Select farm Process type Submit status User script, I/O file location Data access
YuChul Yang Oct KPS 2006 가을 EXCO, 대구 Functionality for User (KorCAF) FeatureStatus Self-contained user interfaceYes Runs arbitrary user codeYes Automatic identity managementYes Network delivery of resultsYes Input and output data handlingYes Batch system priority management Yes Automatic choice of farmNot yet Negotiation of resourcesNot yet Runs on arbitrary grid resourcesNot yet Grid
YuChul Yang Oct KPS 2006 가을 EXCO, 대구 Total CDF Computing Requirements Input ConditionsResulting Requirements Fiscal Year Int LEvtsPeak rateAnaRecoDiskTape I/OTape Vol fb -1 x 10 9 MB/sHzTHz PBGB/sPB /04.9 Analysis CPU, disk, tape needs scale with number of events. FNAL portion of analysis CPU assumed at roughly 50% beyond 2005.
YuChul Yang Oct KPS 2006 가을 EXCO, 대구 Movement to Grid It’s the world wide trend for HEP experiment. Need to take advantage of global innovations and resources. CDF still has a lot of data to be analyzed. USE Grid Cannot continue to expand dedicate resource
YuChul Yang Oct KPS 2006 가을 EXCO, 대구 Activities for CDF Grid Testing various approaches to using Grid resources (Grid3/OSG and LCG) Adapt the CAF infrastructure to run on top of the Grid using Condor glide-ins (GlideCAF) Use direct submission via CAF interface to OSG and LCG Use SAMGrid/JIM sendboxing as an alternate way to deliver experiment + user software Combine DCAFs with Grid resources
YuChul Yang Oct KPS 2006 가을 EXCO, 대구 Conclusions CDF has successfully deployed a global computing environment (DCAFs) for user analysis. A large portion (50%) of the total CPU resources of the experiment are now provided by offsite through a combination of DCAFs and other clusters. And KorCAF (DCAF in Korea) working on Condor batch system. Active work is in progress to build bridges to true Grid methods & protocols provide a path to the future.