Download presentation
Presentation is loading. Please wait.
Published bySteven Richards Modified over 9 years ago
2
11 March 2004Getting Ready for the Grid SAM: Tevatron Experiments Using the Grid CDF and D0 Need the Grid –Requirements, the CAF and SAM –Grid from the User Perspective Grid to Meet the Need –How SAM works –SAM usage by D0 and CDF Near Future: SAMGrid Rick St. Denis, University of Glasgow
3
11 March 2004Getting Ready for the Grid Reviews: Director’s (technically), International Finance Committee (fiscally) FNAL PAC (for its physics merit) Maximize physics output @ low Lumi –L3 output rate: 80 -> 360Hz by 06 Spokespersons’ Requirements for CDF 50% computing outside FNAL CDF needs the Grid
4
11 March 2004Getting Ready for the Grid Scale of CDF Requirements THz%offsiteCPU Speed #duals FY043.725%3GHz150 FY059.050%5GHz+360 FY0616.550%8GHz+220 6-7 sites, 100Duals each, by 2006 + 700 @FNAL
5
11 March 2004Getting Ready for the Grid CDF Computing Model Develop Analysis on desktop –Access to all CDF data from anywhere Large scale processing on batch clusters –Submission from anywhere –interactive tools: ls,top,head/tail/cat –Output to scratch space or desktop Implemented Now with CAF (not Grid standard) Exists Now
6
11 March 2004Getting Ready for the Grid Central Analysis Facility CAF is a pile of PC’s with a pile of disks. (1200 processors and 100TB) This can be implemented anywhere as dCAF: Decentralized CAF. Output of jobs can go to desktop or a scratch area Need a password for this: authentication (kerberos).
7
11 March 2004Getting Ready for the Grid Sequential Access through Metadata Metadata: SAM allows groups of files to be identified into datasets using attributes (metadata) such as production pass version or top quark mass to associate them. File Retrieval: SAM moves files to users as they request them. File Storage: SAM allows output files to be stored with new metadata.
8
11 March 2004Getting Ready for the Grid Metadata File Type: SAMMC Data File File Name: Bs_conc_4o5_3.root File ID: 2494282 File Size: 530926740 [B] File Start Time: 01/29/2004 16:00:00 File End Time: 01/29/2004 17:00:00 Application Family: generator Application Version: 1.00 Description: BsDspi_phipi MONTE CARLO Dataset 4o5 part 3 Run Number: 167634 [sam@nglas08 ~]$ sam get metadata --file=Bs_conc_4o5_3.root totalevents = 7290 Work Group: cdf Node Name: cdfsam.cnaf.infn.it dataset = BsMC-lucchesi_test html = http://www.pd.infn.it/~lucchesi
9
11 March 2004Getting Ready for the Grid Use Cases User Level MC Production –All Users have access –No data on site -> write to tape at FNAL User Level Data Access –All users have access –Selected samples automaticaly copied on site SAM provides this
10
11 March 2004Getting Ready for the Grid Functionality User selects a place to run, saying what dataset they will use System checks they can do this (privileges) User access to data at any place User output is stored on any disk or back to tape at FNAL and results are made available for transfer to any site for others to analyse.
11
11 March 2004Getting Ready for the Grid CAF Gui/CLI User Perspective Analysis program Grid TorontoKoreaItalyTaiwanFermiCAFUK CAF Gui/CLI User Perspective Only Fermilab Uses SAM Outside LabGrid Uses SAM
12
11 March 2004Getting Ready for the Grid Meeting the Needs SAM: How it works Progress in SAM CDFGridWorkshop: “Nerd’s Paradise” D0 and CDF Usage
13
11 March 2004Getting Ready for the Grid Fcdfdata016Disk/Cache Station central-analysis Daemon (smaster) Stager Daemon (stagerng) FSS (Deamon) (fss) Stager Daemon (stagerng) Disk/Cache Stager Daemon (stagerng) Stager Daemon (stagerng) Stager Daemon (stagerng)
14
11 March 2004Getting Ready for the Grid Node1 Cache Node2 Cache Node3 Cache Node4 Cache Node5 Cache Station smaster Stager stagerng Stager stagerng Stager stagerng Stager stagerng Stager stagerng A Farm: Station with Stagers and Caches
15
11 March 2004Getting Ready for the Grid What can 20 duals and 6 TB do? StreamEventsDaysInput Size Top,W/Z20.5 M10.34.5TB Hadronic B and charm 156M78.334.2TB Need to transfer 0.6 GB/min or 1 TB/Day
16
11 March 2004Getting Ready for the Grid fcdfdata016 Disks/Cache
17
11 March 2004Getting Ready for the Grid fcdfdata016 Station central-analysis smaster Disks/Cache
18
11 March 2004Getting Ready for the Grid fcdfdata016 Station central-analysis smaster Disks/Cache Stager stagerng
19
11 March 2004Getting Ready for the Grid setenv SAM_STATION chris sam dump station --disks *** BEGIN DUMP STATION chris version v3_2_2 running at nglas08 53 minutes 25 seconds, admins: jozwiak terekhov Known batch systems: lsf Default batch system: lsf No replica selection criteria There are 0 authorized transfer groups Minimum delivery is 1KB; external deliveries are unconstrained STATION DISKS: disk 7844 nglas08.fnal.gov:/sam/test9/jozwiak/dev/chris, 29947KB/20GB free disk 8064 nglas08.fnal.gov:/sam/test10/jozwiak/dev/chris, 93110KB/20GB free *** END OF STATION DUMP *** sam dump station --disks
20
11 March 2004Getting Ready for the Grid sam dump station --groups *** BEGIN DUMP STATION chris version v3_2_2 running at nglas08 57 minutes 3 seconds, admins: jozwiak terekhov Known batch systems: lsf Default batch system: lsf No replica selection criteria There are 0 authorized transfer groups Minimum delivery is 1KB; external deliveries are unconstrained AUTHORIZED GROUPS: group test: admins: jozwiak, swap policy: LRU, fair share: 0, quotas (cur/max): projects = 0/150, disk: 13054803KB/40GB, locks:0B/0KB group test1: admins: jozwiak, swap policy: LRU, fair share: 0, quotas (cur/max): projects = 0/40, disk: 1714466KB/30GB, locks:0B/0KB group test2: admins: jozwiak, swap policy: LRU, fair share: 0, quotas (cur/max): projects = 0/50, disk: 7170234KB/40GB, locks:0B/0KB *** END OF STATION DUMP *** sam dump station --groups
21
11 March 2004Getting Ready for the Grid sam dump station --projects *** BEGIN DUMP STATION chris version v3_2_2 running at nglas08 1 hours 4 minutes 49 seconds, admins: jozwiak terekhov Known batch systems: lsf Default batch system: lsf No replica selection criteria There are 0 authorized transfer groups Minimum delivery is 1KB; external deliveries are unconstrained PROJECT MANAGER: fileReleaseTO = 1 days, max files given to project: Unlimited NO PROJECTS *** END OF STATION DUMP *** sam dump station --projects
22
11 March 2004Getting Ready for the Grid fcdfdata016 sam submit --script=userscript --group=groupname --cpu-per-event= --defname= Station central-analysis smaster Disks/Cache Stager stagerng
23
11 March 2004Getting Ready for the Grid fcdfdata016 >>>>>> Starting project with the Station Master Station Master contacted, result: Started project 49008 (49008_sam_) for group test Waiting for the project to initialize... Station central-analysis smaster Disks/Cache Stager stagerng
24
11 March 2004Getting Ready for the Grid fcdfdata016 Callback from server: 'OK|Project is ready' Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster
25
11 March 2004Getting Ready for the Grid fcdfdata016 >>>>>> Submitting the job to the batch system. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster
26
11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 PSUSP
27
11 March 2004Getting Ready for the Grid sam dump station --projects *** BEGIN DUMP STATION chris version v3_2_2 running at nglas08 1 hours 12 minutes 44 seconds, admins: jozwiak terekhov Known batch systems: lsf Default batch system: lsf No replica selection criteria There are 0 authorized transfer groups Minimum delivery is 1KB; external deliveries are unconstrained PROJECT MANAGER: fileReleaseTO = 1 days, max files given to project: Unlimited STATION PROJECTS: project 49205_sam_(49205) user jozwiak.test started 01 Nov 14:08:45 UNIX pid 158400787 still wants/currently uses 5/0 files *** END OF STATION DUMP *** Sam dump station --projects
28
11 March 2004Getting Ready for the Grid sam dump project --project=49205_sam_ *** BEGIN GPM DUMP *** Input files: 1003853..1011900 1003853: sim.ztautau.1000evts.017-1442-c5.01, size=0K, unbuffered yet 1003854: sim.ztautau.1000evts.017-1442-c5.02, size=0K, unbuffered yet 1011651: sim.pmc02_01.pythia.ztautau_mb1.1av_200evts.267_1553, size=0K, unbuffered 1011900: sim.pmc02_01.pythia.ztautau_mb1.1av_200evts.276_1152, size=0K, unbuffered Cached (not buffered) files: (none) Buffered files: (none) External files with delivery problems: (none) Umer contexts (name, state, join time, nSeen): (no umers) Proc contexts (ID: name, state, join time [, current|last]): (no procs) Processes waiting for call back:(none) *** END GPM DUMP *** sam dump project –project=
29
11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 PSUSP Optimizer
30
11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 PSUSP eworker
31
11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 PSUSP eworker encp
32
11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 PSUSP eworker encp Enstore
33
11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 PSUSP eworker encp Enstore
34
11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 PSUSP eworker encp Enstore
35
11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 PSUSP eworker encp Enstore
36
11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 RUN eworker encp Enstore
37
11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 RUN eworker encp Enstore samscript.sh userscript
38
11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 RUN eworker encp Enstore samscript.sh userscript consumer
39
11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 RUN eworker encp Enstore samscript.sh userscript consumer
40
11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 RUN eworker encp Enstore samscript.sh userscript consumer
41
11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 RUN eworker encp Enstore samscript.sh userscript consumer
42
11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 RUN eworker encp Enstore samscript.sh userscript consumer
43
11 March 2004Getting Ready for the Grid SAMManager:sam Getting next input file... SAMManager:sam Project master will call back.
44
11 March 2004Getting Ready for the Grid sam dump project --project=49225_sam_ *** BEGIN GPM DUMP *** Input files: 1099393..1099756 1099417: d0g.test_file_1G_a_dev.0001_001, size=0K, unbuffered yet 1099418: d0g.test_file_1G_a_dev.0002_001, size=0K, unbuffered yet Cached (not buffered) files: (none) Buffered files: (none) External files with delivery problems: (none) Umer contexts (name, state, join time, nSeen): 36422: jozwiak(test-harness:1), active, 05 Nov 13:59:09, 31 Proc contexts (ID: name, state, join time [, current|last]): 144663: jozwiak(test-harness:1)@nglas08, wait, 05 Nov 13:59:10, 1099415 Processes waiting for call back: CID=36422: 144663@nglas08.fnal.gov:11872 (05 Nov 20:53:43) *** END GPM DUMP *** Sam dump project –project=49225_sam_
45
11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 RUN eworker encp Enstore samscript.sh userscript consumer
46
11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 RUN eworker encp Enstore samscript.sh userscript consumer
47
11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 RUN samscript.sh userscript consumer
48
11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 RUN samscript.sh userscript consumer
49
11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 RUN samscript.sh userscript consumer
50
11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 RUN samscript.sh userscript consumer rm
51
11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 RUN samscript.sh userscript consumer
52
11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 RUN samscript.sh userscript consumer Optimizer
53
11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 RUN samscript.sh userscript consumer eworker
54
11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 RUN samscript.sh userscript consumer eworker rcp
55
11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 RUN samscript.sh userscript consumer eworker rcp Other Cache
56
11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 RUN samscript.sh userscript consumer eworker rcp Other Cache
57
11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 RUN samscript.sh userscript consumer eworker rcp
58
11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 RUN samscript.sh userscript consumer
59
11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 RUN samscript.sh userscript consumer
60
11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 RUN samscript.sh userscript consumer
61
11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 RUN samscript.sh userscript consumer
62
11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 RUN samscript.sh userscript consumer
63
11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 RUN samscript.sh userscript consumer
64
11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 RUN samscript.sh userscript consumer
65
11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 RUN samscript.sh userscript consumer
66
11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 RUN samscript.sh userscript consumer
67
11 March 2004Getting Ready for the Grid fcdfdata016 Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF)
68
11 March 2004Getting Ready for the Grid fcdfdata016 Station central-analysis smaster Disks/Cache Stager stagerng Batch (LSF)
69
11 March 2004Getting Ready for the Grid fcdfdata016 sam submit…. sam run project… Station central-analysis smaster Disks/Cache Stager stagerng Batch (LSF)
70
11 March 2004Getting Ready for the Grid fcdfdata016 Station central-analysis smaster Disks/Cache Stager stagerng Batch (LSF) 52668 RUN 52675 RUN 52756 PSUSP Project pmaster Project pmaster Project pmaster samscript.sh userscript consumer eworker rcp encp Other Cache Enstore
71
11 March 2004Getting Ready for the Grid fcdfdata016 Station central-analysis smaster Disks/Cache Stager stagerng Batch (LSF) 52668 RUN 52675 RUN 52756 PSUSP Project pmaster Project pmaster Project pmaster samscript.sh userscript consumer eworker rcp encp Other Cache Enstore
72
11 March 2004Getting Ready for the Grid SAM Animation worldScenerio.html
73
11 March 2004Getting Ready for the Grid Storing Files Getting things to tape from Glasgow
74
11 March 2004Getting Ready for the Grid fcdfdata016 Disks FSS Central-analysis fss Stager stagerng
75
11 March 2004Getting Ready for the Grid sam dump fss FSS version v3_2_2 at station central-analysis running on fcdfdata016.fnal.gov 6 hours 57 minutes 34 seconds No routing (all transfers are direct) Configuration for operation retrial (count, interval/timeout) DBS contact: 3, 1 hours Opter contact: 1, 1 hours Authorization receipt:1, 1 hours Stager contact: 1, 1 hours Transfer (retrials upon timeout and upon failure): 3, 6 hours Relay (multi-stage routing only): 3, 1 hours File Storage Server Dump: Stagers are known at nodes: fcdfdata016.fnal.gov No requests ever submitted Sam dump fss
76
11 March 2004Getting Ready for the Grid fcdfdata016 sam store descrip.py --source= [--dest=/pnfs…..] Disks FSS Central-analysis fss Stager stagerng
77
11 March 2004Getting Ready for the Grid fcdfdata016 sam store descrip.py --source= [--dest=/pnfs…..] Disks FSS Central-analysis fss Stager stagerng Descrip.py Metadata Info about file Sam checks info, checks location,
78
11 March 2004Getting Ready for the Grid fcdfdata016 sam store descrip.py --source= [--dest=/pnfs…..] Disks FSS Central-analysis fss Stager stagerng eworker encp, rcp, bbftp
79
11 March 2004Getting Ready for the Grid Node from Really Far Away Disk Fss From Really Far Away Stager fcdfdata016 Fss central-analysis Stager Tmp Disk Enstore
80
11 March 2004Getting Ready for the Grid Node from Really Far Away Disk Fss Routing: fcdfdata016 Stager fcdfdata016 Fss central-analysis Stager Tmp Disk Enstore sam store enstore
81
11 March 2004Getting Ready for the Grid Node from Really Far Away Disk Fss Routing: fcdfdata016 Stager fcdfdata016 Fss central-analysis Stager Tmp Disk Enstore eworker bbftp fcdfdata016
82
11 March 2004Getting Ready for the Grid Node from Really Far Away Disk Fss From really Far away Stager fcdfdata016 Fss central-analysis Stager Tmp Disk Enstore eworker bbftp fcdfdata016
83
11 March 2004Getting Ready for the Grid Node from Really Far Away Disk Fss From really Far away Stager fcdfdata016 Fss central-analysis Stager Tmp Disk Enstore
84
11 March 2004Getting Ready for the Grid Node from Really Far Away Disk Fss From really Far away Stager fcdfdata016 Fss central-analysis Stager Tmp Disk Enstore eworker encp
85
11 March 2004Getting Ready for the Grid Node from Really Far Away Disk Fss From really Far away Stager fcdfdata016 Fss central-analysis Stager Tmp Disk Enstore eworker encp
86
11 March 2004Getting Ready for the Grid Node from Really Far Away Disk Fss From really Far away Stager fcdfdata016 Fss central-analysis Stager Tmp Disk Enstore rm
87
11 March 2004Getting Ready for the Grid Node from Really Far Away Disk Fss From really Far away Stager fcdfdata016 Fss central-analysis Stager Tmp Disk Enstore
88
11 March 2004Getting Ready for the Grid D0 Sam D0 relies entirely on SAM for analysis
89
11 March 2004Getting Ready for the Grid D0
90
11 March 2004Getting Ready for the Grid D0 Files 4000-8000 Files/Day
91
11 March 2004Getting Ready for the Grid D0 Data Volume 1TB-3TB/day
92
11 March 2004Getting Ready for the Grid D0 Files Per Month By Year 19992000200120022003 100,000 files Run II Start
93
11 March 2004Getting Ready for the Grid D0 Total Files 2.5Million Files Served
94
11 March 2004Getting Ready for the Grid D0 Data Per Month By Year 50 TB per month 19992000200120022003 Run II Start
95
11 March 2004Getting Ready for the Grid D0 Total Data Moved 700TB moved
96
11 March 2004Getting Ready for the Grid Progress in SAM: CDF All 800,000 CDF data files are in SAM Sam is in beta testing on the CDF CAF (1200 cpus): passed 20TB/Day delivery Karlsruhe uses SAM routinely Minos uses SAM for its Data Handling Steve Mrenna (Phenomenology) depositing ALPGEN files in SAM for common CDF/D0 use.
97
11 March 2004Getting Ready for the Grid Florida workshop: 11 installations in about 2 hours. Integrated with dCAF in 2 cases in 2 days. 3 in Asia, 4 in Europe 6 sites committed to summer 2004 usage of their facilities for all of CDF (mostly MC) Sam installation now: initsam cdf Follow-up on April 1. Each site has a local user support person to reduce load on core development team. Generally: Security ate 80% of the effort! Now 20!
98
11 March 2004Getting Ready for the Grid CDF
99
11 March 2004Getting Ready for the Grid Florida Workshop: After 2 Days
100
11 March 2004Getting Ready for the Grid 2TB/Day: Karlsruhe
101
11 March 2004Getting Ready for the Grid CDF Dcache on CAF ALL CDF on CAF reads 25TB/Day NonGrid Running
102
11 March 2004Getting Ready for the Grid Karlsruhe: 1500 files/Day CDF Files in a Month
103
11 March 2004Getting Ready for the Grid Karlsruhe: 5-10M Evt/Day CDF Events Transfer in a Month
104
11 March 2004Getting Ready for the Grid All CDF Files Moved by SAM 300K Files 20022003 D0: 2.5M files
105
11 March 2004Getting Ready for the Grid Total CDF Data Moved 200 TB 20022003 D0:700TB
106
11 March 2004Getting Ready for the Grid Advantage of Local Processing Karlsruhe processes 2TB/day. Rest of CDF on Central Cluster processes 25TB/day. (450 processors, 8 experiments, 10/13TB disk filled.) 5 users actively at Karlsruhe. Make ntuple for bottom and top physics for 15 people. 100 users active for rest of CDF: They pin the datasets of interest; copy new ones automatically.
107
11 March 2004Getting Ready for the Grid Dcache and SAM Dcache shapes traffic into disk: If a SAM cache is large, need to use Dcache instead of nfs mounts Dcache gives the user what is requested. 1TB gets same priority as 1GB: CDF users must send email requesting data to be staged. SAM examines consumption rate before staging next files – No EMAIL needed. SAM uses Dcache for its Caching at FNAL. This needs further work with SRM
108
11 March 2004Getting Ready for the Grid In the near term future:JIM Adding Grid Standard Tools
109
11 March 2004Getting Ready for the Grid CDF Grid Strategy 25% of CDF Computing from external resources. All CDF computing on CDF Grid by April 15: Utilize resources fully controlled by CDF: Kerberos/fbsng: dCAF + SAM October 15, 2004: JIM to capture shared resources June 2005: 50% of Computing resources external
110
11 March 2004Getting Ready for the Grid Desktop Anywhere Condor Submitter @regional centers SAM DB Condor Matchmaker @FNAL Globus GK CAF Submitter SAM Station @ each site WN Private LAN dCache June 2004 testing June 2005 required Simple JIM
111
11 March 2004Getting Ready for the Grid Detailed JIM Site Resource Selector Info Collector Info Gatherer Match Making User Interface Submission Global Job Queue Grid Client Submission User Interface Global DH Services SAM Naming Server SAM Log Server Resource Optimizer SAM DB Server RCMetaData Catalog Bookkeeping Service SAM Stager(s) SAM Station (+other servs) Data Handling Worker Nodes Grid Gateway Local Job Handler (CAF, D0MC, BS,...) JIM Advertise Local Job Handling Cluster AAA Dist.FS Info Manager XML DB server Site Conf. Glob/Loc JID map... Info Providers MDS MSS Cache Site Web Serv Grid Monitoring User Tools Flow of: jobdata meta-data
112
11 March 2004Getting Ready for the Grid Conclusions CDF has embraced the need for the Grid to achieve its physics mission SAM is working for D0 and growing in CDF
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.