Presentation is loading. Please wait.

Presentation is loading. Please wait.

11 March 2004Getting Ready for the Grid SAM: Tevatron Experiments Using the Grid CDF and D0 Need the Grid –Requirements, the CAF and SAM –Grid from the.

Similar presentations


Presentation on theme: "11 March 2004Getting Ready for the Grid SAM: Tevatron Experiments Using the Grid CDF and D0 Need the Grid –Requirements, the CAF and SAM –Grid from the."— Presentation transcript:

1

2 11 March 2004Getting Ready for the Grid SAM: Tevatron Experiments Using the Grid CDF and D0 Need the Grid –Requirements, the CAF and SAM –Grid from the User Perspective Grid to Meet the Need –How SAM works –SAM usage by D0 and CDF Near Future: SAMGrid Rick St. Denis, University of Glasgow

3 11 March 2004Getting Ready for the Grid Reviews: Director’s (technically), International Finance Committee (fiscally) FNAL PAC (for its physics merit) Maximize physics output @ low Lumi –L3 output rate: 80 -> 360Hz by 06 Spokespersons’ Requirements for CDF 50% computing outside FNAL CDF needs the Grid

4 11 March 2004Getting Ready for the Grid Scale of CDF Requirements THz%offsiteCPU Speed #duals FY043.725%3GHz150 FY059.050%5GHz+360 FY0616.550%8GHz+220 6-7 sites, 100Duals each, by 2006 + 700 @FNAL

5 11 March 2004Getting Ready for the Grid CDF Computing Model Develop Analysis on desktop –Access to all CDF data from anywhere Large scale processing on batch clusters –Submission from anywhere –interactive tools: ls,top,head/tail/cat –Output to scratch space or desktop Implemented Now with CAF (not Grid standard) Exists Now

6 11 March 2004Getting Ready for the Grid Central Analysis Facility CAF is a pile of PC’s with a pile of disks. (1200 processors and 100TB) This can be implemented anywhere as dCAF: Decentralized CAF. Output of jobs can go to desktop or a scratch area Need a password for this: authentication (kerberos).

7 11 March 2004Getting Ready for the Grid Sequential Access through Metadata Metadata: SAM allows groups of files to be identified into datasets using attributes (metadata) such as production pass version or top quark mass to associate them. File Retrieval: SAM moves files to users as they request them. File Storage: SAM allows output files to be stored with new metadata.

8 11 March 2004Getting Ready for the Grid Metadata File Type: SAMMC Data File File Name: Bs_conc_4o5_3.root File ID: 2494282 File Size: 530926740 [B] File Start Time: 01/29/2004 16:00:00 File End Time: 01/29/2004 17:00:00 Application Family: generator Application Version: 1.00 Description: BsDspi_phipi MONTE CARLO Dataset 4o5 part 3 Run Number: 167634 [sam@nglas08 ~]$ sam get metadata --file=Bs_conc_4o5_3.root totalevents = 7290 Work Group: cdf Node Name: cdfsam.cnaf.infn.it dataset = BsMC-lucchesi_test html = http://www.pd.infn.it/~lucchesi

9 11 March 2004Getting Ready for the Grid Use Cases User Level MC Production –All Users have access –No data on site -> write to tape at FNAL User Level Data Access –All users have access –Selected samples automaticaly copied on site SAM provides this

10 11 March 2004Getting Ready for the Grid Functionality User selects a place to run, saying what dataset they will use System checks they can do this (privileges) User access to data at any place User output is stored on any disk or back to tape at FNAL and results are made available for transfer to any site for others to analyse.

11 11 March 2004Getting Ready for the Grid CAF Gui/CLI User Perspective Analysis program Grid TorontoKoreaItalyTaiwanFermiCAFUK CAF Gui/CLI User Perspective Only Fermilab Uses SAM Outside LabGrid Uses SAM

12 11 March 2004Getting Ready for the Grid Meeting the Needs SAM: How it works Progress in SAM CDFGridWorkshop: “Nerd’s Paradise” D0 and CDF Usage

13 11 March 2004Getting Ready for the Grid Fcdfdata016Disk/Cache Station central-analysis Daemon (smaster) Stager Daemon (stagerng) FSS (Deamon) (fss) Stager Daemon (stagerng) Disk/Cache Stager Daemon (stagerng) Stager Daemon (stagerng) Stager Daemon (stagerng)

14 11 March 2004Getting Ready for the Grid Node1 Cache Node2 Cache Node3 Cache Node4 Cache Node5 Cache Station smaster Stager stagerng Stager stagerng Stager stagerng Stager stagerng Stager stagerng A Farm: Station with Stagers and Caches

15 11 March 2004Getting Ready for the Grid What can 20 duals and 6 TB do? StreamEventsDaysInput Size Top,W/Z20.5 M10.34.5TB Hadronic B and charm 156M78.334.2TB Need to transfer 0.6 GB/min or 1 TB/Day

16 11 March 2004Getting Ready for the Grid fcdfdata016 Disks/Cache

17 11 March 2004Getting Ready for the Grid fcdfdata016 Station central-analysis smaster Disks/Cache

18 11 March 2004Getting Ready for the Grid fcdfdata016 Station central-analysis smaster Disks/Cache Stager stagerng

19 11 March 2004Getting Ready for the Grid setenv SAM_STATION chris sam dump station --disks *** BEGIN DUMP STATION chris version v3_2_2 running at nglas08 53 minutes 25 seconds, admins: jozwiak terekhov Known batch systems: lsf Default batch system: lsf No replica selection criteria There are 0 authorized transfer groups Minimum delivery is 1KB; external deliveries are unconstrained STATION DISKS: disk 7844 nglas08.fnal.gov:/sam/test9/jozwiak/dev/chris, 29947KB/20GB free disk 8064 nglas08.fnal.gov:/sam/test10/jozwiak/dev/chris, 93110KB/20GB free *** END OF STATION DUMP *** sam dump station --disks

20 11 March 2004Getting Ready for the Grid sam dump station --groups *** BEGIN DUMP STATION chris version v3_2_2 running at nglas08 57 minutes 3 seconds, admins: jozwiak terekhov Known batch systems: lsf Default batch system: lsf No replica selection criteria There are 0 authorized transfer groups Minimum delivery is 1KB; external deliveries are unconstrained AUTHORIZED GROUPS: group test: admins: jozwiak, swap policy: LRU, fair share: 0, quotas (cur/max): projects = 0/150, disk: 13054803KB/40GB, locks:0B/0KB group test1: admins: jozwiak, swap policy: LRU, fair share: 0, quotas (cur/max): projects = 0/40, disk: 1714466KB/30GB, locks:0B/0KB group test2: admins: jozwiak, swap policy: LRU, fair share: 0, quotas (cur/max): projects = 0/50, disk: 7170234KB/40GB, locks:0B/0KB *** END OF STATION DUMP *** sam dump station --groups

21 11 March 2004Getting Ready for the Grid sam dump station --projects *** BEGIN DUMP STATION chris version v3_2_2 running at nglas08 1 hours 4 minutes 49 seconds, admins: jozwiak terekhov Known batch systems: lsf Default batch system: lsf No replica selection criteria There are 0 authorized transfer groups Minimum delivery is 1KB; external deliveries are unconstrained PROJECT MANAGER: fileReleaseTO = 1 days, max files given to project: Unlimited NO PROJECTS *** END OF STATION DUMP *** sam dump station --projects

22 11 March 2004Getting Ready for the Grid fcdfdata016 sam submit --script=userscript --group=groupname --cpu-per-event= --defname= Station central-analysis smaster Disks/Cache Stager stagerng

23 11 March 2004Getting Ready for the Grid fcdfdata016 >>>>>> Starting project with the Station Master Station Master contacted, result: Started project 49008 (49008_sam_) for group test Waiting for the project to initialize... Station central-analysis smaster Disks/Cache Stager stagerng

24 11 March 2004Getting Ready for the Grid fcdfdata016 Callback from server: 'OK|Project is ready' Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster

25 11 March 2004Getting Ready for the Grid fcdfdata016 >>>>>> Submitting the job to the batch system. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster

26 11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 PSUSP

27 11 March 2004Getting Ready for the Grid sam dump station --projects *** BEGIN DUMP STATION chris version v3_2_2 running at nglas08 1 hours 12 minutes 44 seconds, admins: jozwiak terekhov Known batch systems: lsf Default batch system: lsf No replica selection criteria There are 0 authorized transfer groups Minimum delivery is 1KB; external deliveries are unconstrained PROJECT MANAGER: fileReleaseTO = 1 days, max files given to project: Unlimited STATION PROJECTS: project 49205_sam_(49205) user jozwiak.test started 01 Nov 14:08:45 UNIX pid 158400787 still wants/currently uses 5/0 files *** END OF STATION DUMP *** Sam dump station --projects

28 11 March 2004Getting Ready for the Grid sam dump project --project=49205_sam_ *** BEGIN GPM DUMP *** Input files: 1003853..1011900 1003853: sim.ztautau.1000evts.017-1442-c5.01, size=0K, unbuffered yet 1003854: sim.ztautau.1000evts.017-1442-c5.02, size=0K, unbuffered yet 1011651: sim.pmc02_01.pythia.ztautau_mb1.1av_200evts.267_1553, size=0K, unbuffered 1011900: sim.pmc02_01.pythia.ztautau_mb1.1av_200evts.276_1152, size=0K, unbuffered Cached (not buffered) files: (none) Buffered files: (none) External files with delivery problems: (none) Umer contexts (name, state, join time, nSeen): (no umers) Proc contexts (ID: name, state, join time [, current|last]): (no procs) Processes waiting for call back:(none) *** END GPM DUMP *** sam dump project –project=

29 11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 PSUSP Optimizer

30 11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 PSUSP eworker

31 11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 PSUSP eworker encp

32 11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 PSUSP eworker encp Enstore

33 11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 PSUSP eworker encp Enstore

34 11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 PSUSP eworker encp Enstore

35 11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 PSUSP eworker encp Enstore

36 11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 RUN eworker encp Enstore

37 11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 RUN eworker encp Enstore samscript.sh userscript

38 11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 RUN eworker encp Enstore samscript.sh userscript consumer

39 11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 RUN eworker encp Enstore samscript.sh userscript consumer

40 11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 RUN eworker encp Enstore samscript.sh userscript consumer

41 11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 RUN eworker encp Enstore samscript.sh userscript consumer

42 11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 RUN eworker encp Enstore samscript.sh userscript consumer

43 11 March 2004Getting Ready for the Grid SAMManager:sam Getting next input file... SAMManager:sam Project master will call back.

44 11 March 2004Getting Ready for the Grid sam dump project --project=49225_sam_ *** BEGIN GPM DUMP *** Input files: 1099393..1099756 1099417: d0g.test_file_1G_a_dev.0001_001, size=0K, unbuffered yet 1099418: d0g.test_file_1G_a_dev.0002_001, size=0K, unbuffered yet Cached (not buffered) files: (none) Buffered files: (none) External files with delivery problems: (none) Umer contexts (name, state, join time, nSeen): 36422: jozwiak(test-harness:1), active, 05 Nov 13:59:09, 31 Proc contexts (ID: name, state, join time [, current|last]): 144663: jozwiak(test-harness:1)@nglas08, wait, 05 Nov 13:59:10, 1099415 Processes waiting for call back: CID=36422: 144663@nglas08.fnal.gov:11872 (05 Nov 20:53:43) *** END GPM DUMP *** Sam dump project –project=49225_sam_

45 11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 RUN eworker encp Enstore samscript.sh userscript consumer

46 11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 RUN eworker encp Enstore samscript.sh userscript consumer

47 11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 RUN samscript.sh userscript consumer

48 11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 RUN samscript.sh userscript consumer

49 11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 RUN samscript.sh userscript consumer

50 11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 RUN samscript.sh userscript consumer rm

51 11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 RUN samscript.sh userscript consumer

52 11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 RUN samscript.sh userscript consumer Optimizer

53 11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 RUN samscript.sh userscript consumer eworker

54 11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 RUN samscript.sh userscript consumer eworker rcp

55 11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 RUN samscript.sh userscript consumer eworker rcp Other Cache

56 11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 RUN samscript.sh userscript consumer eworker rcp Other Cache

57 11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 RUN samscript.sh userscript consumer eworker rcp

58 11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 RUN samscript.sh userscript consumer

59 11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 RUN samscript.sh userscript consumer

60 11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 RUN samscript.sh userscript consumer

61 11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 RUN samscript.sh userscript consumer

62 11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 RUN samscript.sh userscript consumer

63 11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 RUN samscript.sh userscript consumer

64 11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 RUN samscript.sh userscript consumer

65 11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 RUN samscript.sh userscript consumer

66 11 March 2004Getting Ready for the Grid fcdfdata016 Job is submitted to queue. Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF) 52554 RUN samscript.sh userscript consumer

67 11 March 2004Getting Ready for the Grid fcdfdata016 Station central-analysis smaster Disks/Cache Stager stagerng Project pmaster Batch (LSF)

68 11 March 2004Getting Ready for the Grid fcdfdata016 Station central-analysis smaster Disks/Cache Stager stagerng Batch (LSF)

69 11 March 2004Getting Ready for the Grid fcdfdata016 sam submit…. sam run project… Station central-analysis smaster Disks/Cache Stager stagerng Batch (LSF)

70 11 March 2004Getting Ready for the Grid fcdfdata016 Station central-analysis smaster Disks/Cache Stager stagerng Batch (LSF) 52668 RUN 52675 RUN 52756 PSUSP Project pmaster Project pmaster Project pmaster samscript.sh userscript consumer eworker rcp encp Other Cache Enstore

71 11 March 2004Getting Ready for the Grid fcdfdata016 Station central-analysis smaster Disks/Cache Stager stagerng Batch (LSF) 52668 RUN 52675 RUN 52756 PSUSP Project pmaster Project pmaster Project pmaster samscript.sh userscript consumer eworker rcp encp Other Cache Enstore

72 11 March 2004Getting Ready for the Grid SAM Animation worldScenerio.html

73 11 March 2004Getting Ready for the Grid Storing Files Getting things to tape from Glasgow

74 11 March 2004Getting Ready for the Grid fcdfdata016 Disks FSS Central-analysis fss Stager stagerng

75 11 March 2004Getting Ready for the Grid sam dump fss FSS version v3_2_2 at station central-analysis running on fcdfdata016.fnal.gov 6 hours 57 minutes 34 seconds No routing (all transfers are direct) Configuration for operation retrial (count, interval/timeout) DBS contact: 3, 1 hours Opter contact: 1, 1 hours Authorization receipt:1, 1 hours Stager contact: 1, 1 hours Transfer (retrials upon timeout and upon failure): 3, 6 hours Relay (multi-stage routing only): 3, 1 hours File Storage Server Dump: Stagers are known at nodes: fcdfdata016.fnal.gov No requests ever submitted Sam dump fss

76 11 March 2004Getting Ready for the Grid fcdfdata016 sam store descrip.py --source= [--dest=/pnfs…..] Disks FSS Central-analysis fss Stager stagerng

77 11 March 2004Getting Ready for the Grid fcdfdata016 sam store descrip.py --source= [--dest=/pnfs…..] Disks FSS Central-analysis fss Stager stagerng Descrip.py Metadata Info about file Sam checks info, checks location,

78 11 March 2004Getting Ready for the Grid fcdfdata016 sam store descrip.py --source= [--dest=/pnfs…..] Disks FSS Central-analysis fss Stager stagerng eworker encp, rcp, bbftp

79 11 March 2004Getting Ready for the Grid Node from Really Far Away Disk Fss From Really Far Away Stager fcdfdata016 Fss central-analysis Stager Tmp Disk Enstore

80 11 March 2004Getting Ready for the Grid Node from Really Far Away Disk Fss Routing: fcdfdata016 Stager fcdfdata016 Fss central-analysis Stager Tmp Disk Enstore sam store enstore

81 11 March 2004Getting Ready for the Grid Node from Really Far Away Disk Fss Routing: fcdfdata016 Stager fcdfdata016 Fss central-analysis Stager Tmp Disk Enstore eworker bbftp fcdfdata016

82 11 March 2004Getting Ready for the Grid Node from Really Far Away Disk Fss From really Far away Stager fcdfdata016 Fss central-analysis Stager Tmp Disk Enstore eworker bbftp fcdfdata016

83 11 March 2004Getting Ready for the Grid Node from Really Far Away Disk Fss From really Far away Stager fcdfdata016 Fss central-analysis Stager Tmp Disk Enstore

84 11 March 2004Getting Ready for the Grid Node from Really Far Away Disk Fss From really Far away Stager fcdfdata016 Fss central-analysis Stager Tmp Disk Enstore eworker encp

85 11 March 2004Getting Ready for the Grid Node from Really Far Away Disk Fss From really Far away Stager fcdfdata016 Fss central-analysis Stager Tmp Disk Enstore eworker encp

86 11 March 2004Getting Ready for the Grid Node from Really Far Away Disk Fss From really Far away Stager fcdfdata016 Fss central-analysis Stager Tmp Disk Enstore rm

87 11 March 2004Getting Ready for the Grid Node from Really Far Away Disk Fss From really Far away Stager fcdfdata016 Fss central-analysis Stager Tmp Disk Enstore

88 11 March 2004Getting Ready for the Grid D0 Sam D0 relies entirely on SAM for analysis

89 11 March 2004Getting Ready for the Grid D0

90 11 March 2004Getting Ready for the Grid D0 Files 4000-8000 Files/Day

91 11 March 2004Getting Ready for the Grid D0 Data Volume 1TB-3TB/day

92 11 March 2004Getting Ready for the Grid D0 Files Per Month By Year 19992000200120022003 100,000 files Run II Start

93 11 March 2004Getting Ready for the Grid D0 Total Files 2.5Million Files Served

94 11 March 2004Getting Ready for the Grid D0 Data Per Month By Year 50 TB per month 19992000200120022003 Run II Start

95 11 March 2004Getting Ready for the Grid D0 Total Data Moved 700TB moved

96 11 March 2004Getting Ready for the Grid Progress in SAM: CDF All 800,000 CDF data files are in SAM Sam is in beta testing on the CDF CAF (1200 cpus): passed 20TB/Day delivery Karlsruhe uses SAM routinely Minos uses SAM for its Data Handling Steve Mrenna (Phenomenology) depositing ALPGEN files in SAM for common CDF/D0 use.

97 11 March 2004Getting Ready for the Grid Florida workshop: 11 installations in about 2 hours. Integrated with dCAF in 2 cases in 2 days. 3 in Asia, 4 in Europe 6 sites committed to summer 2004 usage of their facilities for all of CDF (mostly MC) Sam installation now: initsam cdf Follow-up on April 1. Each site has a local user support person to reduce load on core development team. Generally: Security ate 80% of the effort! Now 20!

98 11 March 2004Getting Ready for the Grid CDF

99 11 March 2004Getting Ready for the Grid Florida Workshop: After 2 Days

100 11 March 2004Getting Ready for the Grid 2TB/Day: Karlsruhe

101 11 March 2004Getting Ready for the Grid CDF Dcache on CAF ALL CDF on CAF reads 25TB/Day NonGrid Running

102 11 March 2004Getting Ready for the Grid Karlsruhe: 1500 files/Day CDF Files in a Month

103 11 March 2004Getting Ready for the Grid Karlsruhe: 5-10M Evt/Day CDF Events Transfer in a Month

104 11 March 2004Getting Ready for the Grid All CDF Files Moved by SAM 300K Files 20022003 D0: 2.5M files

105 11 March 2004Getting Ready for the Grid Total CDF Data Moved 200 TB 20022003 D0:700TB

106 11 March 2004Getting Ready for the Grid Advantage of Local Processing Karlsruhe processes 2TB/day. Rest of CDF on Central Cluster processes 25TB/day. (450 processors, 8 experiments, 10/13TB disk filled.) 5 users actively at Karlsruhe. Make ntuple for bottom and top physics for 15 people. 100 users active for rest of CDF: They pin the datasets of interest; copy new ones automatically.

107 11 March 2004Getting Ready for the Grid Dcache and SAM Dcache shapes traffic into disk: If a SAM cache is large, need to use Dcache instead of nfs mounts Dcache gives the user what is requested. 1TB gets same priority as 1GB: CDF users must send email requesting data to be staged. SAM examines consumption rate before staging next files – No EMAIL needed. SAM uses Dcache for its Caching at FNAL. This needs further work with SRM

108 11 March 2004Getting Ready for the Grid In the near term future:JIM Adding Grid Standard Tools

109 11 March 2004Getting Ready for the Grid CDF Grid Strategy 25% of CDF Computing from external resources. All CDF computing on CDF Grid by April 15: Utilize resources fully controlled by CDF: Kerberos/fbsng: dCAF + SAM October 15, 2004: JIM to capture shared resources June 2005: 50% of Computing resources external

110 11 March 2004Getting Ready for the Grid Desktop Anywhere Condor Submitter @regional centers SAM DB Condor Matchmaker @FNAL Globus GK CAF Submitter SAM Station @ each site WN Private LAN dCache June 2004 testing June 2005 required Simple JIM

111 11 March 2004Getting Ready for the Grid Detailed JIM Site Resource Selector Info Collector Info Gatherer Match Making User Interface Submission Global Job Queue Grid Client Submission User Interface Global DH Services SAM Naming Server SAM Log Server Resource Optimizer SAM DB Server RCMetaData Catalog Bookkeeping Service SAM Stager(s) SAM Station (+other servs) Data Handling Worker Nodes Grid Gateway Local Job Handler (CAF, D0MC, BS,...) JIM Advertise Local Job Handling Cluster AAA Dist.FS Info Manager XML DB server Site Conf. Glob/Loc JID map... Info Providers MDS MSS Cache Site Web Serv Grid Monitoring User Tools Flow of: jobdata meta-data

112 11 March 2004Getting Ready for the Grid Conclusions CDF has embraced the need for the Grid to achieve its physics mission SAM is working for D0 and growing in CDF


Download ppt "11 March 2004Getting Ready for the Grid SAM: Tevatron Experiments Using the Grid CDF and D0 Need the Grid –Requirements, the CAF and SAM –Grid from the."

Similar presentations


Ads by Google