4 March 2004GridPP 9th Collaboration Meeting SAMGrid:JIM and CDF Development CDF Accepts the Need for the Grid –Requirements How to Meet the Need –Status.

Slides:



Advertisements
Similar presentations
GridPP July 2003Stefan StonjekSlide 1 SAM middleware components Stefan Stonjek University of Oxford 7 th GridPP Meeting 02 nd July 2003 Oxford.
Advertisements

Physics with SAM-Grid Stefan Stonjek University of Oxford 6 th GridPP Meeting 30 th January 2003 Coseners House.
SAM-Grid Status Core SAM development SAM-Grid architecture Progress Future work.
13 March 2002CDF-Grid Meeting at CERN CDF and the Grid Requirements and Anti-Requirements CDF-o-Centric View The Project The manpower Conclusion: CDF/D0.
Rod Walker IC 13th March 2002 SAM-Grid Middleware  SAM.  JIM.  RunJob.  Conclusions. - Rod Walker,ICL.
David Adams ATLAS DIAL Distributed Interactive Analysis of Large datasets David Adams BNL March 25, 2003 CHEP 2003 Data Analysis Environment and Visualization.
GRID DATA MANAGEMENT PILOT (GDMP) Asad Samar (Caltech) ACAT 2000, Fermilab October , 2000.
The Sam-Grid project Gabriele Garzoglio ODS, Computing Division, Fermilab PPDG, DOE SciDAC ACAT 2002, Moscow, Russia June 26, 2002.
Oxford Jan 2005 RAL Computing 1 RAL Computing Implementing the computing model: SAM and the Grid Nick West.
JIM Deployment for the CDF Experiment M. Burgon-Lyon 1, A. Baranowski 2, V. Bartsch 3,S. Belforte 4, G. Garzoglio 2, R. Herber 2, R. Illingworth 2, R.
18 Feb 2004Computing Division Project Status Report1 Project Status Report : SAMGrid  SAMGrid Management, Status, Operations – Merritt  SAMGrid Development.
The SAM-Grid Fabric Services Gabriele Garzoglio (for the SAM-Grid team) Computing Division Fermilab.
F Run II Experiments and the Grid Amber Boehnlein Fermilab September 16, 2005.
QCDgrid Technology James Perry, George Beckett, Lorna Smith EPCC, The University Of Edinburgh.
SAMGrid – A fully functional computing grid based on standard technologies Igor Terekhov for the JIM team FNAL/CD/CCF.
Alexandre A. P. Suaide VI DOSAR workshop, São Paulo, 2005 STAR grid activities and São Paulo experience.
HEP Experiment Integration within GriPhyN/PPDG/iVDGL Rick Cavanaugh University of Florida DataTAG/WP4 Meeting 23 May, 2002.
S. Veseli - SAM Project Status SAMGrid Developments – Part I Siniša Veseli CD/D0CA.
The SAMGrid Data Handling System Outline:  What Is SAMGrid?  Use Cases for SAMGrid in Run II Experiments  Current Operational Load  Stress Testing.
Grid Job and Information Management (JIM) for D0 and CDF Gabriele Garzoglio for the JIM Team.
03/27/2003CHEP20031 Remote Operation of a Monte Carlo Production Farm Using Globus Dirk Hufnagel, Teela Pulliam, Thomas Allmendinger, Klaus Honscheid (Ohio.
CDF data production models 1 Data production models for the CDF experiment S. Hou for the CDF data production team.
Building a distributed software environment for CDF within the ESLEA framework V. Bartsch, M. Lancaster University College London.
11 March 2004Getting Ready for the Grid SAM: Tevatron Experiments Using the Grid CDF and D0 Need the Grid –Requirements, the CAF and SAM –Grid from the.
CDF Grid Status Stefan Stonjek 05-Jul th GridPP meeting / Durham.
INFSO-RI Enabling Grids for E-sciencE Logging and Bookkeeping and Job Provenance Services Ludek Matyska (CESNET) on behalf of the.
Deploying and Operating the SAM-Grid: lesson learned Gabriele Garzoglio for the SAM-Grid Team Sep 28, 2004.
3rd June 2004 CDF Grid SAM:Metadata and Middleware Components Mòrag Burgon-Lyon University of Glasgow.
QCDGrid Progress James Perry, Andrew Jackson, Stephen Booth, Lorna Smith EPCC, The University Of Edinburgh.
CHEP 2003Stefan Stonjek1 Physics with SAM-Grid Stefan Stonjek University of Oxford CHEP th March 2003 San Diego.
CHEP'07 September D0 data reprocessing on OSG Authors Andrew Baranovski (Fermilab) for B. Abbot, M. Diesburg, G. Garzoglio, T. Kurca, P. Mhashilkar.
1 st December 2003 JIM for CDF 1 JIM and SAMGrid for CDF Mòrag Burgon-Lyon University of Glasgow.
A Design for KCAF for CDF Experiment Kihyeon Cho (CHEP, Kyungpook National University) and Jysoo Lee (KISTI, Supercomputing Center) The International Workshop.
SAMGrid as a Stakeholder of FermiGrid Valeria Bartsch Computing Division Fermilab.
Instrumentation of the SAM-Grid Gabriele Garzoglio CSC 426 Research Proposal.
GridPP18 Glasgow Mar 07 DØ – SAMGrid Where’ve we come from, and where are we going? Evolution of a ‘long’ established plan Gavin Davies Imperial College.
Status of the LHCb MC production system Andrei Tsaregorodtsev, CPPM, Marseille DataGRID France workshop, Marseille, 24 September 2002.
November SC06 Tampa F.Fanzago CRAB a user-friendly tool for CMS distributed analysis Federica Fanzago INFN-PADOVA for CRAB team.
The SAM-Grid and the use of Condor-G as a grid job management middleware Gabriele Garzoglio for the SAM-Grid Team Fermilab, Computing Division.
22 nd September 2003 JIM for CDF 1 JIM and SAMGrid for CDF Mòrag Burgon-Lyon University of Glasgow.
São Paulo Regional Analysis Center SPRACE Status Report 22/Aug/2006 SPRACE Status Report 22/Aug/2006.
Dzero MC production on LCG How to live in two worlds (SAM and LCG)
16 September GridPP 5 th Collaboration Meeting D0&CDF SAM and The Grid Act I: Grid, Sam and Run II Rick St. Denis – Glasgow University Act II: Sam4CDF.
1 DØ Grid PP Plans – SAM, Grid, Ceiling Wax and Things Iain Bertram Lancaster University Monday 5 November 2001.
CEOS WGISS-21 CNES GRID related R&D activities Anne JEAN-ANTOINE PICCOLO CEOS WGISS-21 – Budapest – 2006, 8-12 May.
What is SAM-Grid? Job Handling Data Handling Monitoring and Information.
Data reprocessing for DZero on the SAM-Grid Gabriele Garzoglio for the SAM-Grid Team Fermilab, Computing Division.
The GriPhyN Planning Process All-Hands Meeting ISI 15 October 2001.
GridPP11 Liverpool Sept04 SAMGrid GridPP11 Liverpool Sept 2004 Gavin Davies Imperial College London.
Run II Review Closeout 15 Sept., 2005 FNAL. Thanks! …all the hard work from the reviewees –And all the speakers …hospitality of our hosts Good progress.
1 Andrea Sciabà CERN Critical Services and Monitoring - CMS Andrea Sciabà WLCG Service Reliability Workshop 26 – 30 November, 2007.
19 February 2004SAMGrid Project Review SAMGrid: Future Plans CDF Accepts the Need for the Grid –Requirements D0 Relies on the Grid –Requirements How to.
6/23/2005 R. GARDNER OSG Baseline Services 1 OSG Baseline Services In my talk I’d like to discuss two questions:  What capabilities are we aiming for.
Outline: Status: Report after one month of Plans for the future (Preparing Summer -Fall 2003) (CNAF): Update A. Sidoti, INFN Pisa and.
SAM Overview (training session) for CDF Users Doug Benjamin Duke University Krzysztof Genser Fermilab/CD.
DCAF(DeCentralized Analysis Farm) for CDF experiments HAN DaeHee*, KWON Kihwan, OH Youngdo, CHO Kihyeon, KONG Dae Jung, KIM Minsuk, KIM Jieun, MIAN shabeer,
Super Scaling PROOF to very large clusters Maarten Ballintijn, Kris Gulbrandsen, Gunther Roland / MIT Rene Brun, Fons Rademakers / CERN Philippe Canal.
Super Computing 2000 DOE SCIENCE ON THE GRID Storage Resource Management For the Earth Science Grid Scientific Data Management Research Group NERSC, LBNL.
Eileen Berman. Condor in the Fermilab Grid FacilitiesApril 30, 2008  Fermi National Accelerator Laboratory is a high energy physics laboratory outside.
Adapting SAM for CDF Gabriele Garzoglio Fermilab/CD/CCF/MAP CHEP 2003.
April 25, 2006Parag Mhashilkar, Fermilab1 Resource Selection in OSG & SAM-On-The-Fly Parag Mhashilkar Fermi National Accelerator Laboratory Condor Week.
Distributed Physics Analysis Past, Present, and Future Kaushik De University of Texas at Arlington (ATLAS & D0 Collaborations) ICHEP’06, Moscow July 29,
D0 File Replication PPDG SLAC File replication workshop 9/20/00 Vicky White.
Grid Activities in CMS Asad Samar (Caltech) PPDG meeting, Argonne July 13-14, 2000.
INFSO-RI Enabling Grids for E-sciencE File Transfer Software and Service SC3 Gavin McCance – JRA1 Data Management Cluster Service.
CDF SAM Deployment Status Doug Benjamin Duke University (for the CDF Data Handling Group)
Apr. 25, 2002Why DØRAC? DØRAC FTFM, Jae Yu 1 What do we want DØ Regional Analysis Centers (DØRAC) do? Why do we need a DØRAC? What do we want a DØRAC do?
Sept Wyatt Merritt Run II Computing Review1 Status of SAMGrid / Future Plans for SAMGrid  Brief introduction to SAMGrid  Status and deployments.
Project Status Report : SAMGrid
Status and plans for bookkeeping system and production tools
Presentation transcript:

4 March 2004GridPP 9th Collaboration Meeting SAMGrid:JIM and CDF Development CDF Accepts the Need for the Grid –Requirements How to Meet the Need –Status of SAMGrid for CDF Rick St. Denis, University of Glasgow

4 March 2004GridPP 9th Collaboration Meeting Director’s review, International Finance Committee: 50% computing outside FNAL Maximize physics low Lumi –L3 output rate: 80 -> 360Hz by 06 Spokespersons’ Requirements for CDF CDFGrid supported by FNAL PAC CDF needs the Grid

4 March 2004GridPP 9th Collaboration Meeting Scale of CDF Requirements THz%offsiteCPU Speed #duals FY %3GHz150 FY %5GHz+360 FY %8GHz sites, 100Duals each, by

4 March 2004GridPP 9th Collaboration Meeting CDF Computing Model Develop Analysis on desktop –Access to all CDF data from anywhere Large scale processing on batch clusters –Submission from anywhere –interactive tools: ls,top,head/tail/cat –Output to scratch space or desktop Implemented Now with CAF

4 March 2004GridPP 9th Collaboration Meeting Use Cases for Summer 2004 User Level MC Production –All CDF Users have access –No data on site -> SAM write User Level Data Access –All users have access –Selected samples on site: Full SAM Support SAM Essential for Summer 2004

4 March 2004GridPP 9th Collaboration Meeting Medium Term Vision Many Sites Fully transparent submission to all of CDF resources: 75% FNAL, 25% outside Fully transparent input and output of data

4 March 2004GridPP 9th Collaboration Meeting Summer 04 Functionality User selects submission site, saying what dataset they will use System checks they can do this (privileges) User access with SAM/dCache User registers output with SAM

4 March 2004GridPP 9th Collaboration Meeting October 04 To extend beyond 25% outside computing JIM is essential: JIM Test for CDF June04, production October 04 HOWEVER: It already seems that the 25% resources are not sufficient for the produciton passes: will want JIM earlier.

4 March 2004GridPP 9th Collaboration Meeting CAF Gui/CLI CDFGrid from a User Perspective AC++ Grid TorontoKoreaItalyTaiwanFermiCAFUK CAF Gui/CLI CDF Grid from a User Perspective Only Fermilab Uses SAM Outside LabGrid Uses SAM

4 March 2004GridPP 9th Collaboration Meeting CDF Grid Strategy 25% of CDF Computing from external resources. All CDF computing on CDF Grid by April 15: Utilize resources fully controlled by CDF: Kerberos/fbsng: dCAF + SAM October 15, 2004: JIM to capture shared resources June 2005: 50% of Computing resources external

4 March 2004GridPP 9th Collaboration Meeting Desktop Anywhere Condor centers SAM DB Condor Globus GK CAF Submitter SAM each site WN Private LAN dCache June 2004 testing June 2005 required Simple JIM

4 March 2004GridPP 9th Collaboration Meeting Detailed JIM Site Resource Selector Info Collector Info Gatherer Match Making User Interface Submission Global Job Queue Grid Client Submission User Interface Global DH Services SAM Naming Server SAM Log Server Resource Optimizer SAM DB Server RCMetaData Catalog Bookkeeping Service SAM Stager(s) SAM Station (+other servs) Data Handling Worker Nodes Grid Gateway Local Job Handler (CAF, D0MC, BS,...) JIM Advertise Local Job Handling Cluster AAA Dist.FS Info Manager XML DB server Site Conf. Glob/Loc JID map... Info Providers MDS MSS Cache Site Web Serv Grid Monitoring User Tools Flow of: jobdata meta-data

4 March 2004GridPP 9th Collaboration Meeting Meeting the Needs Progress in SAM JIM Status RunJob CDFGridWorkshop: “Nerd’s Paradise” Strict Project Management and process to respond to operational issues

4 March 2004GridPP 9th Collaboration Meeting Progress in SAM Dbserver, the database server between applications and Oracle, was upgraded to use a common schema for CDF and D0. All CDF data files are in SAM Sam in is in beta testing on the CDF CAF (1200 cpus): passed 20TB/Day delivery Minos uses SAM for its Data Handling Steve Mrenna (Phenomenology) depositing ALPGEN files in SAM for common CDF/D0 use.

4 March 2004GridPP 9th Collaboration Meeting JIM Deployment Issues Focus: 200 jobs each getting 200 files generated requests simultaneously to the DBServer! –Sensible sam: reliability went to 60%. Now add retries. Training Users D0 has D0Tools: Big script; determines where user is and copies files: harder to get into a sandbox; CAF conditions users! Distribution and compatibility: This has made great strides with SAM, now time for JIM Communication with the expert!

4 March 2004GridPP 9th Collaboration Meeting RunJob Dedicated farms at FNAL will go away and RunJob will be used for production processing of data CDF will use RunJob for MC production Dave Evans worked for CDF for 2 mo.: has made CDFRunJob based on RunJob(Shakar), a tool common to CMS. Morag will work on this.

4 March 2004GridPP 9th Collaboration Meeting Florida workshop: 11 installations in about 2 hours. Integrated with dCAF in 2 cases in 2 days. 3 in Asia, 4 in Europe 6 sites committed to summer 2004 usage of their facilities for all of CDF (mostly MC) Sam installation now: initsam cdf Follow-up on April 1. Each site has a local user support person to reduce load on core development team. Generally: Security ate 80% of the effort! Now 20!

4 March 2004GridPP 9th Collaboration Meeting

4 March 2004GridPP 9th Collaboration Meeting Florida Workshop: After 2 Days

4 March 2004GridPP 9th Collaboration Meeting 2TB/Day: Karlsruhe

4 March 2004GridPP 9th Collaboration Meeting CDF Dcache on CAF ALL CDF on CAF reads 20TB/Day

4 March 2004GridPP 9th Collaboration Meeting

4 March 2004GridPP 9th Collaboration Meeting

4 March 2004GridPP 9th Collaboration Meeting Dcache and SAM Dcache shapes traffic into disk: If a SAM cache is large, need to use Dcache instead of nfs mounts Dcache gives the user what is requested. 1TB gets same priority as 1GB: CDF users must send requesting data to be staged. SAM examines consumption rate before staging next files – No needed. SAM uses Dcache for its Caching at FNAL. This needs further work with SRM

4 March 2004GridPP 9th Collaboration Meeting SAMGrid Management Sam Management Team Sam Operations And Projects Sam Design Sam Project Leaders Sam Technical Leaders

4 March 2004GridPP 9th Collaboration Meeting SamGrid Development Process SAMGrid Operations/ProjectsIssue Raised SAMGrid Design SAMGrid Management Team Grid Deliverables Subproject Chaired by Technical Managers Chaired by Project Leaders

4 March 2004GridPP 9th Collaboration Meeting Subproject Organization Each Subproject has a subproject leader (SPL) responsible for making a plan and reporting progress. Each Subproject has one of the Technical leaders evaluating against an assessment template. No deliverable requires more than 3mo work to deliver.

4 March 2004GridPP 9th Collaboration Meeting SubProject Assessment Template 1.Background Documents 2.Project Definition/Mission Statement 3.Deliverables and timetable 4.Inter-project deliverables 5.Project status 6.Challenges and Critical Path Items 7.Lessons Learned 8.Project specific comments, alternate views

4 March 2004GridPP 9th Collaboration Meeting Housekeeping SAMGrid Assigned SubProjects JIM:D0Tools Common API Database Server Rewrite Database Servers toLinux Metadata Query with configurable Params Work FlowPackage MCRequest H Stream for CDF JIM:MCD0 Test Harness Retire CDF Replica Catalog Caching Configuration Management HousekeepingMC / Reconstruction Infrastructure User analysis Apps

4 March 2004GridPP 9th Collaboration Meeting Status of Assessments Subprojects defined Interviews conducted on about ½ Assessment reports being written

4 March 2004GridPP 9th Collaboration Meeting Conclusions CDF has embraced the need for the Grid to achieve its physics mission Progress in deployment, robustness testing has SAM in CDF JIM is rapidly solving its problems … with the help of a review and management process