July 25, 20071/21 OSG Information Services Gabriele Garzoglio, Rob Quick, Chris Green OSG Information Services, VO Monitoring Services and Resource Selection Services Gabriele Garzoglio, Chris Green, Computing Division, Fermilab Rob Quick, Indiana University OSG User Meeting & OSG Site Administrators Meeting July 2007 OSG Information Services Architecture The VO Resource Service (VORS) The OSG Resource Selection Service (ReSS) ClassAd Matchmaking How these affect the Sites How these affect the User
July 25, 20072/21 OSG Information Services Gabriele Garzoglio, Rob Quick, Chris Green Context The OSG Information Services have 4 goals: –Provide static and “real-time” (where real-time is still evolving) information about Resource configurations and state. –Feed OSG-wide monitoring tools and provide interfaces to this information for Grid operations, VOs and Users. –Provide information for interoperation of OSG and EGEE for LHC Experiments and WLCG operations. –Provide information for resource selection by OSG VOs and Users.
July 25, 20073/21 OSG Information Services Gabriele Garzoglio, Rob Quick, Chris Green Please ask Questions During this talk We are looking for input, feedback and guidance.
July 25, 20074/21 OSG Information Services Gabriele Garzoglio, Rob Quick, Chris Green OSG IS Architecture Grid / Site Interface VO / Grid Interface Site Grid VO Static Info (LDIF) Info Providers Config … Configuration Info Formatting Info Publishing Site Info Publisher (CEMon) Generic Info Providers (GIP) Info Gathering Info Collection Job / Res. Match ReSS Info Collection BDII Info Coll. WLCG BDII Job Queue Condor Schedd Job/Res. Match Condor Matchmaker Job Queue Job/Res. Match EGEE Resource Broker (RB) Info Collection Info Display VORS Info Gathering VORS Probes Instantiate… LDIF Classad LDIF
July 25, 20075/21 OSG Information Services Gabriele Garzoglio, Rob Quick, Chris Green VORS in OSG IS Grid / Site Interface VO / Grid Interface Site Grid VO Static Info (LDIF) Info Providers Config … Configuration Info Formatting Info Publishing Site Info Publisher (CEMon) Generic Info Providers (GIP) Info Gathering Info Collection Job / Res. Match ReSS Info Collection BDII Info Coll. WLCG BDII Job Queue Condor Schedd Job/Res. Match Condor Matchmaker Job Queue Job/Res. Match EGEE Resource Broker (RB) Info Collection Info Display VORS Info Gathering VORS Probes Instantiate… LDIF Classad LDIF
July 25, 20076/21 OSG Information Services Gabriele Garzoglio, Rob Quick, Chris Green What VORS does for you… Allows VO users to pick which sites support their VO Provides critical site info to a VO user Gives users a snapshot of current grid and site status Will provide a facility for users to look at other Grids from an OSG PO
July 25, 20077/21 OSG Information Services Gabriele Garzoglio, Rob Quick, Chris Green
July 25, 20078/21 OSG Information Services Gabriele Garzoglio, Rob Quick, Chris Green
July 25, 20079/21 OSG Information Services Gabriele Garzoglio, Rob Quick, Chris Green
July 25, /21 OSG Information Services Gabriele Garzoglio, Rob Quick, Chris Green ReSS in OSG IS Grid / Site Interface VO / Grid Interface Site Grid VO Static Info (LDIF) Info Providers Config … Configuration Info Formatting Info Publishing Site Info Publisher (CEMon) Generic Info Providers (GIP) Info Gathering Info Collection Job / Res. Match ReSS Info Collection BDII Info Coll. WLCG BDII Job Queue Condor Schedd Job/Res. Match Condor Matchmaker Job Queue Job/Res. Match EGEE Resource Broker (RB) Info Collection Info Display VORS Info Gathering VORS Probes Instantiate… LDIF Classad LDIF
July 25, /21 OSG Information Services Gabriele Garzoglio, Rob Quick, Chris Green ReSS Motivations Implement a light-weight cluster selector for push-based job handling services Enable users to express requirements on the resources in the job description Enable users to refer to abstract characteristics of the resources in the job description Provide soft-registration for clusters Use the standard characterizations of the resources via the Glue Schema
July 25, /21 OSG Information Services Gabriele Garzoglio, Rob Quick, Chris Green ReSS Technology ReSS basis its central services on the Condor Match- making service –Users of Condor-G naturally integrate their scheduler servers with ReSS –Condor information collector manages resource soft registration Resource characteristics is handled at sites by the EGEE gLite CE Monitor Service (CEMon) –CEmon registers with the central ReSS services at startup –Info is gathered by CEMon at sites running Generic Information Prividers (GIP) –GIP expresses resource information via the Glue Schema model –CEMon converts the information from GIP into old classad format. Other supported formats: XML, LDIF, new classad –CEMon publishes information using web services interfaces
July 25, /21 OSG Information Services Gabriele Garzoglio, Rob Quick, Chris Green A case study: VO Schedd to interact with ReSS Grid / Site Interface VO / Grid Interface Site Grid VO Static Info (LDIF) Info Providers Config … Configuration Info Formatting Info Publishing Site Info Publisher (CEMon) Generic Info Providers (GIP) Info Gathering Info Collection Job / Res. Match ReSS Info Collection BDII Info Coll. WLCG BDII Job Queue Condor Schedd Job/Res. Match Condor Matchmaker Job Queue Job/Res. Match EGEE Resource Broker (RB) Info Collection Info Display VORS Info Gathering VORS Probes Instantiate… LDIF Classad LDIF
July 25, /21 OSG Information Services Gabriele Garzoglio, Rob Quick, Chris Green VO Condor-Schedd interacts with ReSS Condor Match Maker Info Gatherer classads Condor Scheduler job What Gate? Gate 3 job CEMon CE Gate1 job-managers jobsinfo CLUSTER GIP CEMon CE Gate2 job-managers jobsinfo CLUSTER GIP CEMon CE Gate3 job-managers jobsinfo CLUSTER GIP ReSS Info Gatherer is the Interface Adapter between CEMon and Condor Grid / Site Interface VO / Grid Interface Grid Site VO
July 25, /21 OSG Information Services Gabriele Garzoglio, Rob Quick, Chris Green User Interacts with Schedd and ReSS universe = globus globusscheduler = $$(GlueCEInfoContactString) requirements = TARGET.GlueCEAccessControlBaseRule == "VO:DZero" executable = /bin/hostname arguments = -f queue MyType = "Machine" Name = "antaeus.hpcc.ttu.edu:2119/jobmanager-lsf-dzero " Requirements = (CurMatches < 10) ReSSVersion = "1.0.6" TargetType = "Job" GlueSiteName = "TTU-ANTAEUS" GlueSiteUniqueID = "antaeus.hpcc.ttu.edu" GlueCEName = "dzero" GlueCEUniqueID = "antaeus.hpcc.ttu.edu:2119/jobmanager-lsf-dzero" GlueCEInfoContactString = "antaeus.hpcc.ttu.edu:2119/jobmanager-lsf" GlueCEAccessControlBaseRule = "VO:dzero" GlueCEHostingCluster = "antaeus.hpcc.ttu.edu" GlueCEInfoApplicationDir = "/mnt/lustre/antaeus/apps GlueCEInfoDataDir = "/mnt/hep/osg" GlueCEInfoDefaultSE = "sigmorgh.hpcc.ttu.edu" GlueCEInfoLRMSType = "lsf" GlueCEPolicyMaxCPUTime = 6000 GlueCEStateStatus = "Production" GlueCEStateFreeCPUs = 0 GlueCEStateRunningJobs = 0 GlueCEStateTotalJobs = 0 GlueCEStateWaitingJobs = 0 GlueClusterName = "antaeus.hpcc.ttu.edu" GlueSubClusterWNTmpDir = "/tmp" GlueHostApplicationSoftwareRunTimeEnvironment = "MountPoints,VO-cms-CMSSW_1_2_3" GlueHostMainMemoryRAMSize = 512 GlueHostNetworkAdapterInboundIP = FALSE GlueHostNetworkAdapterOutboundIP = TRUE GlueHostOperatingSystemName = "CentOS" GlueHostProcessorClockSpeed = 1000 GlueSchemaVersionMajor = 1 … Resource Description Job Description Abstract Resource Characteristic Resource Requirements
July 25, /21 OSG Information Services Gabriele Garzoglio, Rob Quick, Chris Green Does this sound like something you need to do ? (Users) Does this sound reasonable to you? (Site Admins)
July 25, /21 OSG Information Services Gabriele Garzoglio, Rob Quick, Chris Green ReSS Deployment on OSG Click here for live URLhere
July 25, /21 OSG Information Services Gabriele Garzoglio, Rob Quick, Chris Green Status of ReSS ReSS is a lightweight Resource Selection Service for push-based job handling systems ReSS is deployed on OSG as a general service: talk to us if you are interested! DZero and Engagement VO use ReSS on OSG ReSS is used by FermiGrid for campus-wide resource selection More info at election/ election/
July 25, /21 OSG Information Services Gabriele Garzoglio, Rob Quick, Chris Green What Sites Need to Do Configure GIPs correctly so show Green on GIP monitor validate.grid.iu.edu/production/index.htmlhttp://gip- validate.grid.iu.edu/production/index.html Make sure VORS reports correct info for your site Make sure CEMon reports info from your site History.html History.html Ask for help from if you have any questions or
July 25, /21 OSG Information Services Gabriele Garzoglio, Rob Quick, Chris Green What VOs and Users need to do Understand parameters needed to select resource where your applications can run Interface the Information services to your application AND/OR use one of the OSG provided resource selectors (details in hidden slides).
July 25, /21 OSG Information Services Gabriele Garzoglio, Rob Quick, Chris Green Conclusions OSG Information Services exist and are used in patches but the information provided is not yet complete nor uniform. We need the Sites to pay attention to the information content and configurations. We support Users who want to use any or all of the tools. OSG has a focus on Usability and Robustness over the next 12 months
July 25, /21 OSG Information Services Gabriele Garzoglio, Rob Quick, Chris Green Additional Slides for More Detailed Information
July 25, /21 OSG Information Services Gabriele Garzoglio, Rob Quick, Chris Green User Interaction with ReSS The ReSS exposes information via condor collector interfaces –Programmatically: via a Web Service interface –Command line, via condor_status Examples: Tools Tools The Engagement VO gets OSG info from ReSS and does match making via a VO Match Making Service: ntVO ntVO –Condor scheduler interaction with ReSS See how to connect a scheduler directly to the OSG ReSS (à la DZero): See how FermiGrid uses ReSS for campus-wide resource selection: –Glue Schema Attributes definition: –FermiGrid classads:
July 25, /21 OSG Information Services Gabriele Garzoglio, Rob Quick, Chris Green Glue Schema to old classad Mapping Site Cluster CE1 SubCluster1 SubCluster2 CE2 VO1 VO2 VO3 … Mapping the Glue Schema “tree” into a set of “flat” classads: all possible combination of (Cluster, Subcluster, CE, VO)
July 25, /21 OSG Information Services Gabriele Garzoglio, Rob Quick, Chris Green Glue Schema to old classad Mapping Site Cluster CE1 SubCluster1 SubCluster2 CE2 VO1 VO2 VO3 Site Cluster SubCluster1 CE1 VO1 classad … Mapping the Glue Schema “tree” into a set of “flat” classads: all possible combination of (Cluster, Subcluster, CE, VO)
July 25, /21 OSG Information Services Gabriele Garzoglio, Rob Quick, Chris Green Glue Schema to old classad Mapping Site Cluster CE1 SubCluster1 SubCluster2 CE2 VO1 VO2 VO3 Site Cluster SubCluster1 CE1 VO1 classad Site Cluster SubCluster2 CE1 VO1 classad … Mapping the Glue Schema “tree” into a set of “flat” classads: All possible combination of (Cluster, Subcluster, CE, VO)
July 25, /21 OSG Information Services Gabriele Garzoglio, Rob Quick, Chris Green Glue Schema to old classad Mapping Site Cluster CE1 SubCluster1 SubCluster2 CE2 VO1 VO2 VO3 Site Cluster SubCluster1 CE1 VO1 classad Site Cluster SubCluster2 CE1 VO1 classad Site Cluster SubCluster1 CE1 VO2 classad … Mapping the Glue Schema “tree” into a set of “flat” classads: All possible combination of (Cluster, Subcluster, CE, VO)
July 25, /21 OSG Information Services Gabriele Garzoglio, Rob Quick, Chris Green Glue Schema to old classad Mapping Site Cluster CE1 SubCluster1 SubCluster2 CE2 VO1 VO2 VO3 Site Cluster SubCluster1 CE1 VO1 classad Site Cluster SubCluster2 CE1 VO1 classad Site Cluster SubCluster1 CE1 VO2 classad Site Cluster SubCluster2 CE1 VO2 classad Site Cluster SubCluster1 CE2 VO1 classad Site Cluster SubCluster2 CE2 VO1 classad … Mapping the Glue Schema “tree” into a set of “flat” classads: All possible combination of (Cluster, Subcluster, CE, VO)