EPCC, University of Edinburgh DIRAC and SAFE. DIRAC requirements DIRAC serves a variety of different user communities. –These have different computational.

Slides:

Advertisements

Similar presentations

Overview of local security issues in Campus Grid environments Bruce Beckles University of Cambridge Computing Service.

Advertisements

The National Grid Service and OGSA-DAI Mike Mineter

HPCx Power for the Grid Dr Alan D Simpson HPCx Project Director EPCC Technical Director.

REDCap Treatment Randomization Module

Test Case Management and Results Tracking System October 2008 D E L I V E R I N G Q U A L I T Y (Short Version)

MS CRM Integration WhosOn Service Integration Presentation MS CRM User Group.

Andrew McNab - EDG Access Control - 14 Jan 2003 EU DataGrid security with GSI and Globus Andrew McNab University of Manchester

Computer Monitoring System for EE Faculty By Yaroslav Ross And Denis Zakrevsky Supervisor: Viktor Kulikov.

Dorian Grid Identity Management and Federation Dialogue Workshop II Edinburgh, Scotland February 9-10, 2006 Stephen Langella Department.

Network Management Overview IACT 918 July 2004 Gene Awyzio SITACS University of Wollongong.

Milestone 5 presentation Harlan Broughton Stephen Link.

Web Servers How do our requests for resources on the Internet get handled? Can they be located anywhere? Global?

Input Validation For Free Text Fields ADD Project Members: Hagar Offer & Ran Mor Academic Advisor: Dr Gera Weiss Technical Advisors: Raffi Lipkin & Nadav.

MCTS Guide to Microsoft Windows Server 2008 Network Infrastructure Configuration Chapter 8 Introduction to Printers in a Windows Server 2008 Network.

These materials are licensed under the Creative Commons Attribution-Noncommercial 3.0 Unported license (

8/9/2015 1:47 AM SurveyCentralOverview.ppt CSC ©Copyright 2012 Online Survey Application: CSC Survey Central System Overview November 26, 2012 Supported.

Database Management Systems (DBMS)

Project Implementation for COSC 5050 Distributed Database Applications Lab1.

IT Job Roles Task 20. Software Engineer Job Description Software engineers are responsible for creating and maintaining software of various different.

Ch 11 Managing System Reliability and Availability 1.

Anthony Atkins Digital Library and Archives VirginiaTech ETD Technology for Implementers Presented March 22, 2001 at the 4th International.

The SAM-Grid Fabric Services Gabriele Garzoglio (for the SAM-Grid team) Computing Division Fermilab.

For more notes and topics visit:

QCDgrid Technology James Perry, George Beckett, Lorna Smith EPCC, The University Of Edinburgh.

M. Taimoor Khan * Java Server Pages (JSP) is a server-side programming technology that enables the creation of dynamic,

Building service testbeds on FIRE D5.2.5 Virtual Cluster on Federated Cloud Demonstration Kit August 2012 Version 1.0 Copyright © 2012 CESGA. All rights.

Tutorial 10 Adding Spry Elements and Database Functionality Dreamweaver CS3 Tutorial 101.

Computer Networks. IP Addresses Before we communicate with a computer on the network we have to be able to identify it. Every computer on a network must.

Raffaele Di Fazio Connecting to the Clouds Cloud Brokers and OCCI.

Global Customer Partnership Council Forum | 2008 | November 18 1IBM - GCPC MeetingIBM - GCPC Meeting IBM Lotus® Sametime® Meeting Server Deployment and.

Bboogle: Updates and Experience Patricia Goldweic, Sr. Software Engineer, Northwestern University Brian Nielsen, Project Manager, Faculty Initiatives,

ADOPTING OPEN SOURCE INTEGRATED LIBRARY SYSTEMS Best Practices Presented by Vandana Singh, PhD Assistant Professor, School of Information Sciences University.

DCS Overview MCS/DCS Technical Interchange Meeting August, 2000.

The National Grid Service User Accounting System Katie Weeks Science and Technology Facilities Council.

HOW WEB SERVER WORKS? By- PUSHPENDU MONDAL RAJAT CHAUHAN RAHUL YADAV RANJIT MEENA RAHUL TYAGI.

Chapter 7: Database Systems Succeeding with Technology: Second Edition.

Usability Issues Documentation J. Apostolakis for Geant4 16 January 2009.

Stephen Booth EPCC Stephen Booth GridSafe Overview.

Stephen Booth EPCC Stephen Booth GridSafe Overview.

University of Palestine Faculty of Applied Engineering and Urban Planning Software Engineering Department Prepared By Ahmed Obaid Wassim Salem Supervised.

Guide to Linux Installation and Administration, 2e1 Chapter 2 Planning Your System.

Scalable Web Server on Heterogeneous Cluster CHEN Ge.

 Whether using paper forms or forms on the web, forms are used for gathering information. User enter information into designated areas, or fields. Forms.

Session 7 - Maintenance - contract and day-to-day Maintenance Support Presenter  Grenville Powell (Managing Director - of Shokaz Integrated Computing.

Distributed System Concepts and Architectures 2.3 Services Fall 2011 Student: Fan Bai

Mark Lertvat (Speaker) Mike Jurney Andrew Levine Evan Davis.

Next Steps: becoming users of the NGS Mike Mineter

MEMBERSHIP AND IDENTITY Active server pages (ASP.NET) 1 Chapter-4.

DoC Private IaaS Cloud Thomas Joseph Cloud Manager

Derek Wright Computer Sciences Department University of Wisconsin-Madison Condor and MPI Paradyn/Condor.

Adrian Jackson, Stephen Booth EPCC Resource Usage Monitoring and Accounting.

Using the ARCS Grid and Compute Cloud Jim McGovern.

The National Grid Service Mike Mineter.

Introduction to AFS IMSA Intersession 2003 An Overview of AFS Brian Sebby, IMSA ’96 Copyright 2003 by Brian Sebby, Copies of these slides.

1 TCS Confidential. 2 Objective : In this session we will be able to learn:  What is Cloud Computing?  Characteristics  Cloud Flavors  Cloud Deployment.

The National Grid Service User Accounting System Katie Weeks Science and Technology Facilities Council.

Charaka Palansuriya EPCC, The University of Edinburgh An Alarms Service for Federated Networks Charaka.

Database Administration Advanced Database Dr. AlaaEddin Almabhouh.

GNU EPrints 2 Overview Christopher Gutteridge 19 th October 2002 CERN. Geneva, Switzerland.

Fox Scientific, Inc. ONLINE ORDERING 101. Welcome to our website On our main page you can find current promotions, the vendors we offer, technical references.

PRACE-EGI helpdesk integration

THE STEPS TO MANAGE THE GRID

Introduction to Computers

Chapter 3: Windows7 Part 4.

Chapter 14: Protection.

An Introduction to Computer Networking

An Introduction to Operating Systems

Presentation transcript:

EPCC, University of Edinburgh DIRAC and SAFE

DIRAC requirements DIRAC serves a variety of different user communities. –These have different computational requirements best served by different types of computer. –User communities are spread across many different institutions. –Resources are geographically distributed and run by multiple organisations. –Some of these resources are provided by existing services with existing procedures. Funding is limited –Mostly only HW was funded. –Need to provide rest of the service as efficiently as possible. –Need to utilise existing infrastructure/processes where possible –Avoid unnecessary complications.

Stakeholders Dirac management –Need overview of usage of resources to inform allocation policy. –Need mechanisms to implement allocation policy. Research communities –Need resource usage information to manage community science programme. –Need mechanisms to manage community membership. –Need mechanisms to manage community resources. Users –Need to be able to request accounts (frequently at remote institutions) –Need to access accounts remotely –Want to get on with science without additional complications.

Level of integration Most requirements for integration are at the management level Experience suggests a strong correlation between user communities and compute resource. –Communities will choose resources appropriate to their science. –Users will want to access the unique features of these resources. –Though projects may span resources most individual users will probably stick to a single system. Global accounts, single-sign-on etc. not essential.

GRID? Computational grid not appropriate –Grids designed to provide uniform access to interchangeable resources. DIRAC resources are complementary not interchangeable. –Provides standard interface but only to features common to all systems Data grid may be more relevant. –Depends on the data handling requirements of user communities. –Need to gather more requirements.

SAFE design principles SAFE has been built to provide a single point of contact for users of national HPC services. –Role essentially that of the ITIL service desk. –Originally deployed for HPCx service, Currently used for HECToR service. Also used for internal EPCC services. Provides a well defined interface for service providers. –Tries to express all requests as standard tickets. –Supports multiple service providers with different support policies. Has to make very few technological assumptions. –Users can come from any academic institution. Can’t assume much more than and Web. –We usually bid to run service in parallel with hardware procurement. We have little say over hardware or system software and need to adapt SAFE quickly to provide service if bid successful.

SAFE design principles II Has to be flexible rather than prescriptive. –Requirements have changed constantly over the 10 years of SAFE development. –Need to be able to quickly implement new reports or policies generated by RCs or policy panels. –Need to maintain access to old data even when current system/policy has changed. –Need to be able to integrate new services into existing instances. –Need to be able to adapt tickets to meet needs of service teams and underlying infrastructure. Controlling our own software gives us a great deal of flexibility. –We have built up an extensive toolbox to allow rapid implementation of new requirements.

What can SAFE offer DIRAC. Software already exists and is already managing BG/Q service (minimal cost). Its designed to handle distributed user communities from many different institutions. –Many DIRAC users will already be familiar with it. Its designed to handle multiple service providers with different operating policies. While the SAFE supports many features sites only need to adopt those that work with their normal way of working.

SAFE as a service Can use the BG/Q safe to provide a service for the whole of DIRAC –Host, install, maintain, modify where necessary. –Generates necessary reports and statistics for whole of DIRAC. –Provides single point to manage project membership, account creation etc. –Lightweight and non-intrusive integration with service providers. –Special handling to work within local policies. –Choice over which features are adopted. –Centralised service requires minimal changes to existing software and only needs O(N) interactions not O(N 2 )

Account creation. Accounts requested via SAFE –Sends request to project manager. –Once approved raises ticket with service provider –Default is to do this by , XML available for scripts. Hi Support, This user has been authorised to have an account on one of our machines. Please create a new user account for them using the following information. Task ID: Machine: hector Username: demo User's Name: Dr Stephen P Booth Consortium: z01 - USL Project Group(s): z01 UID: GID: 1001 Thanks, The SAF. P.S. You can see the current pending queue by looking at New User Pending :3: :00:00 hector z01 USL z demo z01 Dr Stephen Booth

Completing tickets. Once created need to notify SAFE via web-form –Manually via browser or automatically via script. –Service provider can reject tickets. –Initial (one-shot?) password returned to SAFE for retrieval by user. –Similar mechanism possible for password resets. We can gather more information if needed –IP address ranges has been requested. We can encode local policies on Usernames UID/GID ranges into SAFE. Or we can let site choose UID/GID/Username and return values to SAFE when completing ticket. –UID/GID only need to be managed centrally if supporting file-system cross mounts.

Accounting/Reports SAFE contains an extensive accounting sub-system. Accounting data is parsed into DB tables. –Do NOT mandate a fixed format instead keep data close to raw format and define mappings to standard properties. –Easier to change system/policy without re-importing old data. –Easier to handle different service provider policies –Single reports may combine data from multiple tables in different formats provided reports are based on common properties. Service providers only need to provide DIRAC usage data in some convenient format. –Normally upload data daily. –Can also support storage accounting though this does currently use a fixed format.

Resource Management Safe can provide more detailed resource management. Uses a 3 level model. 1.Project – Top level corresponds to a grant of resources from allocation panel mostly internal to SAFE 2.ProjectGroup – Internal project management grouping controlled by project PI or designated managers through web interface. These can be just compute budgets but may also correspond to unix groups if used to manage disk resources. 3.User – individual user. Though this gives a lot of fine control to PI/PM it requires more integration with service provider –Sites can choose to use local resource management procedures instead. –Accounting does NOT depend on SAFE managing the resources.