WestGrid Overview Dr. Rob Simmonds Distributed Systems Architect.

Slides:



Advertisements
Similar presentations
WestGrid Collaboration and Visualization Brian Corrie Collaboration and Visualization Coordinator WestGrid/SFU.
Advertisements

FP7-INFRA Enabling Grids for E-sciencE EGEE Induction Grid training for users, Institute of Physics Belgrade, Serbia Sep. 19, 2008.
Presentation Two: Grid Security Part Two: Grid Security A: Grid Security Infrastructure (GSI) B: PKI and X.509 certificates C: Proxy certificates D:
Andrew McNab - EDG Access Control - 14 Jan 2003 EU DataGrid security with GSI and Globus Andrew McNab University of Manchester
Grid Security. Typical Grid Scenario Users Resources.
Grid Tutorial Cameron Kiddle Grid Research Centre University of Calgary.
Condor-G: A Computation Management Agent for Multi-Institutional Grids James Frey, Todd Tannenbaum, Miron Livny, Ian Foster, Steven Tuecke Reporter: Fu-Jiun.
A Computation Management Agent for Multi-Institutional Grids
USING THE GLOBUS TOOLKIT This summary by: Asad Samar / CALTECH/CMS Ben Segal / CERN-IT FULL INFO AT:
Slides for Grid Computing: Techniques and Applications by Barry Wilkinson, Chapman & Hall/CRC press, © Chapter 1, pp For educational use only.
Office of Science U.S. Department of Energy Grids and Portals at NERSC Presented by Steve Chan.
1-2.1 Grid computing infrastructure software Brief introduction to Globus © 2010 B. Wilkinson/Clayton Ferner. Spring 2010 Grid computing course. Modification.
Data Grids: Globus vs SRB. Maturity SRB  Older code base  Widely accepted across multiple communities  Core components are tightly integrated Globus.
Grid Services at NERSC Shreyas Cholia Open Software and Programming Group, NERSC NERSC User Group Meeting September 17, 2007.
1c.1 Assignment 2 Preliminaries Review (Full details in assignment write-up.)‏ © 2011 B. Wilkinson/Clayton Ferner. Fall 2011 Grid computing course. Modification.
Grids and Globus at BNL Presented by John Scott Leita.
Simo Niskala Teemu Pasanen
Globus Computing Infrustructure Software Globus Toolkit 11-2.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
Resource Management Reading: “A Resource Management Architecture for Metacomputing Systems”
National Computational Science National Center for Supercomputing Applications National Computational Science MyProxy: An Online Credential Repository.
Data Management Kelly Clynes Caitlin Minteer. Agenda Globus Toolkit Basic Data Management Systems Overview of Data Management Data Movement Grid FTP Reliable.
High Performance Louisiana State University - LONI HPC Enablement Workshop – LaTech University,
GT Components. Globus Toolkit A “toolkit” of services and packages for creating the basic grid computing infrastructure Higher level tools added to this.
1 School of Computer, National University of Defense Technology A Profile on the Grid Data Engine (GridDaEn) Xiao Nong
NeSC Apps Workshop July 20 th, 2002 Customizable command line tools for Grids Ian Kelley + Gabrielle Allen Max Planck Institute for Gravitational Physics.
1 All-Hands Meeting 2-4 th Sept 2003 e-Science Centre The Data Portal Glen Drinkwater.
Reliable Data Movement using Globus GridFTP and RFT: New Developments in 2008 John Bresnahan Michael Link Raj Kettimuthu Argonne National Laboratory and.
CSF4 Meta-Scheduler Name: Zhaohui Ding, Xiaohui Wei
National Computational Science National Center for Supercomputing Applications National Computational Science NCSA-IPG Collaboration Projects Overview.
Using NMI Components in MGRID: A Campus Grid Infrastructure Andy Adamson Center for Information Technology Integration University of Michigan, USA.
1 Globus Grid Middleware: Basics, Components, and Services Source: The Globus Project Argonne National Laboratory & University of Southern California
CYBERINFRASTRUCTURE FOR THE GEOSCIENCES Data Replication Service Sandeep Chandra GEON Systems Group San Diego Supercomputer Center.
1 Windows 2008 Configuring Server Roles and Services.
Resource Brokering in the PROGRESS Project Juliusz Pukacki Grid Resource Management Workshop, October 2003.
Grid NERSC demo Shreyas Cholia Open Software and Programming NERSC User Group Meeting September 19, 2007.
NUG 2004 Grid File Yanker Demo Shreyas Cholia Mass Storage Group, NERSC 06/24/2004.
WP8 Meeting Glenn Patrick1 LHCb Grid Activities in UK Grid WP8 Meeting, 16th November 2000 Glenn Patrick (RAL)
Communicating Security Assertions over the GridFTP Control Channel Rajkumar Kettimuthu 1,2, Liu Wantao 3,4, Frank Siebenlist 1,2 and Ian Foster 1,2,3 1.
Grid Architecture William E. Johnston Lawrence Berkeley National Lab and NASA Ames Research Center (These slides are available at grid.lbl.gov/~wej/Grids)
© 2007 UC Regents1 Track 1: Cluster and Grid Computing NBCR Summer Institute Session 1.1: Introduction to Cluster and Grid Computing July 31, 2007 Wilfred.
09/02 ID099-1 September 9, 2002Grid Technology Panel Patrick Dreher Technical Panel Discussion: Progress in Developing a Web Services Data Analysis Grid.
June 24-25, 2008 Regional Grid Training, University of Belgrade, Serbia Introduction to gLite gLite Basic Services Antun Balaž SCL, Institute of Physics.
Holding slide prior to starting show. A Portlet Interface for Computational Electromagnetics on the Grid Maria Lin and David Walker Cardiff University.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
EGEE-II INFSO-RI Enabling Grids for E-sciencE The GILDA training infrastructure.
GRIDS Center Middleware Overview Sandra Redman Information Technology and Systems Center and Information Technology Research Center National Space Science.
Campus grids: e-Infrastructure within a University Mike Mineter National e-Science Centre 14 February 2006.
Creating and running an application.
Introduction to Grids By: Fetahi Z. Wuhib [CSD2004-Team19]
UK Grid Meeting Glenn Patrick1 LHCb Grid Activities in UK Grid Prototype and Globus Technical Meeting QMW, 22nd November 2000 Glenn Patrick (RAL)
1 Overall Architectural Design of the Earth System Grid.
GSI: Security On Teragrid A Introduction To Security In Cyberinfrastructure By Dru Sepulveda.
Super Computing 2000 DOE SCIENCE ON THE GRID Storage Resource Management For the Earth Science Grid Scientific Data Management Research Group NERSC, LBNL.
John Kewley e-Science Centre All Hands Meeting st September, Nottingham GROWL: A Lightweight Grid Services Toolkit and Applications John Kewley.
Bulk Data Transfer Activities We regard data transfers as “first class citizens,” just like computational jobs. We have transferred ~3 TB of DPOSS data.
MGRID Architecture Andy Adamson Center for Information Technology Integration University of Michigan, USA.
Rights Management for Shared Collections Storage Resource Broker Reagan W. Moore
LCG2 Tutorial Viet Tran Institute of Informatics Slovakia.
EGI-InSPIRE RI Grid Training for Power Users EGI-InSPIRE N G I A E G I S Grid Training for Power Users Institute of Physics Belgrade.
Campus grids: e-Infrastructure within a University Mike Mineter National e-Science Centre 22 February 2006.
Grid Security.
Creating and running applications on the NGS
CRC exercises Not happy with the way the document for testbed architecture is progressing More a collection of contributions from the mware groups rather.
THE STEPS TO MANAGE THE GRID
Viet Tran Institute of Informatics Slovakia
Patrick Dreher Research Scientist & Associate Director
From Prototype to Production Grid
Grid Computing Software Interface
Presentation transcript:

WestGrid Overview Dr. Rob Simmonds Distributed Systems Architect

Talk Overview The WestGrid project The WestGrid HPTC resources Grid services for HPTC and how they will be used in WestGrid

WestGrid Project 8 institutions More than 250 researchers Technical and operational officers HPTC: compute resources and storage Visualization and collaboration

WestGrid People PIs –Jonathan Borwein (SFU), Gren Patey (UBC), Jonathan Schaeffer (UofA), Brian Unger (UofC), Mike Vetterli (SFU/TRIUMF) HPC planning committee –Rob Balantyne, Matthew Choptuik, Corrie Kost, Harold Esche, Paul Lu, Richard Marchand, Seamus O'Shea, Mark Thachuk, Ron Senda, Martin Siegert, Rob Simmonds, Mike Vetterli Visualization planning committee –Lyn Bartram, Kelly Booth, Pierre Boulanger, Brian Corrie, Sara Diamond, Larry Katz, John MacDonald, Trever Woods CAO –Ken Hewitt

WestGrid HPTC Resources 140TB IBM storage server (Power4/AIX) 1008 processor IBM cluster (IA-32/Linux) 256 processor SGI Origin (MIPS/Irix) 144 processor HP SC45 (Alpha/Tru64) All connected by Canada’s world class networks

Grid Computing “Grid” is a set software services –Combines meta-computing, resource discovery and security –Designed to enable access to resources in different management domains –Grid services will enable WestGrid resources to be integrated into individual researcher’s computing environments

Grid Standardization Global Grid Form (GGF) is working to provide standards Open Grid Services Architecture (OGSA) defines low level Grid services

Grid toolkits Globus (Public domain – ANL/ISI) –Currently version 2.x used for production –Version 3 provides a reference implementation for OGSA Legion (Commercial – Avaki) –Provides more support for data handing –Will support OGSA

Grid Security Infrastructure Ability for trusted users to access remote resources without re- authentication Ability for trusted jobs to access remote resources without re- authentication Protection against stolen credentials Avoid requirement for dedicated, highly available security server(s)

Certificate Authority Model CA issues certificates to trusted users and services Certificates used to authenticate with remote resources that trust issuing CA Grid Canada CA will be trusted by WestGrid resources

GSI Proxy Certificates User credentials delegated from user certificate to proxy certificate –Proxy certificate used for authentication Proxy certificates have limited lifetime –can also be limited to only authenticate with certain services Proxy certificate copied to remote resource when job is started

Globus Security Commands Users can request a certificate using ‘grid-cert-request’ –This creates userkey.pem and usercert_request.pem in ~/.globus/ Certificate request file sent to CA –usercert.pem is returned and placed in ~/.globus/ Aim to automate this process for WestGrid users

Globus Security – Cont. Proxy certificate created using ‘grid-proxy-init’ Proxy certificate examined using ‘grid-proxy-info’ Proxy certificate destroyed using ‘grid-proxy-destroy’ Proxy certificates could be created during login process

GSI initialization demo …

Enabling Access to Resources Holding certificate from trusted CA does not guarantee access to resources Users given access to resource by being included in recource’s grid-mapfile –This allows owner of resource to choose which users are allowed to use the resource The grid-mapfile maps Grid user to a local account

Globus Job Starting Run job on remote resource using ‘globus-job-run ’ – must trust the CA that signed the users certificate and user must be mentioned in grid-mapfile –Proxy certificate is copied to GASS cache on to enable program to authenticate with other remote resources

Batch Job Starting ‘globus-job-submit ’ –This returns a url used to query job ‘globus-job-status ’ –Find out if the job is waiting, running or finished ‘globus-job-get-output ’ –Get output produced by job. This is stored in the GASS cache on the host where the job is running ‘globus-job-clean ’ –Remove the GASS cache entry for the job in question

GridFTP ‘globus-url-copy ’ –Copies file from one location to another file:/ - a file on a local file-system gsiftp:// / - a file on GridFTP server Extensions to standard FTP include –Third party transfers –Parallel transfers

Credential Repository NCSA’s MyProxy server provides an on- line credential repository User stores proxy certificate in repository –This certificate can be long lived User can later recover a short lived certificate from the repository

Credential Repository Uses Used to authenticate with environment when user does not have access to their certificate –e.g., in a Web portal Could be used to authenticate and get proxy certificate during login process eliminating need for Unix passwords

MyProxy Commands myproxy-init –s –Put a proxy certificate into the MyPoxy server on –Can specify host using environment variable myproxy-info –s –View information about user’s proxy certificate myproxy-get-credential –Get a proxy certificate myproxy-destroy –Remove proxy certificate from the MyProxy server

Inserting Credential

Recovering Credential

MyProxy Certificate Renewal Allows automated proxy certificate renewal Special proxy certificate enables trusted service to renew standard proxy certificate –e.g., trust a local scheduler to renew the certificate before starting a job Should help to prevent users resorting to insecure means for automating proxy renewal

GSI Enabled SSH Tools GSI enabled versions of OpenSSH tools will be used in WestGrid –gsi-ssh Authenticates through GSI and copies proxy certificates to remote host –gsi-scp Authenticates through GSI

GSI Enabled SSH

Resource Discovery Globus uses MDS for resource discovery –GRIS – provides information about individual hosts –GIIS – provides information about groups of hosts In WestGrid each of the 4 major resources will run a GRIS At least one GIIS will be provided to hold aggregate information –Probably use one per site

MDS Publish information to LDAP servers –Information used by Grid services to locate needed resources Publish information such as –Type(s) of job scheduler available –Parameters accepted by job scheduler –Number of processors –Amount of RAM, disk or tape –Software and license availability

MDS Example

Meta-scheduling A meta-scheduler is used to submit jobs to other job schedulers WestGrid will employ meta-scheduling –Condor-G, Silver and Trellis are under consideration –Multiple meta-schedulers could be used Hierarchical meta-scheduling can be employed

Condor-G Can be used to submit jobs to specific machines Can use ‘glideins’ to add resources to local condor pool New version will include support for batch scheduler advertisements

Condor-G : Glidein Example Movie at

Result: Solar System Viz Movie at

WestGrid Accounting Use MDS to publish accounting information from each site to LDAP WestGrid wide accounting calculated and also published in secure LDAP Users will be able to gain access to information, filtered by a policy manager

Scheduling Priorities Plan to use accounting information to provide fairness in scheduling priorities across WestGrid Feed values calculated using global accounting information back into local batch schedulers

Data Storage Grid enabled access to storage –Accessible from researcher’s desktop Distributed file systems currently limited –Security and caching issues Data repository systems provide much of the functionality required –SRB from SDSC –Giggle from ISI/ANL

Repository management Large network available file stores Annotation – meta-data tagging Data representation optimization –Files, collections and containers User level replication aided by catalogs

Look at SRB

SRB – “S commands”

Wide Area Message Passing MPI-G2 enables running of message passing jobs in Grid environment Attempts to use best MPI implementation at each site Provides process mapping configuration to group tightly coupled processes

Web Portals Enable access to Grid services via web browser Start a secure session then authenticate this session with GSI using credential server Web session now acts as you in Grid environment WestGrid mock up

Getting a WestGrid Account Centralized Web based account requests We get certificate or you use exiting certificate We setup accounts, install certificates and you

WestGrid Grid Environment Initial Grid services use –Globus, MyProxy, OpenSSH, SRB Services include –Job starting, resource discover, credential management and repository management Working on having meta-scheduler(s) –Condor-G, …

Lots of work to do … Distributed file systems Improved replica management Fine-grain security Performance measurement and analysis Credential based information discovery Enhanced meta-scheduling Workflow

Credits – TeleSim helpers Mark Fox (TeleSim programmer) –Web portals, demo Andrey Mirchovski (TeleSim research student) –Security and chief Globus critic Phil Rizk (Hons project student/TeleSim programmer) –MDS, accounting and Web services

Questions and Comments …