Implementing Production Grids

Slides:



Advertisements
Similar presentations
High Performance Computing Course Notes Grid Computing.
Advertisements

Certification Authority. Overview  Identifying CA Hierarchy Design Requirements  Common CA Hierarchy Designs  Documenting Legal Requirements  Analyzing.
The DOE Science Grid Computing and Data Infrastructure for Large-Scale Science William Johnston, Lawrence Berkeley National Lab Ray Bair, Pacific Northwest.
4/22/2002 Implementing Production Grids William E. Johnston The NASA IPG Engineering Team ( and.
Slides for Grid Computing: Techniques and Applications by Barry Wilkinson, Chapman & Hall/CRC press, © Chapter 1, pp For educational use only.
Computing and Data Infrastructure for Large-Scale Science Deploying Production Grids: NASA’s IPG and DOE’s Science Grid William E. Johnston
Makrand Siddhabhatti Tata Institute of Fundamental Research Mumbai 17 Aug
Presenter: Dipesh Gautam.  Introduction  Why Data Grid?  High Level View  Design Considerations  Data Grid Services  Topology  Grids and Cloud.
DISTRIBUTED COMPUTING
Grid Security Issues Shelestov Andrii Space Research Institute NASU-NSAU, Ukraine.
Grid Technologies  Slide text. What is Grid?  The World Wide Web provides seamless access to information that is stored in many millions of different.
CSF4 Meta-Scheduler Name: Zhaohui Ding, Xiaohui Wei
The Grid System Design Liu Xiangrui Beijing Institute of Technology.
Grid Architecture William E. Johnston Lawrence Berkeley National Lab and NASA Ames Research Center (These slides are available at grid.lbl.gov/~wej/Grids)
Ames Research CenterDivision 1 Information Power Grid (IPG) Overview Anthony Lisotta Computer Sciences Corporation NASA Ames May 2,
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
GRIDS Center Middleware Overview Sandra Redman Information Technology and Systems Center and Information Technology Research Center National Space Science.
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
Introduction to Grids By: Fetahi Z. Wuhib [CSD2004-Team19]
6/23/2005 R. GARDNER OSG Baseline Services 1 OSG Baseline Services In my talk I’d like to discuss two questions:  What capabilities are we aiming for.
7. Grid Computing Systems and Resource Management
Globus and PlanetLab Resource Management Solutions Compared M. Ripeanu, M. Bowman, J. Chase, I. Foster, M. Milenkovic Presented by Dionysis Logothetis.
GRID ANATOMY Advanced Computing Concepts – Dr. Emmanuel Pilli.
Rights Management for Shared Collections Storage Resource Broker Reagan W. Moore
Active Directory Domain Services (AD DS). Identity and Access (IDA) – An IDA infrastructure should: Store information about users, groups, computers and.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
Grid Services for Digital Archive Tao-Sheng Chen Academia Sinica Computing Centre
Data Grids, Digital Libraries and Persistent Archives: An Integrated Approach to Publishing, Sharing and Archiving Data. Written By: R. Moore, A. Rajasekar,
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI solution for high throughput data analysis Peter Solagna EGI.eu Operations.
Bob Jones EGEE Technical Director
Accessing the VI-SEEM infrastructure
Grid and Cloud Computing
Grid Computing Security Mechanisms: the state-of-the-art
Operating Systems : Overview
Peter Kacsuk – Sipos Gergely MTA SZTAKI
Security Requirements for ChinaGrid Applications - What the current grid security solutions cannot do Hai Jin Huazhong University of Science and Technology.
HellasGrid CA & euGridPMA
Globus —— Toolkits for Grid Computing
Middleware independent Information Service
Grid Computing.
Organized by governmental sector (National Institute of information )
University of Technology
Grid Computing B.Ramamurthy 9/22/2018 B.Ramamurthy.
Operating Systems : Overview
Operating Systems Bina Ramamurthy CSE421 11/27/2018 B.Ramamurthy.
The Globus Toolkit™: Information Services
Distributed Systems Bina Ramamurthy 11/30/2018 B.Ramamurthy.
Core Grid Functions: A Minimal Architecture for Grids
Operating Systems : Overview
Core Grid Functions: A Minimal Architecture for Grids
Grid Services B.Ramamurthy 12/28/2018 B.Ramamurthy.
Managing Services with VMM and App Controller
Operating Systems : Overview
From Prototype to Production Grid
Operating Systems : Overview
Operating Systems : Overview
Operating Systems : Overview
Operating Systems : Overview
Operating Systems : Overview
Operating Systems : Overview
Proposed Grid Protocol Architecture Working Group
Operating Systems : Overview
The Anatomy and The Physiology of the Grid
Operating Systems : Overview
The Anatomy and The Physiology of the Grid
Distributed Systems Bina Ramamurthy 4/22/2019 B.Ramamurthy.
gLite The EGEE Middleware Distribution
Grid Computing Software Interface
Information Services Claudio Cherubino INFN Catania Bologna
Presentation transcript:

Implementing Production Grids Bina Ramamurthy 1/16/2019 B.Ramamurthy

Introduction Based on “Implementing Production Grids” by William Johnston, of The NSASA IPG Engineering Team and The DOE Science Grid Team http://www-library.lbl.gov/docs/LBNL/511/92/PDF/LBNL-51192.pdf Production grids are intended to provide identified user communities with a rich, stable, and standard distributed computing environment. Standards are mainly from the Global Gird Forum (GGF) 1/16/2019 B.Ramamurthy

Existing Production Grids Grids provide a persistent infrastructure for scientific and business applications. Existing grids: UK e-science grid NASA Information Power Grid DOE science grid(s) Asia Pacific Grid These are infrastructure projects. 1/16/2019 B.Ramamurthy

Grid Services Number of projects aimed at providing types of higher-level grid services that will be used directly by the community. Ninf ( A network based information library for global worldwide computing infrastructure) GridLab Network Weather Services 1/16/2019 B.Ramamurthy

Coverage in the Paper Various suites related to the gird: Globus (infrastructure) Condor (infrasructure) SRB/MCAT (federating and cataloguing tertiary storage) PBSPro (job management system) A PKI authentication substrate 1/16/2019 B.Ramamurthy

Major Topics Deploying operational infrastructure and manage it. Establishing cross-site trust Dealing with scaling issues Listening to and interfacing with users 1/16/2019 B.Ramamurthy

The Grid Context Girds are an approach for building dynamically constructed problem-solving using geographically and organizationally dispersed, high-performance and data handling resources. Grids also provide important infrastructure supporting multi-institutional and multi-organizational collaboration. Functionally, girds are tools, middleware and services for a wide variety of applications. (but currently only scientific applications.) 1/16/2019 B.Ramamurthy

Important Features A set of uniform software services that manage and provide access to heterogeneous, distributed resources. Widely deployed infrastructure. Is that all? 1/16/2019 B.Ramamurthy

Grid Architecture See the copy of the enclosed page 1/16/2019 B.Ramamurthy

Basic Functions (Hour glass model) The set of basic functions a grid must have are called the Common Grid Services These include: Grid Information Service (GIS) Grid Security Infrastructure (GSI) Grid Job initiator (Globus GRAM) Grid Scheduling function (NWS, Maui) Basic data management mechanism (GridFTP) Grid event monitoring (Grid Monitoring Architecture) 1/16/2019 B.Ramamurthy

Usage Models Anticipated usage model will determine what gets deployed and when. These usage models can be further divided into compute models and data models. (compute grid and data grid) 1/16/2019 B.Ramamurthy

Compute and Data Models Compute Models Export existing services Loosely coupled processes Workflow managed processes Distributed-pipelined/coupled processes Tightly coupled processes Data Models Occasional access to multiple tertiary storage Distributed analysis of massive datasets followed by cataloging and archiving Large Reference data sets Grid metadata management 1/16/2019 B.Ramamurthy

Grid Support for Collaboration Grids support collaboration in the form of virtual organization (VO). VO is a combination of human collaborators and the grid environment they share. Security: GSISSH, GSIFTP, GridFTP: GSI provides authentication, communication and trust management. Persistent Publication service: Preserve organizational structure and share community information: GIS Group-to-group audio and videoconferencing facility based on Internet IP multicast: Access Grid. 1/16/2019 B.Ramamurthy

Building a Multi-site Computational and Data Grid Test Environment The grid building team: sys admin plays an important role: Grid software involves root-owned processes and also trust model for authorizing users that is not typical. Form a working group (WG). Grid resources: identify computing and storage resources to be incorporated into the grid. Install batch schedulers to manage load. Use co-scheduling. Co-scheduling for the grid involves scheduling multiple individual, potential architecturally and administratively heterogeneous computing resources so that multiple processes are guaranteed to execute at the same time in order to communicate and coordinate with each other. Examples: PBSPro, Maui. We will use Globus grid software for the test environment. 1/16/2019 B.Ramamurthy

Initial Test Bed Grid information service: to locate resources based on characteristics needed by the job (OS, CPU count, memory, etc.) Globus Meta Data Service (MDS) provides GRIS and GIIS respectively providing the registry and directory services. Use PKI authentication and use certificates from Globus Certificate Authority for the test environment. Validate access to, and operation of GIS/GIISs at all sites and test local and remote job submission using these certificates. 1/16/2019 B.Ramamurthy

Trust Management GSI provides uniform grid entity naming and authentication mechanism. But the real issue is establishing the “trust” in the process that each CA uses for issuing the identity certificates to users and other entities such as host and services. Two steps defined in CA policy: Physical identification of the entities, verification of their association with the organization and assigning appropriate names. X-509 certificate is issued for the entity. 1/16/2019 B.Ramamurthy

Trust and Usage Trust is confidence in or reliance on some quality or attribute of a person or thing or the truth of a statement. Grid identity token (in say X.509) is presented for remote authentication. It is verified by the using appropriate cryptographic techniques. The relying party should have some level of confidence that the entity that initiated the transaction is the entity that is expected to be. 1/16/2019 B.Ramamurthy

Establishing an Operational CA Set up or identify a CA to issue Grid X.509 certificates to users and hosts. You may use the Netscape CMS (Certificate Management System). CA policies are encoded in formal statements called Certificate Policy/Certification Practice Statement (CP/CPS). Templates are available for these. Determine your space of entities for each of which you will have to issue certificates: humans, hosts, services, security domain gateways. Each of which must have a clear policy defined in CP/CPS. 1/16/2019 B.Ramamurthy

Naming Important issues in developing CP is the naming of the principals. Tendency is to pack a lot o information into a subject’s name (ex. X.500 style). However less information helps is certificate management. For example, certificate can have flat name space with the common name of the entity and a random string. On the other hand if it is a hierarchical namespace then consider full organizational hierarchy in naming. 1/16/2019 B.Ramamurthy

The Certification Authority Model Single CA provider is a common model. A central CA that has an overall CP and subordinate policies for a collection of VOs. An independent can be assigned the job of operating in the CA infrastructure. There is a Root CA that certifies the subordinate CAs that issue users certificates. See the attached figure. 1/16/2019 B.Ramamurthy