Leigh Grundhoefer Indiana University

Slides:



Advertisements
Similar presentations
CMS Applications Towards Requirements for Data Processing and Analysis on the Open Science Grid Greg Graham FNAL CD/CMS for OSG Deployment 16-Dec-2004.
Advertisements

1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
APPLICATION DEVELOPMENT BY SYED ADNAN ALI.
System Design/Implementation and Support for Build 2 PDS Management Council Face-to-Face Mountain View, CA Nov 30 - Dec 1, 2011 Sean Hardman.
Configuration Management Process and Environment MACS Review 1 February 5th, 2010 Roland Moser PR a-RMO, February 5 th, 2010 R. Moser 1 R. Gutleber.
08/11/908 WP2 e-NMR Grid deployment and operations Technical Review in Brussels, 8 th of December 2008 Marco Verlato.
LCG Milestones for Deployment, Fabric, & Grid Technology Ian Bird LCG Deployment Area Manager PEB 3-Dec-2002.
GIG Software Integration: Area Overview TeraGrid Annual Project Review April, 2008.
Open Science Grid Software Stack, Virtual Data Toolkit and Interoperability Activities D. Olson, LBNL for the OSG International.
GRACE Project IST EGAAP meeting – Den Haag, 25/11/2004 Giuseppe Sisto – Telecom Italia Lab.
Rsv-control Marco Mambelli – Site Coordination meeting October 1, 2009.
OSG Operations and Interoperations Rob Quick Open Science Grid Operations Center - Indiana University EGEE Operations Meeting Stockholm, Sweden - 14 June.
OSG Services at Tier2 Centers Rob Gardner University of Chicago WLCG Tier2 Workshop CERN June 12-14, 2006.
Integration and Sites Rob Gardner Area Coordinators Meeting 12/4/08.
OSG Middleware Roadmap Rob Gardner University of Chicago OSG / EGEE Operations Workshop CERN June 19-20, 2006.
EMI INFSO-RI SA2 - Quality Assurance Alberto Aimar (CERN) SA2 Leader EMI First EC Review 22 June 2011, Brussels.
INFSO-RI Enabling Grids for E-sciencE SA1: Cookbook (DSA1.7) Ian Bird CERN 18 January 2006.
Responsibilities of ROC and CIC in EGEE infrastructure A.Kryukov, SINP MSU, CIC Manager Yu.Lazin, IHEP, ROC Manager
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks State of Interoperability Laurence Field.
INFSO-RI Enabling Grids for E-sciencE OSG-LCG Interoperability Activity Author: Laurence Field (CERN)
Grid Operations Lessons Learned Rob Quick Open Science Grid Operations Center - Indiana University.
Ruth Pordes November 2004TeraGrid GIG Site Review1 TeraGrid and Open Science Grid Ruth Pordes, Fermilab representing the Open Science.
US LHC OSG Technology Roadmap May 4-5th, 2005 Welcome. Thank you to Deirdre for the arrangements.
EGEE-III-INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks EGEE-III All Activity Meeting Brussels,
OSG Integration Activity Report Rob Gardner Leigh Grundhoefer OSG Technical Meeting UCSD Dec 16, 2004.
6/23/2005 R. GARDNER OSG Baseline Services 1 OSG Baseline Services In my talk I’d like to discuss two questions:  What capabilities are we aiming for.
EGEE-III INFSO-RI Enabling Grids for E-sciencE Antonio Retico CERN, Geneva 19 Jan 2009 PPS in EGEEIII: Some Points.
Site Validation Session Report Co-Chairs: Piotr Nyczyk, CERN IT/GD Leigh Grundhoefer, IU / OSG Notes from Judy Novak WLCG-OSG-EGEE Workshop CERN, June.
Status Organization Overview of Program of Work Education, Training It’s the People who make it happen & make it Work.
The OSG and Grid Operations Center Rob Quick Open Science Grid Operations Center - Indiana University ATLAS Tier 2-Tier 3 Meeting Bloomington, Indiana.
System/SDWG Update Management Council Face-to-Face Flagstaff, AZ August 22-23, 2011 Sean Hardman.
G.Govi CERN/IT-DB 1 September 26, 2003 POOL Integration, Testing and Release Procedure Integration  Packages structure  External dependencies  Configuration.
Operations Activity Doug Olson, LBNL Co-chair OSG Operations OSG Council Meeting 3 May 2005, Madison, WI.
OSG Deployment Preparations Status Dane Skow OSG Council Meeting May 3, 2005 Madison, WI.
Status of Globus activities Massimo Sgaravatto INFN Padova for the INFN Globus group
Introduction to ITIL and ITIS. CONFIDENTIAL Agenda ITIL Introduction  What is ITIL?  ITIL History  ITIL Phases  ITIL Certification Introduction to.
Kati Lassila-Perini EGEE User Support Workshop Outline: – CMS collaboration – User Support clients – User Support task definition – passive support:
Area Coordinator Report for Operations Rob Quick 4/10/2008.
Operations model Maite Barroso, CERN On behalf of EGEE operations WLCG Service Workshop 11/02/2006.
Open Science Grid OSG Resource and Service Validation and WLCG SAM Interoperability Rob Quick With Content from Arvind Gopu, James Casey, Ian Neilson,
INFSO-RI Enabling Grids for E-sciencE Operations Parallel Session Summary Markus Schulz CERN IT/GD Joint OSG and EGEE Operations.
Components Selection Validation Integration Deployment What it could mean inside EGI
Opensciencegrid.org Operations Interfaces and Interactions Rob Quick, Indiana University July 21, 2005.
Integration TestBed (iTB) and Operations Provisioning Leigh Grundhoefer.
SAM architecture EGEE 07 Service Availability Monitor for the LHC experiments Simone Campana, Alessandro Di Girolamo, Nicolò Magini, Patricia Mendez Lorenzo,
Monitoring and Information Services Core Infrastructure (MIS-CI) Service Description Mark L. Green OSG Integration Workshop at UC Feb 15-17, 2005.
Grid Deployment Technical Working Groups: Middleware selection AAA,security Resource scheduling Operations User Support GDB Grid Deployment Resource planning,
OSG Facility Miron Livny OSG Facility Coordinator and PI University of Wisconsin-Madison Open Science Grid Scientific Advisory Group Meeting June 12th.
OSG User Group August 14, Progress since last meeting OSG Users meeting at BNL (Jun 16-17) –Core Discussions on: Workload Management; Security.
TeraGrid Software Integration: Area Overview (detailed in 2007 Annual Report Section 3) Lee Liming, JP Navarro TeraGrid Annual Project Review April, 2008.
Open Science Grid Interoperability
Bob Jones EGEE Technical Director
Accessing the VI-SEEM infrastructure
Regional Operations Centres Core infrastructure Centres
Operations Interfaces and Interactions
Open Science Grid Progress and Status
JRA3 Introduction Åke Edlund EGEE Security Head
Monitoring and Information Services Technical Group Report
BA Continuum India Pvt Ltd
Ian Bird GDB Meeting CERN 9 September 2003
Incident Response Plan for the Open Science Grid
LCG/EGEE Incident Response Planning
Maite Barroso, SA1 activity leader CERN 27th January 2009
LCG Operations Centres
LCG Operations Workshop, e-IRG Workshop
Description of Revision
Supporting Grid Environments
gLite The EGEE Middleware Distribution
{Project Name} Organizational Chart, Roles and Responsibilities
Presentation transcript:

Leigh Grundhoefer Indiana University OSG Grid Operations Leigh Grundhoefer Indiana University

leighg at indiana dot edu Agenda Grid Operations Development OSG Functional Service Cycle Deployment Integration Provisioning Production OSG Operations Activities 11/21/2018 leighg at indiana dot edu

leighg at indiana dot edu “Big Picture” Goals The OSG Operations activity and Support Centers Group has been tasked with the role of preparing, provisioning and running the infrastructure used for the OSG production environment. The operations activities duties to the OSG are to ensure that the production environment is usable for the current application base and to continue to evolve as a common service environment which is able to support multiple sciences with a application-friendly grid infrastructure. 11/21/2018 leighg at indiana dot edu

Grid Operations Security, Policy and Authentication Other Grids Integration Test Bed Provisioning Production Releases And Updates Documentation of software Registration, Verification and Monitoring Knowledge base of Commonly Answered questions Trouble reporting and ticketing Support Center coordinator 11/21/2018 leighg at indiana dot edu

Operations Environment Organization and Definition of core elements Grids Support Centers Virtual Organizations Resources Registration by Owners and Providers Common software services Verification and ongoing evaluation Resources and services Support Center ticketing, ticketing response VO services 11/21/2018 leighg at indiana dot edu

Operations Environment (cont.) Information from monitoring Job slots and file transfers Published policies Accounting Coordination of OSG Help Desk Coordinator of support centers Definition and execution of Standard Operational Procedures (SOPs). Definition and execution of Standard Operational Procedures (SOPs) which add to the availability, stability and enhancement of production infrastructure. 11/21/2018 leighg at indiana dot edu

Grid to Grid relationships Understand and create specialized help desk and trouble reporting schemas Understand and create monitoring and accounting interfaces Create and deploy identity and authorization interfaces 11/21/2018 leighg at indiana dot edu

leighg at indiana dot edu Agenda Grid Operations Development OSG Functional Service Cycle Deployment Integration Provisioning Production OSG Operations Activities 11/21/2018 leighg at indiana dot edu

Where do you get this stuff? (Architecture and Requirements) Release Candidate Blueprint (ARCH) ITB 0.3 Integration Test Bed Operations OSG 0.4 Provisioning VO’s Deployment Activity Release Description Service Development (Sponsored Activities) Technical Groups 11/21/2018 leighg at indiana dot edu

OSG Integration Activity Readiness plan Effort Resources Readiness plan adopted VO Application Software Installation Software & packaging OSG Deployment Activity Service deployment OSG Operations-Provisioning Activity Application validation Release Candidate Middleware Interoperability Functionality & Scalability Tests feedback Metrics & Certification Release Description 11/21/2018 leighg at indiana dot edu

leighg at indiana dot edu Provisioning Finalize all installation software and procedures OSG based software packages Pre-compiled binaries Source code (compiled during installation) Post configuration scripts ( configure_osg.sh ) Create production “Grid Support” documentation for all software and defined procedures. 11/21/2018 leighg at indiana dot edu

leighg at indiana dot edu Provisioning Translate OSG operations model into production operations activities Provide timelines for Resources and Support Centers Setup versioning using release procedures Install or upgrade grid wide services 11/21/2018 leighg at indiana dot edu

leighg at indiana dot edu 11/21/2018 leighg at indiana dot edu

leighg at indiana dot edu Production Reports Usage reports from OSG monitoring Operations reports to the OSG community Daily Sites Status reports to Support Centers Meetings Weekly Operations Activity Release-based provisioning Activity Weekly Support Centers Technical Group Weekly Documentation Activity Monitors for verification ACDC Operations Dashboard Issue handling 11/21/2018 leighg at indiana dot edu

Verification of Resouces ACDC Operation Dashboard provides detailed cyclic testing of all resources Tests based upon “site-verify” tool distributed with OSG common software, around 30 tests Tests results output available per test per site Five possible results No Information Pass Fail Error Not Tested Resources are grouped in to three areas: Production, Pending or Offline 11/21/2018 leighg at indiana dot edu

leighg at indiana dot edu Agenda Grid Operations Development OSG Functional Service Cycle Deployment Integration Provisioning Production OSG Operations Activities 11/21/2018 leighg at indiana dot edu

GOC: A Communications Hub Grid Operations Center Leveraged Coverage from GRNOC Abilene, NLR, TransPAC OSG Trouble Ticket System Trouble Ticket Exchange with Support Centers / Grids Weekly Issue tracking report Web Page Development and Documentation OSG Production web site www.opensciencegrid.org Commonly answered questions Knowledge Base Collaborative development information - osg.ivdgl.org OSG Registration Database 11/21/2018 leighg at indiana dot edu

leighg at indiana dot edu GOC: A Service Center Grid wide Information Services Registration Database Catalog Production Software Caches OSG Knowledge Base OpenScienceGrid web site Grid wide Monitoring Services GLUE information providers ACDC Dashboards GridCat MonaLisa archive Multiple OSG ITB resources Small but demonstrative OSG production resource 11/21/2018 leighg at indiana dot edu

GOC: Incident Response Response has been defined by the Security Technical Group All incidents should be reported mail aliases which are monitored by the GOC. The GOC maintains a list of local site security contacts, derived from the registrations. The GOC designed and implemented a specialized mail service for secure correspondence A response team leader forms a group to access, contain, and report on each incident. 11/21/2018 leighg at indiana dot edu

GOC: Policy Procedures Grid Service Change or Upgrade Registrations Support Centers Virtual Organizations Resources Critical OSG Release Update 11/21/2018 leighg at indiana dot edu

Grid Service Change/Update 11/21/2018 leighg at indiana dot edu

Support Center Registration 11/21/2018 leighg at indiana dot edu

Virtual Organization Registration 11/21/2018 leighg at indiana dot edu

Resource/Service Registration 11/21/2018 leighg at indiana dot edu

Critical Release Update 11/21/2018 leighg at indiana dot edu

leighg at indiana dot edu 11/21/2018 leighg at indiana dot edu

leighg at indiana dot edu OSG Community Support How do we support issues that fall outside the Support Model? Open Support Model Mailing Lists (OSG-General) Knowledge Base and Release Documentation Jabber chat room Weekly meetings 11/21/2018 leighg at indiana dot edu

leighg at indiana dot edu GOC: Conclusions Enables Users and Usage Creates a known and usable service environment Allows status, monitoring and accounting Helps users and applications “bridge the gap” between single use and grid based resource utilization 11/21/2018 leighg at indiana dot edu

Grid Operations Effort Five FTEs at Indiana University Two FTEs at FermiLab One FTE at LBNL One FTE at University of Chicago Help and direction encourage from everyone. 11/21/2018 leighg at indiana dot edu

leighg at indiana dot edu Thank you! 11/21/2018 leighg at indiana dot edu

Operations activities for 0.4 A distributed support organization with a central operations organization, with Indiana GOC as the central point of contact. Create and organize Support Centers so that they coordinate with each other. Metrics publishing. Create and develop policy and process interfaces to the EGEE and LCG operations organization. Deploy and maintain production instances of the Grid Catalogs. Validation and Testing Services Create and validate verification and performance testing mechanisms for the common services. 11/21/2018 leighg at indiana dot edu