Controls Telecon October 3, 2007 Global Design Effort 1 Summary of OpenClovis Training Claude Saunders.

Slides:



Advertisements
Similar presentations
How We Manage SaaS Infrastructure Knowledge Track
Advertisements

Clustering Technology For Scaleability Jim Gray Microsoft Research
NGAS – The Next Generation Archive System Jens Knudstrup NGAS The Next Generation Archive System.
CACORE TOOLS FEATURES. caCORE SDK Features caCORE Workbench Plugin EA/ArgoUML Plug-in development Integrated support of semantic integration in the plugin.
Introduction to Maven 2.0 An open source build tool for Enterprise Java projects Mahen Goonewardene.
System Center 2012 R2 Overview
GENI Experiment Control Using Gush Jeannie Albrecht and Amin Vahdat Williams College and UC San Diego.
HP Quality Center Overview.
Abstract HyFS: A Highly Available Distributed File System Jianqiang Luo, Mochan Shrestha, Lihao Xu Department of Computer Science, Wayne State University.
2004 Cross-Platform Automated Regression Test Framework Ramkumar Ramalingam, Rispna Jain IBM Software Labs, India.
High Availability Group 08: Võ Đức Vĩnh Nguyễn Quang Vũ
Overview Of Microsoft New Technology ENTER. Processing....
The middleware that makes real time integration a reality.
© DSRG 2001www.cs.agh.edu.pl Cross Grid Workshop - Kraków Krzysztof Zieliński, Sławomir Zieliński University of Mining and Metallurgy {kz,
Hands-On Microsoft Windows Server 2008 Chapter 1 Introduction to Windows Server 2008.
ATIF MEHMOOD MALIK KASHIF SIDDIQUE Improving dependability of Cloud Computing with Fault Tolerance and High Availability.
Creating Business Workflow Using SharePoint Designer 2007 Presented by Tarek Ghazali IT Technical Specialist Microsoft SQL Server MVP Microsoft SQL Server.
Wind River VxWorks Presentation
LIGO-G E ITR 2003 DMT Sub-Project John G. Zweizig LIGO/Caltech Argonne, May 10, 2004.
The powerful capabilities of JBoss Middleware as cloud based services on OpenShift. Build applications. Integrate with other systems Orchestrate using.
Performance and Exception Monitoring Project Tim Smith CERN/IT.
Simply Connecting the World Corecess NMS Solution ViewlinX ViewlinX PowerPack Corecess NBI.
Framework for Automated Builds Natalia Ratnikova CHEP’03.
Technology Overview. Agenda What’s New and Better in Windows Server 2003? Why Upgrade to Windows Server 2003 ?  From Windows NT 4.0  From Windows 2000.
Introducing Reporting Services for SQL Server 2005.
Cluster Reliability Project ISIS Vanderbilt University.
DCOM (Overview) by- Jeevan Varma Anga.
1April 2002 © 2001, Cisco Systems, Inc. All rights reserved. © 2022, Cisco Systems, Inc. All rights reserved. May 2002 Cisco Networking Services Notification.
© 2006 Cisco Systems, Inc. All rights reserved. Optimizing Converged Cisco Networks (ONT) Module 6: Implement Wireless Scalability.
LIGO-G9900XX-00-M ITR 2003 DMT Sub-Project John G. Zweizig LIGO/Caltech.
Abierman-netconf-mar03 1 NETCONF BOF 56th IETF San Francisco, California March 17, 2003 Discussion: Admin:
August 3-4, 2004 San Jose, CA Developing a Complete VoIP System Asif Naseem Senior Vice President & CTO GoAhead Software.
Tool Integration with Data and Computation Grid GWE - “Grid Wizard Enterprise”
Issues Autonomic operation (fault tolerance) Minimize interference to applications Hardware support for new operating systems Resource management (global.
SONIC-3: Creating Large Scale Installations & Deployments Andrew S. Neumann Principal Engineer, Progress Sonic.
1 Computing Challenges for the Square Kilometre Array Mathai Joseph & Harrick Vin Tata Research Development & Design Centre Pune, India CHEP Mumbai 16.
20-May-2003HEPiX Amsterdam EDG Fabric Management on Solaris G. Cancio Melia, L. Cons, Ph. Defert, I. Reguero, J. Pelegrin, P. Poznanski, C. Ungil Presented.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
Grid programming with components: an advanced COMPonent platform for an effective invisible grid © 2006 GridCOMP Grids Programming with components. An.
Support required for running application software projects in the SL/CO/AP section M.Vanden Eynden October 2000 * A description of the software development.
1 Makes Mobile WiMAX Simple Netspan Overview Andy Hobbs Director, Product Management 5 th October 2007.
SONIC-3: Creating Large Scale Installations & Deployments Andrew S. Neumann Principal Engineer Progress Sonic.
Workforce Scheduling Release 5.0 for Windows Implementation Overview OWS Development Team.
CERN - IT Department CH-1211 Genève 23 Switzerland t High Availability Databases based on Oracle 10g RAC on Linux WLCG Tier2 Tutorials, CERN,
© The Sage Group plc 2001 Product Update Andy Birch.
Tool Integration with Data and Computation Grid “Grid Wizard 2”
Middleware for Fault Tolerant Applications Lihua Xu and Sheng Liu Jun, 05, 2003.
Highly Available Internet Telephony Fact or Fiction? Manfred Reitenspiess Fujitsu Siemens Computers Munich, Germany
March 2004 At A Glance The AutoFDS provides a web- based interface to acquire, generate, and distribute products, using the GMSEC Reference Architecture.
1 Murthy Esakonu June 3rd, 2009 Shenzhen China OpenSAF Developer Days 2009 Writing First OpenSAF Application Session OpenSAF.
OpenSAF Technical Overview Mario Angelic Technical Co-Chair OpenSAF Project June 4 th, 2009.
Hans Feldt Senior Software Engineer, Ericsson AB Developer Days June 2009 IMM in OpenSAF, status and future.
Brian Lauge Pedersen Senior DataCenter Technology Specialist Microsoft Danmark.
IBM Software Group © 2009 IBM Corporation IBM Tivoli Provisioning Manager Installation.
OpenSAF Architecture & Status
Introduction to OpenSAF
Current Generation Hypervisor Type 1 Type 2.
Transforming VLC into an SA-Aware Application
Integrating HA Legacy Products into OpenSAF based system
SI-SI Dependency Nagendra Kumar Senior Software Engineer,
Wizard Templates.
Software Research Directions Related to HA/ATCA Ecosystem
EIN 6133 Enterprise Engineering
Get Amazon AWS-DevOps-Engineer-Professional Exam Real Questions - Amazon AWS-DevOps-Engineer-Professional Dumps Realexamdumps.com
Management Solution for Cisco NG Advanced Security Services
20409A 7: Installing and Configuring System Center 2012 R2 Virtual Machine Manager Module 7 Installing and Configuring System Center 2012 R2 Virtual.
QNX Technology Overview
(c) 2011 Microsoft. All rights reserved.
Features Overview.
Mark Quirk Head of Technology Developer & Platform Group
Presentation transcript:

Controls Telecon October 3, 2007 Global Design Effort 1 Summary of OpenClovis Training Claude Saunders

2 Participants Trainer: Ron Yorgason of OpenClovis Trainees –From FNAL Linux Cluster Control and Monitoring Project Jim Kowalkowski Amitoj Singh Nirmal Seenu –From FNAL ILC/NML and Instrumentation Kevin Krause Alexei Semenov –From ANL Claude Saunders Shifu Xu We were trained on the not-yet-released new version of OpenClovis (was called 2.3, now called 3.0)

3 Product Overview High AvailabilitySystem Management Communication infrastructure Basic infrastructure Hardware abstraction OS abstraction Memory management HA container HA management Name service Group membership service Remote procedure calls Inter-Process communication Event service Checkpointing service CLI SNMP (sub-)agents Managed object repository Alarm management Fault management Chassis management Timers Developer friendly infrastructure Sys and binary Logging Debug CLI Debug Support GUI based IDE Code Generation

4 Offline Techniques –Engineering Design Discipline –Over design for spare capacity Runtime Techniques: Infrastructure Monitor the system Recovery and repair Services to offload application need to deal with failure High Availability Techniques to achieve it, and OpenClovis solutions IDE automation results in fewer bugs and errors –MIB Import –Code Generation Code base “hardened” in multiple environments Professional design services Platform Management Alarm Management Component Management Availability Management Framework Fault Management

5 HA Modeling SAF System Model made easy –1:1 UML constructs –Wizards to automate and simplify the process –Pull down menus to customize properties –Validation tools to ensure correctness of model SAF HA Model

6 AMF: SAF HA Model Example Node UNode V Node W Node X Cluster C1 C2 C3 C4 SU: S1 SU: S2 SG:SG1 C5C6 C7 SU: S3SU: S4SU: S5 SG:SG2 CSI:A1 CSI:A2 SI:A active standby CSI:B1 SI:B CSI:C1 SI:C active standby active standby

7 OpenClovis Release 2.2 High Availability –Availability Management (2N redundancy models) –Checkpointing Service Asynchronous Collocated –Fault Repair –Cluster Membership Platform Management –Alarm Management –Provisioning Support Pre-provisioning HW proxies –Chassis Management HPI MIB support Core Infrastructure –Component Management Boot Management Component health monitoring –Basic Infrastructure Debug and Logging support Transaction with two-phase commit and rollback Platforms –Big-endian support –Mixed-endian support –Linux 2.6 based distributions –SMP –Itanium 64-bit support

8 OpenClovis Release 2.3 Features Overall –Upgradeability support –Integrated with Wind River tool chain Platforms –SUN Netra ATCA Platform –Radisys Promentum 60x0 ATCA chassis and other chassis –Additional Linux 2.6 based distributions (WindRiver PNE Linux) Basic Infrastructure –Intelligent memory management –Support for binary log streams –Runtime and Offline Log Viewers Communication –TIPC migration High Availability –Generalized Group Membership –N+M redundancy model Node Management Infrastructure –Arbitrarily nested managed objects (MOs) –Node/blade independent MOs –MO attribute access modes –Transient MO attributes –Run-time metadata retrieval System Manageability –Integrated platform management SDK / OpenClovis IDE –Improved work-flow support –Tool integration via XML files –Enhanced MIB import –Usability improvements –SNMP code auto-generation –Project model templates –Improved application code generation –SAF API Generation

9 OpenClovis Release 3.0 – in planning Beta in Q3 of 2008 Platform –Full AMC support –MicroTCA support –Additional Processors (i.e. Cavium) –Multi-Chassis support –Solaris Support (Collaboration with Sun) Middleware Manageability –Run-time model configuration upgrade –Run-time north bound middleware configuration Upgrade Services –Currently in-process in SAF IDE –Increase code generation coverage –Tighter build integration –Target deploy and debug capability

10 Summary We were duly impressed with the depth and breadth of OpenClovis, despite the fact that: Ron the trainer was –A) Not very good at teaching –B) Only familiar with parts of OpenClovis A good part of what we got from training was a result of concentrated self-study and discussion (ie. being locked in a room for 3 days). Many questions remain. Product behaved well. –One problem with multi-failover lab. Scalability an open question. Hopefully Linux Cluster project can test these limits (1000+ nodes). A good portion of what we learned was about what the SAF specifications mean in practice. –This knowledge should be transferable to other SAF implementations if needed.