Software Research Directions Related to HA/ATCA Ecosystem

Software Research Directions Related to HA/ATCA Ecosystem Claude Saunders GDE/TDR March 14, 2007 Controls Telecon Global Design Effort

Start from premise:
- OK, we have ATCA/uTCA chassis, the needed cards, shelf manager, slot power management, hot swap, demonstrated failover modes, etc.
- I.e., what Ray Larsen has in mind for ATCA/uTCA development station R&D.
Now what?
- How do we use all this as a complete production system where stations will number in the thousands (plus many other resources)?
Sep. 20, 2006 Controls Telecon Global Design Effort

Managing Resources
We have a large number of resources to manage:
- Hardware (FRU: Field Replaceable Unit)
- Software components
  - Application and operating system components
Must provide:
- Monitoring
- Configuration management
And not just for ATCA/uTCA chassis and cards, but for the whole system, including routers, switches, filesystems, relational databases, etc.
- Crosses into the traditional IT support domain

Outstanding Issues (1)
How to monitor resources remotely?
- Out-of-band
  - More expensive, and can add complexity
  - Can still monitor and reconfigure primary functions in some failure scenarios
- In-band
  - Runs the risk of not being able to get monitoring/configuration traffic through in a failure scenario (e.g., network switch misconfiguration)
- Protocols and standards
  - SNMP (Simple Network Management Protocol)
  - CIM (Common Information Model)
  - IPMI over RMCP
  - Hybrid?
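
One way to picture the "Hybrid?" option is a monitor that prefers the in-band path and falls back to the out-of-band path only when the in-band path is unreachable. The sketch below is a minimal illustration under that assumption; `Reading`, `read_sensor`, and the probe callables are hypothetical names, not part of SNMP, CIM, or IPMI, and a real probe would wrap an actual protocol client.

```python
from dataclasses import dataclass
from typing import Callable, Optional

# A probe takes a resource name and returns a value,
# raising ConnectionError when its path is unreachable (assumed convention).
Probe = Callable[[str], float]


@dataclass
class Reading:
    resource: str
    value: float
    path: str  # "in-band" or "out-of-band"


def read_sensor(resource: str, in_band: Probe, out_of_band: Probe) -> Optional[Reading]:
    """Try the in-band path first, then fall back to the out-of-band path."""
    for path, probe in (("in-band", in_band), ("out-of-band", out_of_band)):
        try:
            return Reading(resource, probe(resource), path)
        except ConnectionError:
            continue  # this path is down; try the next one
    return None  # resource unreachable on both paths
```

Recording which path delivered each reading also gives the operator a cheap indication that the in-band network itself has a problem.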

Outstanding Issues (2)
How to monitor resources remotely (cont'd)
- Management console (aka NMS)
  - Commercial, OpenNMS, custom?
- Automated diagnosis
  - Beyond threshold alarms
- Where is the line between OOB (out-of-band) management and the control system itself?
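
As a small illustration of diagnosis that goes beyond threshold alarms, the sketch below correlates failures against a dependency map and reports only the resources whose own direct dependencies are still healthy. This is one generic technique assumed for illustration; the `root_causes` function and the resource names are hypothetical and do not describe any particular NMS.

```python
from typing import Dict, List, Set


def root_causes(depends_on: Dict[str, List[str]], failed: Set[str]) -> Set[str]:
    """Return the failed resources that have no failed direct dependency.

    A failed resource whose dependency has also failed is treated as a
    symptom, so its alarm can be suppressed in favor of the dependency's.
    """
    return {
        resource
        for resource in failed
        if not any(dep in failed for dep in depends_on.get(resource, []))
    }


# Hypothetical topology: two IOCs reach the network through one switch.
topology = {"ioc-1": ["switch-a"], "ioc-2": ["switch-a"], "switch-a": []}
print(root_causes(topology, {"ioc-1", "ioc-2", "switch-a"}))
```

With this topology, three simultaneous alarms reduce to a single one for the switch.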

Outstanding Issues (3)
How to get at remote serial consoles?
- ATCA cards, AMC cards, and remote FPGA-based devices on an Ethernet field bus will all have some kind of serial console.
- IPMI 2.0 has SOL (Serial-over-LAN)
  - Built into various commercial products
  - What do we need to do for our custom hardware?
- And again: out-of-band or in-band?

Outstanding Issues (4)
How to configure resources remotely?
- Loading code/configuration to processors, FPGAs, network switches, etc.
- Need a “resource model” to keep track of the state of hundreds of thousands of FRUs and software components.
  - For ATCA/uTCA, this includes the state of the shelf manager's hot-swap finite-state machine
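
To make the "resource model" idea concrete, here is a minimal sketch of a table that tracks each FRU's hot-swap state and rejects transitions that make no sense. The state names M0 to M7 follow the ATCA hot-swap state machine; the transition table is a simplified approximation assumed for illustration, not a normative copy of the PICMG specification, and `ResourceModel` and the FRU identifiers are hypothetical.

```python
from enum import Enum


class HotSwapState(Enum):
    M0_NOT_INSTALLED = 0
    M1_INACTIVE = 1
    M2_ACTIVATION_REQUEST = 2
    M3_ACTIVATION_IN_PROGRESS = 3
    M4_ACTIVE = 4
    M5_DEACTIVATION_REQUEST = 5
    M6_DEACTIVATION_IN_PROGRESS = 6
    M7_COMMUNICATION_LOST = 7


S = HotSwapState

# Simplified transitions (assumption for illustration only).
NORMAL = {
    S.M0_NOT_INSTALLED: {S.M1_INACTIVE},
    S.M1_INACTIVE: {S.M2_ACTIVATION_REQUEST},
    S.M2_ACTIVATION_REQUEST: {S.M3_ACTIVATION_IN_PROGRESS, S.M1_INACTIVE},
    S.M3_ACTIVATION_IN_PROGRESS: {S.M4_ACTIVE, S.M6_DEACTIVATION_IN_PROGRESS},
    S.M4_ACTIVE: {S.M5_DEACTIVATION_REQUEST, S.M6_DEACTIVATION_IN_PROGRESS},
    S.M5_DEACTIVATION_REQUEST: {S.M4_ACTIVE, S.M6_DEACTIVATION_IN_PROGRESS},
    S.M6_DEACTIVATION_IN_PROGRESS: {S.M1_INACTIVE},
}


def is_allowed(old: HotSwapState, new: HotSwapState) -> bool:
    if old is new:
        return True  # repeated report of the same state
    if new in (S.M0_NOT_INSTALLED, S.M7_COMMUNICATION_LOST):
        return True  # extraction or loss of contact can happen at any time
    if old is S.M7_COMMUNICATION_LOST:
        return True  # after contact is regained, accept the reported state
    return new in NORMAL.get(old, set())


class ResourceModel:
    """Tracks the last known hot-swap state of every FRU by identifier."""

    def __init__(self) -> None:
        self._state = {}

    def state_of(self, fru_id: str) -> HotSwapState:
        return self._state.get(fru_id, S.M0_NOT_INSTALLED)

    def update(self, fru_id: str, new: HotSwapState) -> None:
        old = self.state_of(fru_id)
        if not is_allowed(old, new):
            raise ValueError(f"{fru_id}: {old.name} -> {new.name} not allowed")
        self._state[fru_id] = new
```

A production model would also need persistence and a way to resynchronize with each shelf manager after a restart; both are left out here.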

Outstanding Issues (5)
Software frameworks for HA
- Is SAF compliance important?
- Virtual machines
  - Xen
- Clustering software
  - Heartbeat (linux-ha.org)
- HA middleware
  - OpenClovis
  - GoAhead Self-Reliant
  - UIUC ARMOR
- Where is industry (telecom) going with all this?
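
Clustering software of this kind generally rests on timeout-based failure detection: each node announces that it is alive, and a node that stays silent too long is declared dead so that failover can begin. The sketch below shows only that generic idea; it is not the implementation used by the Heartbeat project or by any of the middleware listed above, and `HeartbeatMonitor` is a hypothetical name.

```python
import time
from typing import Callable, Dict, List


class HeartbeatMonitor:
    """Declares a node dead when no heartbeat arrives within timeout_s."""

    def __init__(self, timeout_s: float, clock: Callable[[], float] = time.monotonic):
        self._timeout = timeout_s
        self._clock = clock  # injectable so the logic can be tested without sleeping
        self._last_seen: Dict[str, float] = {}

    def beat(self, node: str) -> None:
        """Record a heartbeat received from a node."""
        self._last_seen[node] = self._clock()

    def dead_nodes(self) -> List[str]:
        """Return the nodes whose last heartbeat is older than the timeout."""
        now = self._clock()
        return sorted(
            node for node, seen in self._last_seen.items()
            if now - seen > self._timeout
        )
```

The timeout is the main design trade-off: a short value speeds up failover but raises the risk of declaring a merely slow node dead.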

Aside on OpenClovis
- Was ClovisSolutions (proprietary software)
- Open-sourced all their products recently
- Easily downloaded, with well-documented examples of much of their functionality.
- Deployment targets are provided for the Intel ATCA CPUs that come with the Arrow starter kit.
- Good candidate for exploring SAF-compliant middleware.
- Provides a “resource model” as well as many libraries to support checkpointing, failover state management, etc.
- Provides SNMP-based resource monitoring

Outstanding Issues (6)
Given a particular control system (EPICS, DOOCS, Tango, etc.) and a framework such as OpenClovis, how do we:
- Instrument the control system software so it can be managed by HA middleware?
- Checkpoint during the operation of a soft real-time control system?
  - Or even just suspend operation gracefully
- Design device drivers that can readily hand off their state to a peer?
Assuming we deem solving the previous issues relevant to increasing availability…
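
As a rough picture of what handing driver state to a peer could involve, the sketch below gives a driver a `checkpoint()` method that serializes its state and a `restore()` method that a standby instance uses to take over. Everything here is an assumption for illustration: `MotorDriver`, its fields, and the JSON encoding are hypothetical and do not reflect the EPICS, DOOCS, Tango, or OpenClovis APIs.

```python
import json


class MotorDriver:
    """Hypothetical device driver whose state can be moved to a standby peer."""

    CHECKPOINT_VERSION = 1

    def __init__(self) -> None:
        self.setpoint = 0.0
        self.enabled = False

    def checkpoint(self) -> bytes:
        """Serialize everything a peer needs in order to continue operation."""
        state = {
            "version": self.CHECKPOINT_VERSION,
            "setpoint": self.setpoint,
            "enabled": self.enabled,
        }
        return json.dumps(state).encode("utf-8")

    def restore(self, blob: bytes) -> None:
        """Load a checkpoint produced by a peer, rejecting unknown versions."""
        state = json.loads(blob.decode("utf-8"))
        if state.get("version") != self.CHECKPOINT_VERSION:
            raise ValueError("unsupported checkpoint version")
        self.setpoint = float(state["setpoint"])
        self.enabled = bool(state["enabled"])


# The primary checkpoints; the standby restores and can take over.
primary = MotorDriver()
primary.setpoint, primary.enabled = 12.5, True
standby = MotorDriver()
standby.restore(primary.checkpoint())
```

The open question on the slide remains: when, during soft real-time operation, it is safe to take such a checkpoint, and how state held in the hardware itself is reconciled after the hand-off.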