CMW infrastructure Status report

Slides:



Advertisements
Similar presentations
Copyright Kenneth M. Chipps Ph.D.
Advertisements

BE-CO work for the TS Nov 8 Nov 11P.Charrue - BE/CO - LBOC1.
1 Thread Pools. 2 What’s A Thread Pool? A programming technique which we will use. A collection of threads that are created once (e.g. when server starts).
Wojciech Sliwinski for BE-CO group Special thanks to: E.Hatziangeli, K.Sigerud, P.Charrue, V.Baggiolini, M.Sobczak, M.Arruat, F.Ehm LHC Beam Commissioning.
E. Hatziangeli – LHC Beam Commissioning meeting - 17th March 2009.
A. Dworak BE-CO-IN, CERN. Agenda 228th June 2012  Sum up of the previous report  Middleware prototyping  Transport  Serialization  Design concepts.
LBTO IssueTrak User’s Manual Norm Cushing version 1.3 August 8th, 2007.
Controls Issues Injection beam2 test meeting 28 th Aug 2008 Eugenia Hatziangeli Input from J. Lewis, M. Sobzak, JJ Gras, C. Roderick, M.Pace, N. Stapley,
By Team T-Rex James Houlihan And Gavin Herbert
LHC BLM Software revue June BLM Software components Handled by BI Software section –Expert GUIs  Not discussed today –Real-Time software  Topic.
Grid Security Vulnerability Group Linda Cornwall, GDB, CERN 7 th September 2005
23/2/2000Status of GAUDI 1 P. Mato / CERN Computing meeting, LHCb Week 23 February 2000.
Lecture 4 Mechanisms & Kernel for NOSs. Mechanisms for Network Operating Systems  Network operating systems provide three basic mechanisms that support.
Post ACCOR until LS2: End of Life for CMW products CO3 meeting, 25th June 2015 Wojciech Sliwinski for the BE-CO Middleware team.
Technical Stop feed-down P.Charrue on behalf of the BE Controls Group 5th September 2011P.Charrue - 8h30 meeting1.
Issues concerning Device Access (JAPC / CMW / FESA) With input from: A.Butterworth, E.Carlier, A. Guerrero, JJ. Gras, St. Page, S. Deghaye, R. Gorbonosov,
FESA S. Deghaye for the FESA team BE/CO. What happened since April? followed by “Our plans”
DIAMON Project Project Definition and Specifications Based on input from the AB/CO Section leaders.
RDA3 Transport Joel Lauener on behalf of the CMW team 26th June, 2013
BE-CO work for the TS Outcome of the actions 23 – 28 Apr May 12P.Charrue - BE/CO - LBOC1.
Acquire Safe Solution When Gmail Password Reset Required Toll Free Number
Object Interaction: RMI and RPC 1. Overview 2 Distributed applications programming - distributed objects model - RMI, invocation semantics - RPC Products.
FGClite Feedback from BE-CO & SUWG(Smooth Upgrades)
Expense Auditing Process
Payments Report Verification and Request March 2017
Introduction to CAST Technical Support
DCS Status and Amanda News
NERC Published Lessons Learned
Vacation Tracking System
N-Tier Architecture.
Handout 1: Diary systems
Walter Binder Giovanna Di Marzo Serugendo Jarle Hulaas
Chapter 3 Internet Applications and Network Programming

UPDATED ELECTRONIC CLAIM FORM PROCESS
What Is The Quick Solution For Bigpond Hacked Account Problem?
Object-Oriented Network Communication (OOMI)
Computing infrastructure for accelerator controls and security-related aspects BE/CO Day – 22.June.2010 The first part of this talk gives an overview of.
WLCG Management Board, 16th July 2013
Middleware – ls1 progress and planning BE-CO Tc, 30th september 2013
AgilizTech Support Desk Overview
Fix yahoo ssl error code 5 call for help
Consular Crisis Management Division
Unlocking the mysteries of distributed microservice authorization
Chapter 12: Concurrency, Deadlock and Starvation
ONAP Security Sub-committee Update
ITPD ISSUE MANAGEMENT PROCESS SEPTEMBER 5, 2008
Example of a Problem Statement: Introduction into ARENA
How to Fix MacBook Frozen Problem?.
Online Guidance is Available to Repair HP Scanner
Unit4 Customer Portal Submitting & Managing Cases.
CNA Processes CVE Team.
Management of Change Report Errors to Management.
Unit 224 (B&A 36): Principles of providing administrative services
Introduction to CAST Technical Support
Implementation of the INFOD System using AOP
Example of a Problem Statement: Introduction into ARENA
Exception Handling Chapter 9 Edited by JJ.
Exception Handling Oo28.
Lessons learnt from the August trial electronic examinations
Website Planning EIT, Author Gay Robertson, 2018.
Test Administrator training — Part A
LHC BLM Software audit June 2008.
The Troubleshooting theory
Banafsheh Hajinasab Based on presentation by K. Strnisa, Cosylab
5 Why Process/Guidelines
Enhanced agent workspace for messaging
Presentation transcript:

CMW infrastructure Status report P.Charrue – LBCM 14 Sept 2010 For the CMW team

Outline Current CMW issues (3) CMW middle and long term plans How to report issues to the Controls Group 4th May 2010 P.Charrue - LBCM

Issue #1 – Blocked socket Description : JAVA clients blocked (XPOC project) and not getting data anymore from the devices Cause : Socket blocking situation in the JacORB CORBA library (part of the CMW infrastructure) – known bug in JacORB Occurence : Once to the XPOC client Often for the Logging infrastructure Immediate cure : Restart the client application as the blocking situation cannot be resolved CMW proposal : Today: We provide a callback to the client application which detect such blocking situation and take take action (mail, sms, alarm, restart, log, …) In 2 weeks: We will deliver a patch to this external Jacorb library to solve this blocking situation;currently tested. 4th May 2010 P.Charrue - LBCM

Issue #2 – Blocked Proxy Description : Cause : Occurrence : CMW Proxy is blocked due slow consuming clients Cause : ‘Slow clients’ subscribed to Proxy are not consuming the data quick enough and block many notification threads (in Proxy) resulting in a complete blocking of the Proxy Occurrence : BBQ, Hump Buster Immediate cure : Kill the ‘slow client’ application as the blocking situation cannot be resolved automatically CMW proposal : A new version of the Proxy has been developed that handles correctly slow clients (by reserving processing resources for every subscribed client) and minimizes impact of slow consumers on the well behaving clients Currently tested for the CMW-Proxy-BQ When the test are completed the upgraded Proxy will be deployed in close collaboration with Operations – end this week 4th May 2010 P.Charrue - LBCM

Issue #3 – Loss of client communication Description : Client/Server communication is lost inside the JAVA client application: busy CMW notification thread inside the JAVA client prevents any subsequent communication (idle socket in FIN_WAIT1 left in the FrontEnd) Cause : JAVA client CMW thread responsible for the socket operation is too busy by doing data processing and therefore cannot cleanly close the communication Occurrence : Collimators Immediate cure : Restart the JAVA application as the blocking situation cannot be resolved CMW proposal : Get more data from blocked JAVA application to confirm our hypothesis Organise code review with the authors of these JAVA clients to understand why the communication threads are blocked Help the developers of the Java Clients to move to JAPC (as this issue is solved using JAPC) 4th May 2010 P.Charrue - LBCM

CMW medium and long terms Medium term plans Deploy Proxies with support to slow clients Deploy patched Jacorb library to solve the JAVA client blocking situation Push the usage of JAPC to avoid the loss of communication from certain JAVA client applications Long term plans The CMW team is currently preparing a complete technical review of the Communication Infrastructure Several clients have already been interviewed The issues of the present infrastructure have been captured and prioritised along with the new functionality requested Several solutions have been evaluated External middleware experts have been contacted to help us confirm our choices The actual review will take place in October 2010 https://wikis.cern.ch/display/MW/CMW+Review 4th May 2010 P.Charrue - LBCM

How to report an issue to the Controls Group (reminder) From the e-logbook, a simple right-clic on an entry will create a JIRA issue Each JIRA issue is then assigned and is closely followed-up http://issues/browse/APS PS and SPS operators are making good use of this From your browser, go to http://issues and fill in a new JIRA issue As a last solution: Avoid direct email to individuals (they might be on vacation, not reading their mail, sick, on leave, ….) Instead opt for the support mailing lists (e.g. cmw-support@cern.ch, fesa-support@cern.ch, …) 4th May 2010 P.Charrue - LBCM