Technical Stop feed-down P.Charrue on behalf of the BE Controls Group 5th September 2011P.Charrue - 8h30 meeting1.

Slides:



Advertisements
Similar presentations
JQuery MessageBoard. Lets use jQuery and AJAX in combination with a database to update and retrieve information without refreshing the page. Here we will.
Advertisements

Mobile Application Architectures
BE/CO Changes in LS1 to the Software Development Infrastructure and Widely Used Libraries Chris Roderick, Greg Kruk, Katarina Sigerud, Luigi Gallerani,
Handshake over DIP 10/08/20091R. Alemany - BE/OP/LHC.
Supervision of Production Computers in ALICE Peter Chochula for the ALICE DCS team.
BE-CO work for the TS Nov 8 Nov 11P.Charrue - BE/CO - LBOC1.
Chapter 10 Server Administration1 Ch. 10 – Server Administration MIS 431 – created Spring 2006.
ILLiad Migration & Server Upgrade: From Your Library's' IT Point of View Juan Denzer Library System Specialist August 1, 2013.
NovaBACKUP 10 xSP Technical Training By: Nathan Fouarge
Choose & Book Common Issues Identified. Problems with Choose and Book 2 Slow Performance 3 Strange Behaviours 1 Does not Run.
controls Middleware – OVERVIEW & architecture 26th June 2013
TE/MPE/MI OP section meeting 29 th September 2009 HCC 2009 Frequently Asked Questions 0v1 M. Zerlauth.
Desktop Security: Worms and Viruses Brian Arkills, C&C NDC-Sysmgt.
Wojciech Sliwinski for BE-CO group Special thanks to: E.Hatziangeli, K.Sigerud, P.Charrue, V.Baggiolini, M.Sobczak, M.Arruat, F.Ehm LHC Beam Commissioning.
E. Hatziangeli – LHC Beam Commissioning meeting - 17th March 2009.
NETWORK CENTRIC COMPUTING (With included EMBEDDED SYSTEMS)
A. Dworak BE-CO-IN, CERN. Agenda 228th June 2012  Sum up of the previous report  Middleware prototyping  Transport  Serialization  Design concepts.
LHC 30 th and 31 st July :10 Finish scrubbing fill 25 ns trains by operator dump. –No XPOC data … 08:40 Inject for another scrubbing fill 09:09.
Programming and Application Packages
W. Sliwinski – eLTC – 7March08 1 LSA & Safety – Integration of RBAC and MCS in the LHC control system.
8 Hour Heat Run Sequencer History of the test Analyze of the events Memory space used by the sequencer Questions in view of the future tests.
Guidelines for Homework 6. Getting Started Homework 6 requires that you complete Homework 5. –All of HW5 must run on the GridFarm. –HW6 may run elsewhere.
Chapter 4 Initial Configuration Tasks. Understanding the Initial Configuration Tasks window Microsoft now provides a new feature, the Initial Configuration.
PSEN Server Balance EN/ICE Procedures Jean-Charles Tournier EN/ICE/SCD 09-September-2015.
WorkPlace Pro Utilities: Profile Upgrade Utility WorkPlace Tech 4.0 SNVT Utility.
Controls Issues Injection beam2 test meeting 28 th Aug 2008 Eugenia Hatziangeli Input from J. Lewis, M. Sobzak, JJ Gras, C. Roderick, M.Pace, N. Stapley,
Operational tools Laurette Ponce BE-OP 1. 2 Powering tests and Safety 23 July 2009  After the 19 th September, a re-enforcement of access control during.
WLCG Service Report ~~~ WLCG Management Board, 1 st September
1 The System Menu. 2 The System menu Dashboard Page displayed upon every login. It encompasses several boxes organised in two columns that provide a complete.
UTC-Timing problem P.Charrue for the BE/CO/Timing team 1.
Wojciech Sliwinski for the BE-CO Middleware team: Wojciech Buczak, Joel Lauener Radoslaw Orecki, Ilia Yastrebov, Vitaliy Rapp (GSI)
LHC BLM Software revue June BLM Software components Handled by BI Software section –Expert GUIs  Not discussed today –Real-Time software  Topic.
Session 1 Introduction  What is RADE  Technology  Palette  Tools  Template  Combined Example  How to get RADE  Questions? RADE Applications EN-ICE-MTA.
Monday h00: End of fill #1640: 3.7 pb h14: RB S12 and S23 tripped before loading rampdown table, active filter. MPE piquet. Fip_Com_Lost.
FGC Upgrades in the SPS V. Kain, S. Cettour Cave, S. Page, J.C. Bau, OP/SPS April
Nominal intensity bunches ● First ramp with nominal intensity bunches suffered from an instability appearing around 1.8 TeV. ● Nominal intensity bunches.
BE-CO review Looking back at LS1 CERN /12/2015 Delphine Jacquet BE/OP/LHC Denis Cotte BE/OP/PS 1.
A PC Wakes Up A STORY BY VICTOR NORMAN. Once upon a time…  a PC (we’ll call him “H”) is connected to a network and turned on. Aside: The network looks.
FESA S. Deghaye for the FESA team BE/CO. What happened since April? followed by “Our plans”
GGUS summary (3 weeks) VOUserTeamAlarmTotal ALICE4004 ATLAS CMS LHCb Totals
SRM-2 Road Map and CASTOR Certification Shaun de Witt 3/3/08.
DIAMON Project Project Definition and Specifications Based on input from the AB/CO Section leaders.
V. Kain – eLTC – 7March08 1 V.Kain, S. Gysin, G. Kruk, M. Lamont, J. Netzel, A. Rey, W. Sliwinski, M. Sobczak, J. Wenninger LSA & Safety - RBAC, MCS Roled.
RDA3 Transport Joel Lauener on behalf of the CMW team 26th June, 2013
Saturday 11.9 ● From Friday – Minimum required crossing angle is 100  rad in 2010 – Plenty of aperture at triplets: > 13  (n1 > 10) – Can stay with 170.
CO Timing Review: The OP Requirements R. Steerenberg on behalf of AB/OP Prepared with the help of: M. Albert, R. Alemany-Fernandez, T. Eriksson, G. Metral,
PC Current Interlocking for the SPS Fast Extractions. 1 J. Wenninger July 2009.
Log Shipping, Mirroring, Replication and Clustering Which should I use? That depends on a few questions we must ask the user. We will go over these questions.
Industrial Control Engineering Session 1 Introduction  What is RADE  Technology  Palette  Tools  Template  Combined Example  How to get RADE 
E. Hatziangeli – LHC Beam Commissioning meeting - 3 rd March 2009.
BE-CO work for the TS Outcome of the actions 23 – 28 Apr May 12P.Charrue - BE/CO - LBOC1.
CACI Proprietary Information | Date 1 PD² v4.2 Increment 2 SR13 and FPDS Engine v3.5 Database Upgrade Name: Semarria Rosemond Title: Systems Analyst, Lead.
How To Make Easysite Forms By Joshua Crawley Contact:
MPE Workshop 14/12/2010 Post Mortem Project Status and Plans Arkadiusz Gorzawski (on behalf of the PMA team)
Monitoring Dynamic IOC Installations Using the alive Record Dohn Arms Beamline Controls & Data Acquisition Group Advanced Photon Source.
LSA Core overview 6 / 11 / 2007 Wojciech Śliwiński (AB-CO-AP) on behalf of LSA team.
AB-CO Exploitation 2006 & Beyond Presented at AB/CO Review 20Sept05 C.H.Sicard (based on the work of Exploitation WG)
HOW TO FIX MSVCR100. DLL IS MISSING ERROR? missing-error.
H2LC The Hitchhiker's guide to LSA Core Rule #1 Don’t panic.
LHC Beam Commissioning Meeting V. Kain & R. Alemany
Network/Controls issue 15 September h00 – 9h30
Middleware – ls1 progress and planning BE-CO Tc, 30th september 2013
LSA/InCA changes during LS1
Weird Stuff I Saw While ... Supporting a Java Team
CMW infrastructure Status report
BLM settings management in LSA
LHC Fast Timing Commissioning
LHC BLM Software audit June 2008.
February 11-13, 2019 Raleigh, NC.
Presentation transcript:

Technical Stop feed-down P.Charrue on behalf of the BE Controls Group 5th September 2011P.Charrue - 8h30 meeting1

Reminder: CO planned changes during TS#4 New logging storage space – Done last Monday and tested ok New routers configurations in the CCR – Done last Monday and tested ok A new version of japc-monitoring was delivered and deployed (V3.4.0) Release LSA for LHC/SPS with changes to allow publication of Trims New PRO release of cmw-rda v.2.9.x Java library RDA Upgrade of all Proxies (LHC & Injectors) Upgrade of RBAC A1 servers 5th September 2011P.Charrue - 8h30 meeting2

Controls issues from eLogbook Communication problems with handshake with experiments: – injection handshake not working. – we are publishing the WARNING but looks like the experiments do not receive anything, no reaction. – After calling Kris, we restarted the CMW-DIP-LHC-CRITICAL process Operation VISTAR is not working LSA issues with the collimators – Greg rolled back to the previous version and restarted the LSA servers XPOC missing filling pattern – Roman rolled back to the previous version LHC Operations page not working due to a problem with the fixed display framework... calling developers – Jakub restarted the VISTAR 5th September 2011P.Charrue - 8h30 meeting3

DIP and VISTAR K.Kostro The DIP issue was solved by restarting the CMW-DIP-LHC- CRITICAL process I have been called for the issue with the Vistar page not displaying anything. I saw that in admin Vistar monitorOn were counted but there were no updates. Since Joel released DIP gateway for the –EXP one I thought it could be related so I copied the old jars and restarted the gateway but this did not change anything. So the problem is probably in the Vistar code or environment. 5th September 2011P.Charrue - 8h30 meeting4

DataBase publication C.Roderick The quarterly database security patch applied to the LSA database on 31st August put the database Java re-publishing client into a strange state. The Java process was re-deployed with no code changes, but with an updated CMW RDA library. Things appeared to be working for a while, though not fully tested due to the ongoing LHC TS. During the LHC restart, publication problems appeared intermittently and could only be re- produced from running applications (e.g. in the CCC). The problem turned out to be related to some of the database connections in the SPS/LHC LSA Server's connection pool to be in "bad state" - able to update data in tables, but not cause a publication of the data from the database. Restarting the LSA sever and the Java database re-publishing server on Saturday around 20:10 seems to have solved the problem. Since the restart, other problems have been observed, sometimes related to values whose source is ultimately the publications from the LSA database (e.g. no property updates after re-connection observed by Roman and Alick- re-subscription solves the problem). Alick also mention no DIP updates from the Experiments. However, these problems do not seem to be caused by database publishing which appears to be working normally. 5th September 2011P.Charrue - 8h30 meeting5

XPOC and Sequencer R.Gorbonosov Problems with XPOC related to the values not received from the DB (filling pattern and next injection bucket number). The diagnosis of these problems Chris described very well. LHC Sequencer experienced some problems on Saturday morning while checking the state of power converters: random “front-end is down” exceptions: each task repetition produced similar exception for different devices while the PCs synoptic didn’t show any problems The only significant change done in LHC Sequencer was switching to the latest cmw-rda library. Other changes are related to the Sequence Editor development and are not used during the sequence execution. After rolling back to the previous version of Sequencer (using the previous version of cmw-rda ) this problem seems to disappear. 5th September 2011P.Charrue - 8h30 meeting6

LSA - G.Kruk 1. LHC Collimator task that compares HW and DB settings – To fix the problem I rolled back version of that module to previous on LHC and SPS servers. – I will know more on Monday when I will check it with InCA team. 2. “Problem with FIDEL task” – It looks like the Fidel classes keep an internal state (Hardware group name). – This variable seems to be initialized by a sequencer task during injection and then used later during ramp. – The problem comes if the LSA server is rebooted in the meantime. – This weekend I rebooted LSA server twice – on Friday to cure the LHC Collimator task and on Saturday when Chris asked me to do it to solve the publication problem. – And the FIDEL problem occurred twice after these reboots. I’ll talk to M. Strzelczyk (Fidel Responsible) about that on Monday. 3. “LSA check on PC state not working properly” – This sequencer task is implemented in Java class called “LSA” but it doesn’t use LSA Server to perform the check of PC state. – It reads it directly from FGCs via JAPC/CMW. See Roman’s mail for details. 4. “LSA is tired: it takes MINUTES to send an actual trim” – The trim was actually relatively fast, it took about 200ms. The problem was with sending the value to the FGC. From LSA all settings are send to the HW using asynchronous SET. – So basically we do a SET and we wait for the hardware to send us ACKnowledgement. – Typically ACKs come after 1-2 sec (for certain device types after sec), but we wait up to 60 seconds and then we throw a timeout exception (that you can see in the Logbook entry). – We observed a similar problem few times in the past but it was happening only when driving thousands of FGCs at a time. – When talking to Wojtek we thought that it might be related to the fact that by default CMW drops messages coming from the HW if CMW queues are full on the client side. – This would explain quite well what we observed. So to solve the problem, during this TS I’ve set appropriate JACORB property to assure that no ACK messages are lost. But it seems that this didn’t solve the problem or the problem is elsewhere. – What is different is that so far it was happening only when sending settings to hundreds/thousands of devices. – In this case the SET was done only on 8 PCs. It looks a bit related to the problem from point 3. To be followed up.. 5th September 2011P.Charrue - 8h30 meeting7

Conclusion A few issues were fixed immediately by rolling back to the previous version The DB handshake seems to be solved but today the experts will have a closer look on the logfiles DIP and VISTAR problems will also be anaylsed in detail today LSA experts will contact FIDEL and FGC experts I will follow these up and keep OP informed 5th September 2011P.Charrue - 8h30 meeting8