Download presentation
Presentation is loading. Please wait.
Published byRussell Douglas Modified over 6 years ago
1
T. Kadowaki (Accelerator Engineering Corporation)
Reliability of HIMAC Control System 16 APR. 2013 Australia T. Kadowaki (Accelerator Engineering Corporation) 1
2
Contents What Is Software Trouble ? HIMAC Availability
Software Trouble in HIMAC Statistics Analysis Suggestion Software and Hardware Example of Our Actions Summary 2
3
What Is Software Trouble ?
Accelerator Trouble Power Supply RF Control System Cooling .... Network Computer Software Computer Hardware .... Bugs Abnormal Termination Hung-up Sequence Error 3
4
HIMAC Availability Upper ring Treatment rooms Lower ring 4
5
HIMAC Availability MORE THAN (almost) 98% Availability 5
6
Software Trouble in HIMAC
FY 2003 – 2011 (9 years) Number of Failure Downtime (hh:mm) Total : 160 events Total : 46:33 1:36 3 The Others Bugs Bugs 16 Hang-up Hang-up 7:45 Sequence Error Sequence Error 7:10 69 25:39 64 5:33 11 Abnormal Termination 6
7
Software Trouble in HIMAC
Sequence Error Retry the sequence 6 min. avg. Hang-up Notice abnormal condition Check the control system status Find program hang-up Program restart OK NG Computer restart 20 min. avg. 7
8
Software Trouble in HIMAC
How to notice the hang-up? Health check tool Process monitoring tool ..... System performance New bugs Which system is easy to restart? PLC > Win PC > UNIX WS ? System performance Hardware reliability 8
9
Control Software and Hardware
Network PLC breakdown HDD breakdown Hang-up 9
10
Control Software and Hardware
Software total : 46:33 Hardware total : 78:47 Control System Downtime ~ 20% Software Downtime < 10% 10
11
Example of Our Actions Network Packet Monitoring Network trouble or
NOT? 11
12
Operation Log Database
Example of Our Actions Operation Log Database Control Server Operation request Device status, response Operation Panel Local Devices DB Server 12
13
Control system trouble
Example of Our Actions Operation Log Database Control system trouble or NOT? 13
14
Summary Analyzed 2003 – 2011 operation data.
Software trouble is less than 10% of total downtime. Control software trouble mainly caused by “Sequence error” and “Hang-up”. “Hang-up” needs longer time to restore than “Sequence error”. We are searching for a system which is easy to notice “Hang-up” to restore. 14
15
Control System and the Other Systems
Ex1
16
Control System and the Other Systems
Ex2
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.