PEP-II Reliability and Uptime
Roger Erickson
13 October 2003
With thanks to C.W. Allen, W. Colocho, P. Schuh, M. Stanek, and the Operations staff members who collected the data.


Excludes “long” downtimes and holiday shut-downs.

Statistics: Causes of Unscheduled Down Time
3 PEP-II running periods considered: January 2000 through June 2003.
– [digits missing],936 total scheduled operating hours
– [value missing] hours unscheduled down time
– [value missing] reported malfunctions (“events”)
– [value missing] events directly tied to lost hours
We can sort the data by area of the machine (HER, linac, etc.), by system category (RF, vacuum, etc.), by date, and by details of resolution.

Accelerator Performance Statistics
Definitions:
– Revealed failures: malfunctions resulting in lost beam time. Also called “events”.
– Unscheduled down time: hours lost from scheduled program due to malfunctions.
– Mean Time to Fail: MTTF = (Scheduled beam time) / (Events)
– Mean Time to Repair: MTTR = (Unscheduled down time) / (Events)
– Availability = 1 - (Unscheduled down time) / (Scheduled beam time)
NOTE: PEP-II aborts are not counted as downtime unless the event is reported; i.e., unless we stop to fix something and make a database entry.
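The three definitions above can be sketched in a few lines of Python. The function name and the input numbers are hypothetical and chosen only for illustration; they are not PEP-II data.

```python
def reliability_metrics(scheduled_hours, downtime_hours, events):
    """Return (MTTF, MTTR, availability) using the slide's definitions."""
    if scheduled_hours <= 0 or events <= 0:
        raise ValueError("scheduled_hours and events must be positive")
    mttf = scheduled_hours / events                 # scheduled beam time per event
    mttr = downtime_hours / events                  # unscheduled down time per event
    availability = 1.0 - downtime_hours / scheduled_hours
    return mttf, mttr, availability


# Hypothetical inputs, NOT taken from the PEP-II database.
mttf, mttr, availability = reliability_metrics(
    scheduled_hours=5000.0, downtime_hours=500.0, events=250
)
print(f"MTTF = {mttf:.1f} h, MTTR = {mttr:.1f} h, availability = {availability:.1%}")
```

Because MTTF here divides by scheduled time rather than by up time, availability can also be written as 1 - MTTR/MTTF under these definitions.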

PEP-II Run Totals
Run 1: 1/12/00 – 10/31/00
Run 2: 2/4/01 – 6/30/02
Run 3: 11/15/02 – 6/30/03
Long annual downtimes and holiday shut-downs are not included.

Hardware Availability by Run
[Table: MTTF (hours), MTTR (hours), and Availability (percent) for Run 1, Run 2, and Run 3; values missing.]
MTTF has been getting shorter (worse) each run. MTTR improved from Run 1 to Run 2, but got worse during Run 3.

Unscheduled Downtime by Major System
Unscheduled down time (percentage), sorted by responsible system.
[Table: columns Run 1, Run 2, Run 3; rows Injection, PEP Rings, BaBar, PG&E Availability, Total (100.0); other values missing.]

MTTR: PEP-II Rings
[Table: events (Evnts), down-time hours (DT hrs), and MTTR for each of Run 1, Run 2, and Run 3; rows Power Supplies, Magnets, RF, Vacuum, Utilities, Controls, Safety, Other, Totals; values missing.]

Time Required for Repairs
Combined data set from all three runs.

Beam time lost         Events   % of total events   Hours down   % of total DT
> 0 to 1.0 hours       …        …                   …            …
> 1.0 to 2.0 hours     …        …                   …            …
> 2.0 to 4.0 hours     …        …                   …            …
> 4.0 to 8.0 hours     85       6.5%                …            …
> 8.0 to 24.0 hours    56       4.3%                …            …
> 24.0 hours           8        0.6%                …            …
Total                  …        …                   …            …
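A table of this shape can be produced by binning per-event repair durations. The sketch below assumes each bin includes its upper edge, as the "> a to b" labels suggest. The function name and the sample durations are hypothetical and are not PEP-II data.

```python
from bisect import bisect_left

# Upper edges (hours) of the slide's bins; the last bin is open-ended.
EDGES = [1.0, 2.0, 4.0, 8.0, 24.0]
LABELS = ["> 0 to 1.0", "> 1.0 to 2.0", "> 2.0 to 4.0",
          "> 4.0 to 8.0", "> 8.0 to 24.0", "> 24.0"]


def bin_repairs(durations_hours):
    """Count events and sum down-time hours per repair-time bin."""
    counts = [0] * len(LABELS)
    hours = [0.0] * len(LABELS)
    for d in durations_hours:
        if d <= 0:
            continue  # no beam time lost, so not a down-time event
        i = bisect_left(EDGES, d)  # a duration equal to an edge stays in the lower bin
        counts[i] += 1
        hours[i] += d
    return counts, hours


# Hypothetical repair durations in hours, NOT PEP-II data.
sample = [0.5, 1.0, 1.5, 3.0, 6.0, 30.0]
counts, hours = bin_repairs(sample)
total_events, total_hours = sum(counts), sum(hours)
for label, n, h in zip(LABELS, counts, hours):
    print(f"{label:>14} hours: {n} events ({n / total_events:.1%}), "
          f"{h:.1f} h ({h / total_hours:.1%})")
```

Even in this toy sample, the single event longer than 24 hours dominates the total down time, which is the kind of skew the table is meant to expose.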

PEP Rings Events Requiring > 2 hours to Repair
Run 3 data: 33% of PEP ring events require > 2 hours to repair. These account for 81% of PEP ring down time.

Problems Requiring > 24 hours to Fix
January 2000 – June 2003:
– 5 vacuum chamber failures in PEP rings. Some known vulnerabilities were already receiving attention. Vacuum task force is studying options for upgrading some chambers.
– 2 site-wide electrical power outages. These were outside SLAC’s control.
– SLTR quadrupoles overheated when cooling water pump stopped, but power remained on.

Recent Problems Requiring > 24 hours to Fix
August 20, 2003: VVS transformer failure in linac.
– Failure occurred during E158; no impact on PEP.
– Two days for full recovery.
– Failure was in the only dry-type transformer among 16 VVS’s.
– Oil-filled, fixed-ratio replacement options being investigated.
September 12, 2003: Site-wide power failure when tree grew too close to 230 kV line.
– Time lost to PEP program > 47 hours.
– Tree trimming had not been done on established schedule.
– SLAC now has new contract with tree-trimmer company, with option to renew for five years.

Underlying Problems Sometimes Cross Technical and Jurisdictional Boundaries
– Seasonal high ambient temperatures cause drift, jitter, timing shifts, spurious trips, and sometimes component failures in power supplies and sensitive electronics.
– Plan to air-condition the electronics alcove at Linac Sector 0, which houses the master oscillator and electronics critical to accelerator timing. A contract has been awarded.
– Several PEP support buildings have temperature control problems on hot days. More needs to be done to identify cost-effective improvements.
– An example of a problem not easily identified by counting malfunction reports.

Injection and Tuning
Normal top-off: Typically 4 to 5 minutes to fill at intervals of 40 to 50 min. Approx. 10% of scheduled run time.
Why is 21% spent injecting and tuning? Beam aborts require fill from scratch; typically 15 to 25 minutes each time.
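The "approx. 10%" figure for normal top-off follows from the fill time divided by the fill interval. The short check below assumes the 40 to 50 minute interval is measured from the start of one fill to the start of the next; that reading is an assumption, and the function name is hypothetical.

```python
def topoff_fraction(fill_minutes, interval_minutes):
    """Fraction of run time spent filling, assuming one fill per interval."""
    return fill_minutes / interval_minutes


low = topoff_fraction(4.0, 50.0)   # shortest fill, longest interval
mid = topoff_fraction(4.5, 45.0)   # midpoints of both quoted ranges
high = topoff_fraction(5.0, 40.0)  # longest fill, shortest interval
print(f"top-off fraction: {low:.1%} to {high:.1%}, midpoint {mid:.1%}")
```

The midpoint of the quoted ranges gives 10%, so the remaining share of the 21% has to come from fills from scratch after aborts.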

Beware of Double counting: An abort in one ring usually leads to an abort in the other.

HER RF Aborts
Station: Run 2 → Run 3
– 12-1: 0.33 → 1.1 aborts/day
– 12-3: 0.50 → 0.34
– 8-1: 0.22 → 0.57
– 8-3: 0.50 → 0.68
– 8-5: 0.51 → 0.66
– 12-6: 1.65* (Run 3 only)
Total = 2.1 → 5.0 aborts/day
– All stations were worse in 2003, except 12-3.
* 12-6 fault accounting only available since 10-May-2003.
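The quoted HER totals can be cross-checked by summing the per-station rates from the slide. This is only an arithmetic check; the variable names are illustrative.

```python
# Aborts per day by HER RF station, as quoted on the slide.
her_run2 = {"12-1": 0.33, "12-3": 0.50, "8-1": 0.22, "8-3": 0.50, "8-5": 0.51}
her_run3 = {"12-1": 1.1, "12-3": 0.34, "8-1": 0.57, "8-3": 0.68, "8-5": 0.66,
            "12-6": 1.65}  # 12-6 has no Run 2 value

total_run2 = sum(her_run2.values())
total_run3 = sum(her_run3.values())

# Stations whose abort rate went down from Run 2 to Run 3.
improved = [s for s in her_run2 if her_run3[s] < her_run2[s]]

print(f"Run 2 total: {total_run2:.1f} aborts/day")
print(f"Run 3 total: {total_run3:.1f} aborts/day")
print("Improved stations:", improved)
```

The sums round to the slide's 2.1 and 5.0 aborts/day. Note that the Run 3 total includes station 12-6, which has no Run 2 counterpart, so the two totals are not a like-for-like comparison.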

LER RF Aborts
Station: Run 3
– 4-3: 0.88 aborts/day*
– 4-4: 0.55 (was 0.56 in 2002)
– 4-5: 0.55 (was 0.53 in 2002)
Total = 2 aborts per day
* 4-3 fault accounting only available since 10-May-2003.

BaBar Radiation Aborts
3-year trend, based on data latched by accelerator control system:
– 2000: 5.6 aborts/day
– 2001: 4.1
– 2002: 3.6
– 2002/3: 2.8

Injection and Tuning Summary
Percentages of scheduled operating hours:
– Normal top-offs: 10%
– Fill from scratch following RF aborts: 6.3%
– Fill from scratch following BaBar radiation aborts: 3.5%
Approximate total: 20%
Trickle charging could have significant beneficial impact!
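The approximate total is the sum of the three contributions listed on the slide. The dictionary keys below are illustrative labels, not names from the source.

```python
# Percent of scheduled operating hours, as quoted on the slide.
budget = {
    "normal top-offs": 10.0,
    "fill from scratch after RF aborts": 6.3,
    "fill from scratch after BaBar radiation aborts": 3.5,
}

total = sum(budget.values())
abort_share = total - budget["normal top-offs"]
print(f"total injection and tuning: {total:.1f}% of scheduled hours")
print(f"share caused by aborts: {abort_share:.1f}%")
```

The itemized contributions sum to 19.8%, which rounds to the quoted 20%. Roughly half of that comes from refilling after aborts.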

Scheduled Off Time
– No routine scheduled maintenance days.
– Repair Opportunity Days (“RODs”) are launched when needed for show-stoppers or upgrade projects (typically 1/month).
– As many ROD and SML jobs as possible are completed during program interruption (typically 50 to 100 identified jobs).

Personnel Protection System (PPS) Testing
– Formerly required approx. 3 months of beam-off, most of which was folded into long downtimes, but “verifications” were required at 6-month intervals.
– Net impact on PEP program depended on interval between long downtimes. Typically about 2 weeks/year.
– New policies and procedures have reduced testing to about 3 weeks once each year to coincide with long downtimes, plus operator interlock checks.

Opportunities for Further PPS Testing Improvements
– Add switches and indicators to further decouple zones/subsections/systems for testing purposes.
– Further streamline test procedures (much progress made last year).
– Train/authorize more staff members, so that testing can be done 24 hours/day when opportunities arise.
Additional uptime to be gained? Possibly 1 week/year, depending on long downtime schedule and “opportunistic” down days.
Long-range proposal: Replace linac and BSY PPS with modern system to facilitate testing and minimize downtime for diagnosing problems.

How to Increase PEP-II Up Time: Challenges to Ourselves
– Allocate resources among hardware projects to achieve optimal improvement in MTTF.
– Identify common-mode or infrastructure projects that will improve overall uptime and stability.
– Find ways to reduce frequency of aborts.
– Minimize scheduled off time through policy and procedure changes and aggressive scheduling.
– Reduce MTTR with improved procedures, diagnostic tools, and organizational efficiency.