February 10, 2010 RMS ERCOT 1/24/10 Production Issue Overview and Lessons Learned Karen Farley Manager, Retail Customer Choice.


2 Outline for RMS
• Upgrade History
• Migration Weekend
• Troubleshooting Timeline
• Market Impacts
• Lessons Learned
• Where to find system outage notices
• Where to find Help Desk contact information

3 Upgrade History
Project Retail Application Upgrades
• August release - upgrade of Inovis software for NAESB to v3.2.0
  – v3.2.0 failed testing in August – was pulled from the August release
• September release - upgrade of Inovis software for NAESB to v3.1.0
  – v3.1.0 passed internal testing
  – Migrated to production – rolled back to v3.0.2 on 9/27/09
• January release – upgrade of Inovis software for NAESB to v3.1.0 patch 28
  – v3.1.0 patch 28 was successfully tested in ERCOT CERT environment
    • Details on slide 4
  – Scheduled to migrate to Production on 1/24/10

4 Upgrade History
• CERT testing criteria – lessons learned from September rollback
  – Tested within Flight 1009
  – Test with individual MPs that are stand-alone entities
  – Test with at least one MP from each Service Provider
  – Test with a large file (for example: IDR Historical usage) to ensure there are no encryption / decryption – file size issues existing between ERCOT and MP

Testing Completed for January Release v3.1.0 patch 28 | Connectivity Completed | NAESB PGP setting changes would be needed at ERCOT
TDSP | 6 / 6 successful | 2 / 6
Service Providers | 6 / 6 successful | 2 / 6
REPs (no Service Provider) | 7 / 7 successful | 5 / 7

5 Migration weekend
• 1/24/10 Release weekend
  – After migration, transactions were flowing with MPs
  – Issue - outbound files failed to be decrypted on recipient side
  – Experienced intermittent transaction failures with no recognizable pattern
    • ~273 files had at least 1 NAESB failure
    • Many were processed successfully once the needed PGP changes were made
    • Some of these failures were due to starting up components in different order
  – Issues initially believed to impact a small number of MPs

"The ERCOT planned retail release completed at approximately 1:46 PM today, Sunday, January 24, [...] Should you have any issues, they can be reported to the ERCOT Help Desk at [...] or [...] or contact your ERCOT Account [...]"

6 Troubleshooting Timeline
• 1/24/10 Sunday
  – Continued to work issues with 2 REPs and 1 Service Provider
  – ERCOT contacted impacted parties, 1 was not available until Monday
    • Requested re-import of the ERCOT PGP key
    • 2 completed, 1 remained for Monday
  – 6:30pm – appeared issues could be resolved without a rollback
• 1/25/10 Monday
  – Larger number of exceptions identified
    • ~680 files had at least 1 NAESB failure
    • Many were reprocessed successfully after the keys were imported
    • ~300+ were due to 1 Service Provider being down (from Sun)
    • A small subset may be captured twice as they remained from the previous day and were again reprocessed
  – 1 REP continued to have issues with larger files, reprocessing appeared to work during lower peak times when files were not pending outbound to the MP
    • Some larger files would finish, some would not and then be retried and stay in a pending state, as more files were sent out and then failed, volumes pending increased
  – 9 separate Help Desk tickets received on 1/25/10

7 Troubleshooting Timeline Continued
• 1/26/10 Tuesday
  – 1 Service Provider from Monday believed the issues were on their side and was able to decrypt manually; ERCOT continued to reprocess files to that Service Provider
  – 12:58 PM - Market Notice sent to inform the Market that ERCOT was experiencing retail transaction processing issues
  – Decision made to continue to troubleshoot problems instead of rolling back to previous version
• 1/27/10 Wednesday
  – Continued analysis with vendor – see version comparison on slide 8

8 Troubleshooting Timeline
Version comparison
• Future upgrade release will be discussed in detail at TDTWG and scheduled to be part of a scheduled flight test.

Columns: Version | Native C PGP | Java PGP
Pre-release | 3.0.2 | X
Target version for 1/24/10 release | patch 28 | X
ERCOT rolled back to on 1/28/10 | patch 18 | X
Future release | 3.2.0 patch 16 | X | X

9 Troubleshooting Timeline Continued
• 1/27/10 Wednesday
  – Decision made to roll back to patch 18
  – ERCOT tested patch 18 with impacted MPs in CERT
• 1/28/10 Thursday
  – 11:00 AM - ERCOT hosted a Conference Call with the Market to discuss the NAESB issue and the planned emergency outage.
  – Continued remainder of CERT testing with impacted MPs
  – At 2:00 PM, emergency outage and the patch was released to production successfully and impacted MPs were receiving and decrypting files

10 Troubleshooting Timeline Continued
• 1/29/10 Friday
  – 3:00 PM – ERCOT hosted a Conference Call with the Market to discuss the NAESB issue, the patch that was made to the upgrade, and the plan for supporting the market in identifying the MPs affected and the transactions affected.
  – ERCOT had identified the files for which 997s were not received and, after the call, redropped them outbound to the market.

Date received | # of files
1/24/10 | 53
1/25/10 | [...]
1/26/10 | [...]
1/27/10 | [...]
1/28/10 | 65
Total | 742 files

11 Market Impacts
• Delay of transactions to TDSPs and REPs
• Transactions out of protocol
• Emergency outage to migrate to production
• TDSPs requested safety net process be followed, which results in additional manual efforts at TDSPs and REPs
  – TDSP #1 – 2816 safety nets (includes both Priority and Standard MVIs)
  – TDSP #2 – XXXX (may receive update from TDSP prior to RMS and will update)
• MarkeTrak issues – 57 from ERCOT to individual MPs with their details

TRAN TYPE | Count | Breakout
814s | 22976 | See breakout column
867s | [...]
[...]s | [...]
Total | 116296

Breakout | Count
814_03s | 2764 (* 376 were priority move ins)
[...]_24/25s | [...]
[...]_21s | 9004
All other 814s | 2343
Dupes | 8166

12 Lessons Learned
Communication
• Internal breakdown of communications at ERCOT delayed the notification to the market
  – Actions
    • Release Management – to provide additional details to RCS if there are known issues related to the release or outage and RCS will communicate issues to the Market in the completion notice.
    • RCS - Will follow up with Commercial Operations first thing in the morning on the 1st business day following the release or outage to identify if issues are resolved. If issues persist, RCS will confirm list of MPs that are impacted and send updated market notice.
    • RCS will review with TDTWG to determine if market participant production technical contact list from the testing worksheets should be included in Release and Outage notices.

13 Lessons Learned
Communication (continued)
• Help Desk tickets should be tracked to determine scope of impact more quickly
  – Actions
    • Production support - proactive review of tickets received during window of release and 1 business day after to identify any issues.
    • Review release changes with Help Desk to have the correct priority for release related issues.
    • Improve clarity in notification and ticket tracking for Level 2 support

14 Lessons Learned
Communication (continued)
• Awareness by Market of ERCOT software upgrade
  – Actions
    • RCS will review format of Market Notices with CCWG to determine if placement of who to contact in case of issues should be changed.
    • RMS review of PPL has been budget focus vs. functionality focus
Risk Management
• Review of CERT test issues
  – Actions
    • ERCOT will integrate flight testing schedule into future Inovis software upgrades

15 System Outage Notices -

16 Contact Us - Help Desk

17 Questions?