A A A N C N U I N F O R M A T I O N T E C H N O L O G Y : IT OPERATIONS 1 Problem Management Jim Heronime, Manager, ITSM Program Tanya Friehauf-Dungca,

Slides:



Advertisements
Similar presentations
SERVICE MANAGER 9.2 PROBLEM MANAGEMENT TRAINING JUNE 2011.
Advertisements

A BPM Framework for KPI-Driven Performance Management
The Value of a Project Management Office Copyright: Kathy J. Lang, 2004.
PROJECT RISK MANAGEMENT
8D Corrective Action. 2 8D Problem Solving & Corrective Action: Initiate 8D Corrective Action D1 - Create Problem Solving Team D2 - Define the Problem.
Page 2 Agenda Page 3 History –Blue Print, 2000 –GIS Process 1.2, 2001 (training only) –GIS Process 2.0, (ITIL based - not implemented) –Supply/Demand.
Problem Management Process Training
Root Cause Analysis Procedure (IMSP525) Training
Best Practices – Overview
Problem Management Overview
ITIL: Why Your IT Organization Should Care Service Support
IS&T Project Management: How to Engage the Customer September 27, 2005.
Runway Safety Teams (RSTs) Description and Processes Session 5 Presentation 1.
Problem Management ISD Division Office of State Finance.
ITIL Process Management An Overview of Service Management Processes Presented by Jerree Catlin, Sue Silkey & Thelma Simons.
Top Tactics for Maximizing GMP Compliance in Blue Mountain RAM Jake Jacanin, Regional Sales Manager September 18, 2013.
Problem Management SDG teamIT Problem Management Process 2009 Client Services Authors: M. Begley & R. Crompton, Client Services.
ITPD PRODUCTION SUPPORT PROCESS OCTOBER 8, /15/2015 Guiding Principles 1.Support the business area’s needs to execute transactions and expand.
1 Federal Communications Commission (FCC) Communications Security, Reliability and Interoperability Council (CSRIC) Working Group 1A - Public Safety Consolidation.
What is Business Analysis Planning & Monitoring?
The Information Component: Help Desk Performance Measures
PMP® Exam Preparation Course
© Mahindra Satyam 2009 Defect Management and Prevention QMS Training.
Service Management Processes
Project Tracking. Questions... Why should we track a project that is underway? What aspects of a project need tracking?
Deakin Richard Tan Head, Information Technology Services Division DEAKIN UNIVERSITY 14 th October 2003.
ITIL Process Management An Overview of Service Management Processes Thanks to Jerree Catlin, Sue Silkey & Thelma Simons University of Kansas.
3.08 b Determine venture’s information technology.
Service Transition & Planning Service Validation & Testing
Software Project Management
SacProNet An Overview of Project Management Techniques.
ISM 5316 Week 3 Learning Objectives You should be able to: u Define and list issues and steps in Project Integration u List and describe the components.
Project Tracking and Monitoring QMS Training. 2 Objective To track and monitor the progress of the project and take appropriate corrective actions to.
Monitor & Control Risks 1 MEC-4. What is Monitoring & Controlling Risks? 2 » Monitoring & Controlling Risks is the process of: implementing Risk Response.
ITPD PRODUCTION SUPPORT PROCESS OCTOBER 8, /23/2015 Guiding Principles 1.Resolve production issues in a timely and effective manner 2.Manage.
Search Engine Optimization © HiTech Institute. All rights reserved. Slide 1 What is Solution Assessment & Validation?
Develop Project Charter
Lecture 12 Managing Project Control and Closure. Project Control The process that allows monitoring and measurement of project progress and directing.
Moving to a Results Based-Management Environment Progress Report April
Test status report Test status report is important to track the important project issues, accomplishments of the projects, pending work and milestone analysis(
Project Management Project Integration Management Minder Chen, Ph.D. CSU Channel Islands
Project Management Basics
Software Quality Assurance SOFTWARE DEFECT. Defect Repair Defect Repair is a process of repairing the defective part or replacing it, as needed. For example,
Monitor & Control Risks 1 MEC-4. What is Monitoring & Controlling Risks? 2 » Monitoring & Controlling Risks is the process of: implementing Risk Response.
Company: Cincinnati Insurance Company Position: IT Governance Risk & Compliance Service Manager Location: Fairfield, OH About the Company : The Cincinnati.
Chapter 3: Business Continuity Planning. Planning for Business Continuity Assess risks to business processes Minimize impact from disruptions Maintain.
1 An Overview of Process and Procedures for Health IT Collaboration GSA Office of Citizen Services and Communications Intergovernmental Solutions Division.
Introduction to ITSM processes. CONFIDENTIAL Agenda Problem Management  Overview  High Level process Change Management  Overview  High Level process.
Company LOGO. Company LOGO PE, PMP, PgMP, PME, MCT, PRINCE2 Practitioner.
ITIL and Remedy ITSM Implementation Overview
Problem Management for ITSD “Getting to the root of it” Thatcher Deane Feb 28, 2013.
Info-Tech Research Group1 Info-Tech Research Group, Inc. Is a global leader in providing IT research and advice. Info-Tech’s products and services combine.
Strana 1MBA kurz informačního inženýrství INITIATE CONSTRUCT DELIVER MAINTAIN & SUPORT quality assurance, manage project, trainig&education, manage.
IS&T Project Reviews September 9, Project Review Overview Facilitative approach that actively engages a number of key project staff and senior IS&T.
IT Infrastructure Library (ITIL) For You -With Max Mart The ManageEngine Guy!!
Process Knowledge Playback Maintenance Project Presented By- Alka Mehta(885026) Arpit Saran(838563) Kirti Singh(883796) Kumari Meenu(840291) Rishabh Jha(835286)
Service Management World Class Operations - Impact Workshop.
Project Integration Management
Establishing Strategic Process Roadmaps
The Process Owner is the Secret Agent!
ITSM Governance is Imperative to Succeed
ITIL: Why Your IT Organization Should Care Service Support
ITIL: Why Your IT Organization Should Care Service Support
Manage Service Requests and Incidents
ITIL: Why Your IT Organization Should Care Service Support
Project Management Group
Fertilizer Industry Safety Information Analysis and Sharing Program
Manage Business Continuity Introductory Brief
{Project Name} Organizational Chart, Roles and Responsibilities
Presentation transcript:

A A A N C N U I N F O R M A T I O N T E C H N O L O G Y : IT OPERATIONS 1 Problem Management Jim Heronime, Manager, ITSM Program Tanya Friehauf-Dungca, Manager, Problem Management 2/17/11 A A A N C N U I N F O R M A T I O N T E C H N O L O G Y IT OPERATIONS

A A A N C N U I N F O R M A T I O N T E C H N O L O G Y : IT OPERATIONS 2 Agenda  PM Overview  History  Vision & Mission  Operational Level Agreement (OLA)  Action Items  Trending (Proactive Problem Management)  Facilitated Meetings (MIR & ToE)  KPIs and Metrics  Future Initiatives  Questions? Problem Management Team Members

A A A N C N U I N F O R M A T I O N T E C H N O L O G Y : IT OPERATIONS 3 Problem Management Overview  Main goal of Problem Management: – Detection of the underlying causes of an incident and the subsequent resolution and prevention of the incidents.  Problem Management ensures: – The identification and classification of problems, root cause analysis, and resolution of problems  Problem Management process also includes: – The formulation of recommendations for improvement, maintenance of problem records, and review of the status of corrective actions

A A A N C N U I N F O R M A T I O N T E C H N O L O G Y : IT OPERATIONS 4 History of PM at AAA  Began our formal Problem Management practice in – Track major incidents – ID Root cause for major incidents – Rudimentary MS-Access dB to store info  Began formal implementation of ITSM in June 2009 – Average root cause found was 55.4% – Mean time to close problems = 6 days  Implemented current iteration of Problem Management October By January – Average root cause found was 83% – Mean time to close problems = 3 days  We continue to mature our process

A A A N C N U I N F O R M A T I O N T E C H N O L O G Y : IT OPERATIONS 5 Vision and Mission  VISION: – To permanently eliminate problems in our production environment and prevent new problems from occurring  MISSION: – To aggressively identify root cause of problems and drive permanent solutions to stabilize our IT infrastructure  We do this by: – PROCESSES: Ensuring PM processes and procedures are followed by IT support teams – ACTION ITEMS: Managing assigned action items and their timeframes with support teams to drive permanent solutions – ROOT CAUSE: Driving root cause identification within OLA timeframes

A A A N C N U I N F O R M A T I O N T E C H N O L O G Y : IT OPERATIONS 6 OLAs for PM Be aggressive: 3 Business days to identify root cause - Report enables us to track daily progress

A A A N C N U I N F O R M A T I O N T E C H N O L O G Y : IT OPERATIONS 7 Action Items  Objective: – Action items are identified and assigned to drive permanent solutions  Types of Action Items: – Root cause identification for every problem created from an incident – Areas of improvement Documentation Process improvement & training Vendor management Hardware replacement  How are Action Items identified? – Incident management activities – Problem management activities – Root Cause Analysis – Meetings: Daily IT Operations Meeting, Major Incident Review (MIR), or Team of Experts (ToE)  How are they tracked? – Maximo – integrated system with Change, Incident, and Asset

A A A N C N U I N F O R M A T I O N T E C H N O L O G Y : IT OPERATIONS 8 Trend Analysis (Proactive Problem Management)  Objective: – Analyze related incidents for common root causes  Collaboration with Operations Bridge: – Weekly work sessions to identify potential areas of concern – The Problem Management team reviews related incidents to look for common symptoms, causes, or conditions  Commonalities identified by trend analysis? – A Global Problem record is created and assigned to the Service Owner with appropriately assigned action items  Service Owner analysis: – The Service Owner prioritizes their efforts – Determine to identify root cause – Prioritize and approve with business for funding, scheduling

A A A N C N U I N F O R M A T I O N T E C H N O L O G Y : IT OPERATIONS 9  Reporting: – The Problem Management team reports out on the status of the Trend records as appropriate until ticket closure  Examples of Trend statuses: Trend Identified – Pending funding Trend Identified – Pending approval Trend Identified – Pending change Trend Identified – Changes not funded Trend Identified – Not a managerial priority Determined not a trend Trend Analysis (Proactive Problem Management)

A A A N C N U I N F O R M A T I O N T E C H N O L O G Y : IT OPERATIONS 10 Major Incident Review (MIR)  What is it? – Evaluation of the incident process after a major incident  What’s it’s purpose? – Validate details of the incident record – Review incident handling – identify opportunities – Identify lessons learned - share across the enterprise – Identify action items  When is one required? – Mandated for all Severity 1 incidents – Lower severities by request or as needed  Why does Problem Management facilitate a Major Incident Review? – Unbiased view of events – no call involvement

A A A N C N U I N F O R M A T I O N T E C H N O L O G Y : IT OPERATIONS 11 MIR Agenda

A A A N C N U I N F O R M A T I O N T E C H N O L O G Y : IT OPERATIONS 12 MIR Template

A A A N C N U I N F O R M A T I O N T E C H N O L O G Y : IT OPERATIONS 13 Team of Experts (ToE)  What is it? – A special team of technical subject matter experts (SMEs) assembled to analyze and resolve critical problems at an accelerated pace to minimize or eliminate exposure.  How long has this process been in place? – This is one of our newest additions – since December 2010  Why are ToEs initiated? – Teams not collaboratively engaging each other – Need to identify root cause immediately – back to back incidents – Leadership’s request for information and status of critical or chronic problems

A A A N C N U I N F O R M A T I O N T E C H N O L O G Y : IT OPERATIONS 14 ToE (cont.)  ToE Activities – Root cause analysis – Brainstorm solutions and permanent fixes – Assign action items and due dates  Where’s the template? – Currently under construction

A A A N C N U I N F O R M A T I O N T E C H N O L O G Y : IT OPERATIONS 15 KPIs and Metrics  KPIs – Root cause identified within OLA – MIRs conducted for Sev1 Incidents  Operational Metrics – Total Problems by Severity – Problems by Causing Party – Outages by Domain (Applications, Network, Security, Servers, Telecom or Other)

A A A N C N U I N F O R M A T I O N T E C H N O L O G Y : IT OPERATIONS 16 KPIs *Baseline determined by internal historical data = 82% *Industry standards non-existent

A A A N C N U I N F O R M A T I O N T E C H N O L O G Y : IT OPERATIONS 17 KPI Details *2010 Average for RC Identified within OLA = 85.7%

A A A N C N U I N F O R M A T I O N T E C H N O L O G Y : IT OPERATIONS 18 Examples of Metrics *Change Freeze AT&T AAA NCNU

A A A N C N U I N F O R M A T I O N T E C H N O L O G Y : IT OPERATIONS 19 Future Initiatives  Workarounds and defects – Known Error Database  Action item validation – quality check on completed actions  ToE template development

A A A N C N U I N F O R M A T I O N T E C H N O L O G Y : IT OPERATIONS 20 Questions?  PROBLEM MANAGEMENT TEAM MEMBERS – Mark Hernandez - IT Service Transition Analyst V – Gessica Briggs-Sullivan – IT Service Transition Analyst III – Andrew Egan - Intern