Doctor Implementation Plan (Discussion) Feb. 6, 2015 Ryota Mibu, Tomi Juvonen, Gerald Kunzmann, Carlos Goncalves.

Slides:



Advertisements
Similar presentations
1 Introducing the Specifications of the Metro Ethernet Forum.
Advertisements

High Availability Project Qiao Fu Project Progress Project details: – Weekly meeting: – Mailing list – Participants: Hui Deng
An Approach to Secure Cloud Computing Architectures By Y. Serge Joseph FAU security Group February 24th, 2011.
Virtualized Infrastructure Deployment Policies (Copper) 19 February 2015 Bryan Sullivan, AT&T.
Zhipeng (Howard) Huang
Please direct any questions or comments to
Useful Tools for Testing
Policy Architecture Discussion 18 May 2015 Bryan Sullivan, AT&T.
© 2015 AT&T Intellectual Property. All rights reserved. AT&T, the AT&T logo and all other AT&T marks contained herein are trademarks of AT&T Intellectual.
24 February 2015 Ryota Mibu, NEC
(OpenStack Ceilometer)
IETF 91: Open Platform for NFV Collaboration with I2NSF Chris Donley 1.
1 Doctor Fault Management 18 May 2015 Ryota Mibu, NEC.
**DRAFT** Doctor Southbound API 14 April 2015 Ryota Mibu, NEC.
FileSecure Implementation Training Patch Management Version 1.1.
R2 Test strategy. Test strategy Testing is still a key challenge for OPNFV All the projects must manage their test strategy (unit, fonctional, security,
Department of Computer Science Engineering SRM University
1 Doctor Fault Management - Updates - 30 July 2015 Ryota Mibu, NEC.
Python and REST Kevin Hibma. What is REST? Why REST? REST stands for Representational State Transfer. (It is sometimes spelled "ReST".) It relies on a.
CaDSR Freestyle Search June 11, caDSR Freestyle Search Overview Architecture Implementation Dependencies Futures 2.
VSPERF – Test and Performance Subgroup Demo 26/08/2015 Maryam Tahhan.
The Process Manager in the ATLAS DAQ System G. Avolio, M. Dobson, G. Lehmann Miotto, M. Wiesmann (CERN)
**DRAFT** Blueprints Alignment (OpenStack Ceilometer) 4 March 2015 Ryota Mibu, NEC.
OPNFV Qtip Status Update Vikram Dham, Nauman Ahad Sep 2, 2015.
Fault Localization (Pinpoint) Project Proposal for OPNFV
BoF: Open NFV Orchestration using Tacker
Gerald Kunzmann, DOCOMO Carlos Goncalves, NEC Ryota Mibu, NEC
Promise Resource Reservation 09 November 2015
Gerald Kunzmann, DOCOMO Carlos Goncalves, NEC Ryota Mibu, NEC
NFV Configuration Problem Statements Haibin Song Georgios Karagiannis
1 OPNFV Summit 2015 Doctor Fault Management Gerald Kunzmann, DOCOMO Carlos Goncalves, NEC Ryota Mibu, NEC.
TOSCA Monitoring Straw-man for Initial Minimal Monitoring Use Case Roger Dev CA Technologies Revision 2 May 1, 2015.
DDM Central Catalogs and Central Database Pedro Salgado.
Project Tacker Open Platform for NFV Orchestration V1.1 / 02/16/16.
**DRAFT** Doctor Host Maintenance Maintenance changes to OpenStack Nova 17 May 2016 Tomi Juvonen Nokia.
Cloud Installation & Configuration Management. Outline  Definitions  Tools, “Comparison”  References.
Ashiq Khan NTT DOCOMO Congress in NFV-based Mobile Cellular Network Fault Recovery Ryota Mibu NEC Masahito Muroi NTT Tomi Juvonen Nokia 28 April 2016OpenStack.
Ashiq Khan NTT DOCOMO Congress in NFV-based Mobile Cellular Network Fault Recovery Ryota Mibu NEC Masahito Muroi NTT Tomi Juvonen Nokia 28 April 2016OpenStack.
Model-Driven NFV (Models) Project 22 March 2016 Bryan Sullivan, AT&T.
Failure Inspection in Doctor utilizing Vitrage and Congress
June 20–23, 2016 | Berlin, Germany. Copper: Configuration Policy Management in OPNFV Colorado Bryan Sullivan, AT&T.
Doctor Host Maintenance Maintenance changes to OpenStack Nova 13 Jun 2016 Tomi Juvonen Nokia.
**DRAFT** Doctor+Congress OPNFV Summit June 2016 Doctor+Congress PoC team.
Project Tacker Open Platform for NFV Orchestration OPNFV Design Summit.
Doctor Tech Deep Dive Tomi Juvonen, Nokia Ryota Mibu, NEC.
Fault Management with OpenStack Congress and Vitrage, Based on OPNFV Doctor Framework Barcelona 2016 Ryota Mibu NEC Ohad Shamir Nokia Masahito Muroi.
Useful Tools for Testing
X V Consumer C1 Consumer C2 Consumer C3
Tina Tsou, Bryan Sullivan,
Doctor + OPenStack Congress
Ashiq Khan, NTT DOCOMO Ryota Mibu, NEC
Maintenance changes to OpenStack Nova 21 Jun 2016 Tomi Juvonen Nokia
OPNFV Doctor - How OPNFV project works -
Doctor PoC Booth Vitrage Demo
Dovetail project update
Online Software Status
Tomi Juvonen SW Architect, Nokia
Tomi Juvonen Software Architect, Nokia
Nov, 2015 Howard Huang, Huawei Julien Zhang, ZTE
Infrastructure Maintenance & Upgrade: Zero VNF Downtime with OPNFV Doctor on OCP Hardware AirFrame Open Rack V1 and V2 compatible Demo video:
Proactive RCA with Vitrage, Kubernetes, Zabbix and Prometheus
OpenStack Ceilometer Blueprints for Liberty
Vitrage Project Update, OpenStack Summit Berlin
Doctor OpenStack Controller changes Tomi Juvonen Nokia
**DRAFT** NOVA Blueprint 03/10/2015
**DRAFT** Doctor Southbound API 23 Feb 2016 Ryota Mibu, NEC.
Latest Update on Gap Analysis of Openstack for DPACC
Figure 3-2 VIM-NFVI acceleration management architecture
Latest Update DPACC Architecture
Doctor Host Maintenance
Presentation transcript:

Doctor Implementation Plan (Discussion) Feb. 6, 2015 Ryota Mibu, Tomi Juvonen, Gerald Kunzmann, Carlos Goncalves

Contents Functional Block (Architecture) –Type A: VIM does not run any recovery by itself –Type B: some actions done by VIM automatically # describe sequences in user and admin perspective Implementation Plan NOTE: Agreement were described followed by ”  ”.

Functional Block (Architecture) Discussion Points –Should we consider auto recovery function in VIM? Leave it as option and future development?  Write this option in requirement document, but not implement in our first step.  We need actions by some manager outside of VIM. (Actions that requires admin right should be clarify in the document.) –Can we avoid duplication of data source particularly the resource map and state DB?  Yes. (State means state of resource handled in infra, not indicate internal state of VM.) –Can we done all failure selection and aggregation by Inspector?  We can done it by using Zabbix, but we have to consider replacement Zabbix by other tools to achieve framework which has modularity that means each module is pluggable and replaceable. This idea is important in OPNFV as well as OpenStack.  TODO(Ryota): Study Zabbix aggregation framework, and rethink functional block including Monitor, Inspector and Predictor. Also check “KIE”.  We will check other related projects. Other projects (HA for VNF, Failure Prediction) seems to be working on similar issue. (Failure Prediction team focus on finding warning rather than notify error immediately.)

Functional Block: Type A (User) Notifier User-side Manager NFVI Monitor Alarm conf 0. Set Alarm on VM 3. Update State 2. Find Affected 5. Notify VM error VNFs Controller Resource Map 1. Receive Failure Inspector 4. Notify all 6-2. Action 4. (alt) Notify 6-1. Action Failure Policy

Functional Block: Type A (Admin) Notifier User-side Manager NFVI Monitor Alarm conf 0. Set Alarm on Host 3. Update State 2. Find Affected VNFs Controller Resource Map 1. Receive Failure Inspector 4. Notify all 4. (alt) Notify Admin-side Manager 5. Notify PM error 6. Deactivate Host6. Shutdown Host Failure Policy

Functional Block: Type B (User) Notifier User-side Manager NFVI Monitor Alarm conf 0. Set Alarm on VM (+ recovery action) 3. Update State 2. Find Affected VNFs Controller Resource Map 1. Receive Failure Inspector 4. Notify all 6-2. Action 4. (alt) Notify 6-1. Action 5. Notify VM error Mender Failure Policy

Functional Block: Type B (Admin) Notifier User-side Manager NFVI Monitor Alarm conf 0. Set Alarm on Host (+ recovery action) 3. Update State 2. Find Affected VNFs Controller Resource Map 1. Receive Failure Inspector 4. Notify all 4. (alt) Notify Admin-side Manager 5. Notify PM error 6. Deactivate Host 6. Shutdown Host Mender Failure Policy

Implementation Plan Discussion Points –How we kept configuration of alarm and reaction passed by user/admin? Not all virtual resource controller (such as Nova, Neutron) have metadata on each resource.  Having own DB. –Which part of functional block can be covered by Zabbix and its plugin script? Anyhow, we should use python, so that we can migrate scripts to some other OpenStack component easier.  Plan A1  Python!

Implementation Plan A1 Notifier User-side Manager NFVI Monitor Alarm conf 0. Set Alarm on VM 3. Update State 2. Find Affected 5. Notify VM error VNFs Nova Resource Map 1. Receive Failure Inspector 4. Notify all 6-2. Action 4. (alt) Notify 6-1. Action Failure Policy Zabbix Configuration Queue

Implementation Plan A2 Notifier User-side Manager NFVI Monitor Alarm conf 0. Set Alarm on VM 3. Update State 2. Find Affected 5. Notify VM error VNFs Nova Resource Map 1. Receive Failure Inspector 4. Notify all 6-2. Action 4. (alt) Notify 6-1. Action API service Failure Policy Zabbix Configuration

OpenStack Summit Vancouver Session? –Design Session (Maybe session planning will be had in April) –General Session (CFS deadline Feb 9 th ) Contents? –Describe Concept –As Doctor project –How OpenStack help (e.g. How to reaction or fencing) Deliverables? –Doc Links – –  Let’s talk at the next meeting (Feb 10).