1 Light-path Monitor System of TWAREN Optical Network
National Center for High-Performance Computing
Speaker: Ming-Chang Liang
2 INTRODUCTION
3 TWAREN phase 2
At the end of 2006, TWAREN was upgraded to support more protection methods and better availability; the result is called TWAREN phase 2. Tens of optical switches and hundreds of lightpaths then served as the foundation of the layer 2 VLAN services and the layer 3 IP routing services. In 2008, tens of VPLS switches were added to provide a multipoint VPLS VPN service. Layer 1 lightpaths can be protected by SNCP, layer 2 VLANs by spanning tree recalculation, and layer 2 VPLS by fast reroute. Together, these improvements make TWAREN phase 2 a true hybrid network that provides multiple layers of service with high availability.
4 Architecture of Optical Network
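5 Architecture of TWAREN phase 2
[Figure: network topology diagram. Link types in the legend: STM64, STM16, 10GE, GE. Equipment labels: 6509, 7609, 7609C. Locations labeled: Taipei, Hsinchu, Taichung, Tainan. Site labels: NCHC, NTU, NCU, NSYSU, NCHU, NCTU, NTHU, ASCC, NCCU, NCKU, CCU, MOEcc, NCNU, NIU, NDHU, NHLTC, NTTU.]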
6 CTC is not enough for us
TWAREN phase 2 is inherently able to guard against a single point of hardware or circuit failure, so such a failure is less likely to affect actual service provisioning. When a port or circuit fails, we must still determine which lightpaths are affected and then correlate them with the upper-layer services.
7 DESIGN OF NMS
8 1st Stage Architecture of NMS
[Figure: block diagram. Labels: Current Status DB, Long Term DB, Monitor Objs, Data Collectors, Traps, MIBs, Syslogs, Net flows, Telnet/SSH, Fault Detection, Threshold DB, Case/Action DB, GUI & Ticket System, Threshold Analyzer, Fault Location, Auto Action, Control API, Report System, TL1, Mirror, Interactive, Passive.]
9 Lightpaths Monitor System
[Figure: block diagram. Labels: Monitor System in NOC, Alarm trigger, Alarm, Light-path data, Parser, Database, Trap parser, ONS, TWAREN Optical Network.]
10 Frequent incident case 1
11 Frequent incident case 2
12 Important information in a trap
Value index  Value                   Description
1            Trap agent host name    The hostname of the host that sent this trap
2            Trap agent IP address   The IP address of the host that sent this trap
3            sysUpTime               The system uptime of the host that sent this trap
4            snmpTrapOID             The mapped OID of this trap
5            cerent454NodeTime       The ONS clock time (YYYYMMDDhhmmss)
6            cerent454AlarmState     The severity level of this warning (defined by Cisco CerentNotificationClass)
13 cerent454AlarmState
Alarm state number  Meaning                   Traps that use this alarm state
31                  diagnostic                All traps of ONS
40                  cleared                   All traps of ONS send this state when the fault is resolved
50                  minorNonServiceAffecting  1. carrierLossOnTheLAN 2. transportLayerFailure
80                  minorServiceAffecting     All traps of ONS
90                  majorServiceAffecting     1. carrierLossOnTheLAN 2. transportLayerFailure
100                 criticalServiceAffecting  1. lossOfSignal 2. lossOfFrame
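The six trap values and the alarm state numbers listed above could be turned into a structured record roughly as follows. This is only a minimal sketch: the names `TrapRecord`, `parse_trap_values`, and `is_cleared`, and the assumption that the values arrive as an ordered list of six strings, are illustrative and not part of the monitor system itself. The state numbers are copied from the cerent454AlarmState slide.

```python
from dataclasses import dataclass
from datetime import datetime

# Alarm state numbers as listed on the cerent454AlarmState slide.
ALARM_STATES = {
    31: "diagnostic",
    40: "cleared",
    50: "minorNonServiceAffecting",
    80: "minorServiceAffecting",
    90: "majorServiceAffecting",
    100: "criticalServiceAffecting",
}


@dataclass
class TrapRecord:
    # Hypothetical container; field order follows value indexes 1..6.
    host_name: str
    agent_ip: str
    sys_uptime: str
    trap_oid: str
    node_time: datetime
    alarm_state: str


def parse_trap_values(values):
    """Build a TrapRecord from the six trap values, given in index order."""
    if len(values) != 6:
        raise ValueError("expected 6 trap values, got %d" % len(values))
    host, ip, uptime, oid, node_time, state = values
    return TrapRecord(
        host_name=host,
        agent_ip=ip,
        sys_uptime=uptime,
        trap_oid=oid,
        # cerent454NodeTime uses the YYYYMMDDhhmmss layout.
        node_time=datetime.strptime(node_time, "%Y%m%d%H%M%S"),
        # Numbers not in the table are kept visible instead of being dropped.
        alarm_state=ALARM_STATES.get(int(state), "unknown(%s)" % state),
    )


def is_cleared(record):
    """True when the trap reports that an earlier fault has been resolved."""
    return record.alarm_state == "cleared"
```

Keeping unknown state numbers as `unknown(N)` rather than raising an error means a trap with an unexpected severity still reaches the alarm log.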
14 The OID of ONS Trap
Table columns: ONS, Trap Name, MIB, OID
ONS: ONS-15454
Trap names: lossOfSignal, lossOfFrame, carrierLossOnTheLAN, transportLayerFailure
15 DESIGN OF DATABASE
16 Relationship of Data Tables
Basic Data Tables: Component, People, Location, Unit, Vendor, etc.
Relationship Tables: Circuit, VLAN Services, VPLS Services, ONS Light Path, ONS Cross Connection, etc.
17 Basic Data Tables
Component Data Table
Columns: Component_ID, Parent_C_ID, Name
Sample rows (column boundaries lost in extraction): 10TN7609P; 121Slot_1; 20TP Slot_ Port_9
People Data Table
Columns: ID, Name, Phone, Address, Service_Time, Service_WeekDay
Sample rows: 1, John, (phone and address masked), 8-17, 1,3,5; 2, Mary, (phone and address masked), ALL
Location Data Table
ID  Name   Address
1   MOEcc  xxxxx
2   NTU    xxxxx
Unit Data Table
ID  Name
1   NCKU
18  THU
Vendor Data Table
ID  Name
1   CHT
2   APBT
3   RingLine
18 Port Table
Field      Type     Description
PortID     int      Port Component ID
Type       int      Port type: Ethernet(0), SDH(1)
CardName   varchar  Line card type name
Bandwidth  int      Bandwidth
Status     int      Defined by us
19 Topology Link Table
Field  Type  Description
NodeA  int   Component ID of the equipment at one end of the link
NodeB  int   Component ID of the equipment at the other end of the link
PortA  int   Component ID of the port on NodeA connected to the link
PortB  int   Component ID of the port on NodeB connected to the link
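Because each Topology Link row names both ends of a link, a lookup from a port to its far-end peer follows directly from the table. The sketch below assumes the rows have already been read as `(NodeA, NodeB, PortA, PortB)` tuples; the function name `build_peer_ports` and the sample IDs in the usage note are made up for illustration.

```python
def build_peer_ports(links):
    """Map each port component ID to the (node, port) at the far end of its link.

    `links` is an iterable of (NodeA, NodeB, PortA, PortB) tuples, matching
    the field order of the Topology Link table.
    """
    peers = {}
    for node_a, node_b, port_a, port_b in links:
        # A link is bidirectional, so register it from both ends.
        peers[port_a] = (node_b, port_b)
        peers[port_b] = (node_a, port_a)
    return peers
```

For example, with the hypothetical rows `[(1, 2, 101, 201), (2, 3, 202, 301)]`, looking up port `101` yields `(2, 201)`, so a fault reported on one port can be associated with the port on the neighboring ONS as well.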
20 Cross Connection Table
Field         Type     Description
CRS           int      Cross connection unique number
SNCP          int      Whether SNCP protection is used (0: No, 1: Yes)
PortFrom1     int      Port Component ID, From 1
PortFrom2     int      Port Component ID, From 2
PortTo1       int      Port Component ID, To 1
PortTo2       int      Port Component ID, To 2
ChannelFrom1  int      Channel ID on Port From 1
ChannelFrom2  int      Channel ID on Port From 2
ChannelTo1    int      Channel ID on Port To 1
ChannelTo2    int      Channel ID on Port To 2
SNCPPathFrom  int      Path currently used by the SNCP selector on the From side: WORKING(1) or PROTECTION(2)
SNCPPathTo    int      Path currently used by the SNCP selector on the To side: WORKING(1) or PROTECTION(2)
Size          int      VC bandwidth size, in units of VC1 (155 Mbps)
Status        int      Defined by us
CKTID         varchar  Circuit identification string
21 Light-Path Table
Field            Type     Description
LightPath        int      Light-path unique number
Name             varchar  Light-path name (CKTID)
PortFrom         int      Port, From
PortTo           int      Port, To
SNCP             int      Whether SNCP protection is used (0: No, 1: Yes)
Size             int      VC bandwidth size, in units of VC1 (155 Mbps)
TraceCRS         varchar  Cross-connection path string that the light-path passes through
TraceConfigured  varchar  Configured port path (port Component ID string) that the light-path should pass through
TraceCurrent     varchar  Actual port path (port Component ID string) that the light-path currently passes through
Status           int      Defined by us
22 Alarm Table
Field        Type     Description
Id           int      Alarm serial number
EventName    varchar  Alarm identifier name
HostName     varchar  The name of the host that sent this alarm
AgentIP      varchar  The IP address of the host that sent this alarm
Category     varchar  The class of this alarm
Severity     varchar  The severity level of this alarm (defined by us)
UpTime       varchar  The uptime of the equipment that sent this alarm
TrapTime     varchar  The time at which this alarm was generated
Interface    varchar  The affected port
AlarmStatus  varchar  The severity level defined by Cisco
LightPath    varchar  The names of the affected light-paths
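Comparing TraceConfigured with TraceCurrent shows whether a light-path has moved away from its configured route. The slides do not give the format of the trace strings, so the sketch below assumes a comma-separated list of port Component IDs; that format, the helper names `parse_trace` and `rerouted_ports`, and the sample IDs in the usage note are all assumptions.

```python
def parse_trace(trace):
    """Split a trace string into port component IDs (assumed comma-separated)."""
    return [int(p) for p in trace.split(",") if p.strip()]


def rerouted_ports(trace_configured, trace_current):
    """Return (left, joined) for one light-path.

    `left` lists configured ports that the current path no longer uses;
    `joined` lists ports in the current path that were not configured.
    Both lists are empty when the light-path is on its configured route.
    """
    configured = parse_trace(trace_configured)
    current = parse_trace(trace_current)
    left = [p for p in configured if p not in current]
    joined = [p for p in current if p not in configured]
    return left, joined
```

With the made-up traces `"101,201,202,301"` and `"101,401,402,301"`, the result is `([201, 202], [401, 402])`: the path still starts and ends on the same ports but currently runs over a different middle segment.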
23 IMPLEMENTATION
24 Working steps
1. Build the Port table by reading from the Component table.
2. Send TL1 commands to all ONS nodes.
3. Build the TopologyLink and CrossConnection tables by parsing the TL1 responses.
4. Build the LightPath table by aggregating the Port, TopologyLink, and CrossConnection tables.
5. When a trap is received from an ONS, determine the affected ports and then correlate them with the affected services in the database.
25 TL1 commands
Command                             Description
ACT-USER::username:123::password;   Login
RTRV-NE-IPMAP:::123;                Get information about topology link neighbors
RTRV-CRS::ALL:123;                  Get information about cross connections
RTRV-VC::ALL:123;                   Get information about all VC statuses, including SNCP selector status
CANC-USER::username:123:;           Logout
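The last working step, correlating an affected port with light-paths, might look like the following. This is a sketch under stated assumptions rather than the system's actual code: it uses an in-memory SQLite database, keeps only three columns of the Light-Path table, assumes TraceCurrent is a comma-separated list of port Component IDs, and fills the table with made-up rows. The names `affected_lightpaths` and `demo_database` are hypothetical.

```python
import sqlite3


def affected_lightpaths(conn, port_id):
    """Return names of light-paths whose current trace passes through port_id."""
    rows = conn.execute(
        "SELECT Name, TraceCurrent FROM LightPath ORDER BY LightPath"
    )
    names = []
    for name, trace in rows:
        # Compare whole IDs so that port 20 does not match port 201.
        ports = {int(p) for p in trace.split(",") if p.strip()}
        if port_id in ports:
            names.append(name)
    return names


def demo_database():
    """Create an in-memory LightPath table holding two made-up rows."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE LightPath (LightPath INTEGER, Name TEXT, TraceCurrent TEXT)"
    )
    conn.executemany(
        "INSERT INTO LightPath VALUES (?, ?, ?)",
        [
            (1, "LP-EXAMPLE-1", "101,201,202,301"),
            (2, "LP-EXAMPLE-2", "102,401,402,302"),
        ],
    )
    return conn
```

Calling `affected_lightpaths(demo_database(), 201)` returns `["LP-EXAMPLE-1"]`; the resulting names are what would be written to the LightPath field of the Alarm table.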
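The command strings in the table can be generated per session from a user name and password. The helper below only formats the five strings exactly as they appear on the slide; `session_commands` is a hypothetical name, and the transport used to send them to an ONS and the parsing of the responses are deliberately left out.

```python
def session_commands(username, password, ctag="123"):
    """Return the TL1 command strings from the table, in sending order.

    `ctag` is the correlation tag echoed back in each response; the slide
    uses 123 for every command.
    """
    return [
        f"ACT-USER::{username}:{ctag}::{password};",  # login
        f"RTRV-NE-IPMAP:::{ctag};",                   # topology link neighbors
        f"RTRV-CRS::ALL:{ctag};",                     # cross connections
        f"RTRV-VC::ALL:{ctag};",                      # VC statuses, incl. SNCP selector
        f"CANC-USER::{username}:{ctag}:;",            # logout
    ]
```

Using a distinct `ctag` per command would make it easier to match responses to requests when several commands are outstanding.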
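26 Partial SNCP (1)
[Figure: nodes ONS-A, ONS-B, ONS-C, ONS-D with endpoints RA and RB. Legend: configured working path, configured protection path.]
27 Partial SNCP (2)
[Figure: nodes ONS-A, ONS-B, ONS-C, ONS-D with endpoints RA and RB. Legend: actual working path, configured protection path.]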
29 WEB-BASED ALARM LOG SYSTEM
30 Example 1
32 Example 2
33 INTEGRATED VISUAL INTERFACE