Presentation is loading. Please wait.

Presentation is loading. Please wait.

Traffic Engineering for the Modern MPLS Backbone

Similar presentations


Presentation on theme: "Traffic Engineering for the Modern MPLS Backbone"— Presentation transcript:

1 Traffic Engineering for the Modern MPLS Backbone
Extending PCEP for Stateful Control of MPLS RSVP-TE Attributes Edward Crabbe, Google Jan Medved, Juniper Robert Varga, Juniper

2 "...it is generally desirable to ensure that subsets of network resources do not become over utilized and congested while other subsets along alternate feasible paths remain underutilized. Bandwidth is a crucial resource in contemporary networks. Therefore, a central function of Traffic Engineering is to efficiently manage bandwidth resources. Minimizing congestion is a primary traffic and resource oriented performance objective. The interest here is on congestion problems that are prolonged rather than on transient congestion resulting from instantaneous bursts. Congestion typically manifests under two scenarios: When network resources are insufficient or inadequate to accommodate offered load When traffic streams are inefficiently mapped onto available resources; causing subsets of network resources to become over-utilized while others remain underutilized." [RFC 2702 Requirements for Traffic Engineering Over MPLS]

3 "...it is generally desirable to ensure that subsets of network resources do not become over utilized and congested while other subsets along alternate feasible paths remain underutilized. Bandwidth is a crucial resource in contemporary networks. Therefore, a central function of Traffic Engineering is to efficiently manage bandwidth resources. Minimizing congestion is a primary traffic and resource oriented performance objective. The interest here is on congestion problems that are prolonged rather than on transient congestion resulting from instantaneous bursts. Congestion typically manifests under two scenarios: When network resources are insufficient or inadequate to accommodate offered load When traffic streams are inefficiently mapped onto available resources; causing subsets of network resources to become over-utilized while others remain underutilized." [RFC 2702 Requirements for Traffic Engineering Over MPLS]

4 Topics MPLS TE Today Example Use Cases Stateful PCE Protocol Proposal

5 Topics MPLS TE Today Example Use Cases Stateful PCE Protocol Proposal

6 Online MPLS Control Mechanisms
Online:  computation and control of RSVP-TE parameters by local network element   auto bandwidth defacto 'standard' in online control mechnisms not widely deployed (but the few networks that do use it are quite large) provides independent, asynchronous control of device-local LSPs unsynchronized, fixed rate per device timers local empirical measurement of demands with little hysteresis per LSP: every S seconds compute [average] every K intervals take [max] of [average]s from last K intervals if [max] of [average]s greater than threshold T run CSPF

7 Offline MPLS Control Mechanisms
Offline: computation by system outside network element and control of RSVP-TE parameters via northbound API  Config PCE Openflow

8 Offline MPLS Control Mechanisms
Offline: computation by system outside network element and control of RSVP-TE parameters via northbound API  Config simplest' offline control method  relies on heavyweight config database changes for updates a bit heavy weight for transient forwarding state may lock config sections for duration of changes potentially problematic  platform dependent interface Does not trigger RFC3209 section 2.5 reroute behavior on most platforms config length effects compilation time ( ∴ boot time) Why not just do everything with static routes? :P PCE Openflow

9 Offline MPLS Control Mechanisms
Offline: computation by system outside network element and control of RSVP-TE parameters via northbound API  Config PCE No way to control timing of updates (without inefficiency in control plane as a result of directionality) No way to control sequence of updates across devices no way to collect results of PCEP PCRep messages (RSVP-TE error codes) no way to collect LSP state from devices 'in-band' Openflow

10 Topics MPLS TE Today Example Use Cases Stateful PCE Protocol Proposal

11 Example Stateful PCEP Use Cases
Deadlock Resolution Bin Packing Scheduling / Calendaring Predictability Adaptive Timescales Constraint Relaxation GCO ...

12 Example Stateful PCEP Use Cases
Deadlock Resolution Bin Packing Scheduling / Calendaring Predictability Adaptive Timescales Constraint Relaxation GCO ...

13 Deadlock 1 10 1 1 1 causes: control / dataplane decoupling
rfc3209 implies no teardown on reservation increase failure demand will be miss signaled for long periods lack of global LSP state lack of LSP level ingress admission control would require another online or offline control mechanism tension between overprovisioning level and transport elasticity A 1 C E 10 1 1 1 B D Link Metric Capacity A-C 1 20 B-C C-E 10 5 C-D D-E Time LSP Src Dst Demand 1 A E 2 B 3 20

14 Deadlock 1 10 1 1 1 A C E B D Link Metric Capacity A-C 1 20 B-C C-E 10
5 C-D D-E Time LSP Src Dst Demand 1 A E 2 B 3 20

15 Deadlock 1 10 1 1 1 A C E B D Link Metric Capacity A-C 1 20 B-C C-E 10
5 C-D D-E Time LSP Src Dst Demand 1 A E 2 B 3 20

16 Deadlock 1 10 1 1 1 LSP 1: demand cannot be satisfied
LSP not torn down due to 3209 usage controlled due to control/data plane decoupling ⇒ information in IGP, RSVP is inaccurate LSP 2 lack of visibility w/r/t LSP 1 misbehavior results in unecessary, potentially prolongued degradation in service could be rerouted along C-E link modulo flow performance constraints A 1 C E 10 1 1 1 B D Link Metric Capacity A-C 1 20 B-C C-E 10 5 C-D D-E Time LSP Src Dst Demand 1 A E 2 B 3 20

17 Deadlock 1 10 1 1 1 lack of LSP level ingress admission control
would require another online or offline control mechanism offline: need northbound API online: back to autopbw issues tension between overprovisioning level and transport elasticity A 1 C E 10 1 1 1 B D Link Metric Capacity A-C 1 20 B-C C-E 10 5 C-D D-E Time LSP Src Dst Demand 1 A E 2 B 3 20

18 Bin Packing 1 10 1 1 1 causes: lack of global LSP state
bin packing is a sequencing problem - NP-Hard Better to solve w/ some throughput optimization A 1 C E 10 1 1 1 B D Link Metric Capacity A-C 1 10 B-C C-E 5 C-D D-E Time LSP Src Dst Demand 1 A E 5 2 B 10

19 Bin Packing 1 10 1 1 1 A C E B D Link Metric Capacity A-C 1 10 B-C C-E
5 C-D D-E Time LSP Src Dst Demand 1 A E 5 2 B 10

20 Bin Packing X 1 10 1 1 1 unable to shuffle demands w/o
some offline control stateful knowledge network LSPs 33% efficiency in capacity usage efficiency dictated by order of event arrival A 1 C E X 10 1 1 1 B D Link Metric Capacity A-C 1 10 B-C C-E 5 C-D D-E Time LSP Src Dst Demand 1 A E 5 2 B 10

21 Scheduling causes: autobw empirically derives demand with single period hysteresis  unable to use historical timeseries apriori knowledge of demand network must be overprovisioned for either offline: worst case demand over reopt interval (⇔) online: (autobw) reopt trigger threshold + safety margin A 1 C E 10 1 1 1 B D Link Metric Capacity A-C 1 20 B-C C-E 10 C-D D-E Time LSP Src Dst Demand 1 A E 2 B 7 3 3+k

22 Scheduling apriori knowledge that LSP 1 demand at t3 will increase to 7 A 1 C E 10 1 1 1 B D Link Metric Capacity A-C 1 20 B-C C-E 10 C-D D-E Time LSP Src Dst Demand 1 A E 2 B 7 3 3+k

23 Scheduling 1 10 1 1 1 A C E B D Link Metric Capacity A-C 1 20 B-C C-E
C-D D-E Time LSP Src Dst Demand 1 A E 2 B 7 3 3+k

24 Scheduling online: if network is not provisioned to trigger threshold, loss incurred churn ∝ 1/T smaller static, uniquitously applied trigger threshold results in higher churn for other less predictable demands A 1 C E 10 1 1 1 B D Link Metric Capacity A-C 1 20 B-C C-E 10 C-D D-E Time LSP Src Dst Demand 1 A E 2 B 7 3 3+k

25 Scheduling 1 10 1 1 1 A C E B D Link Metric Capacity A-C 1 10 B-C C-E
C-D D-E Time LSP Src Dst Demand 1 A E 2 B 7 3 3+k

26 Predictability causes: VS
routers act independently and asynchronously ⇒ path dictated by order of event arrival A 1 C E 10 1 1 1 B D Time LSP Src Dst Demand 1 A E 7 2 B Link Metric Capacity A-C 1 10 B-C C-E C-D D-E VS Time LSP Src Dst Demand 1 2 B E 7 A

27 Predictability VS 1 10 1 1 1 A C E B D Time LSP Src Dst Demand 1 A E 7
2 B Link Metric Capacity A-C 1 10 B-C C-E C-D D-E VS Time LSP Src Dst Demand 1 2 B E 7 A

28 Predictability VS 1 10 1 1 1 A C E B D Time LSP Src Dst Demand 1 A E 7
2 B Link Metric Capacity A-C 1 10 B-C C-E C-D D-E VS Time LSP Src Dst Demand 1 2 B E 7 A

29 Predictability VS 1 10 1 1 1 A C E B D Time LSP Src Dst Demand 1 A E 7
2 B Link Metric Capacity A-C 1 10 B-C C-E C-D D-E VS Time LSP Src Dst Demand 1 2 B E 7 A

30 Predictability VS 1 10 1 1 1 A C E B D Time LSP Src Dst Demand 1 A E 7
2 B Link Metric Capacity A-C 1 10 B-C C-E C-D D-E VS Time LSP Src Dst Demand 1 2 B E 7 A

31 Topics MPLS TE Today Example Use Cases Stateful PCE Protocol Proposal

32 Stateful PCE (RSVP-TE focused)
Will Provide: Efficient Distribution of LSP State to PCE Global view of network state to LSP level minimal distributed state - much more efficient than distributed protocol for accomplishing similar objectives Delegation of control of LSP attributes to PCE Support for redundant PCEs with near-hitless failover PCE driven control of LSP attribute changes Ordering and synchronization of attribute changes across devices opens the way for all manner of optimization

33 Stateful PCE for RSVP-TE Objectives
Allow a single PCC to interact with a mix of stateless and stateful PCEs simultaneously using the same PCEP. Allow stateful PCEs to learn about the state of LSPs in a PCC Allow a PCC to delegate control of its LSPs to an active stateful PCE on a per LSP basis. Allow PCE to control of computation timing and sequence across all LSPs that have been delegated to it. Enable uninterrupted operation of PCC's LSPs in the event PCE failure or while control of LSPs is being transferred between PCEs.

34 PCEP Modifications & Additions
Capability Additions Function Direction Message & RFC Updated Capability negotiation  E<->C PCRpt  [RFC5440] ISIS stateful capability advertisement E->C ISIS PCE-CAP-FLAGS sub-TLV [RFC 5089] OSPF stateful capability advertisement OSPF RI LSA, PCE TLV, PCE-CAP-FLAGS sub-TLV [RFC 5088] New Functions Function Direction PCEP Message LSP state synchronization C->E PCRpt LSP status request E->C PCSrq LSP status report/notification LSP control delegation C<->E LSP modification PCUpd

35 State Synchronization
Uses new Messages: PCRpt New object class and type: LSP-Object PCC PCE PCRpt, SyncDone=0 PCRpt, SyncDone=1 ...

36 LSP Delegation ... Uses new Messages: PCRpt, PCUpd
New object class and type: LSP-Object PCC PCE PCRpt, Delegate=1 PCUpd, Delegate=1 ... Empty PCUpd

37 PCE Driven Control ... Uses new Messages: PCRpt, PCUpd
New object class and type: LSP-Object PCC PCE PCRpt, Delegate=1 PCUpd, Delegate=1 ... Empty PCUpd

38 Next Steps Discussion... WG Adoption?


Download ppt "Traffic Engineering for the Modern MPLS Backbone"

Similar presentations


Ads by Google