Network Working Group S. Dasgupta Request for Comments: 5468 J. de Oliveira Category: Informational Drexel University JP. Vasseur Cisco Systems April 2009 Performance Analysis of Inter-Domain Path Computation Methodologies Status of This Memo This memo provides information for the Internet community. It does not specify an Internet standard of any kind. Distribution of this memo is unlimited. Copyright Notice Copyright (c) 2009 IETF Trust and the persons identified as the document authors. All rights reserved. This document is subject to BCP 78 and the IETF Trust's Legal Provisions Relating to IETF Documents in effect on the date of publication of this document (http://trustee.ietf.org/license-info). Please review these documents carefully, as they describe your rights and restrictions with respect to this document. Abstract This document presents a performance comparison between the per- domain path computation method and the Path Computation Element (PCE) Architecture-based Backward Recursive Path Computation (BRPC) procedure. Metrics to capture the significant performance aspects are identified, and detailed simulations are carried out on realistic scenarios. A performance analysis for each of the path computation methods is then undertaken. Dasgupta, et al. Informational [Page 1] RFC 5468 Analysis of Inter-Domain Path Computation April 2009 Table of Contents 1. Introduction ....................................................2 2. Terminology .....................................................3 3. Evaluation Metrics ..............................................4 4. Simulation Setup ................................................5 5. Results and Analysis ............................................6 5.1. Path Cost ..................................................7 5.2. Crankback/Setup Delay ......................................7 5.3. Signaling Failures .........................................8 5.4. Failed TE-LSPs/Bandwidth on Link Failures ..................8 5.5. TE LSP/Bandwidth Setup Capacity ............................8 6. Security Considerations .........................................9 7. Acknowledgment ..................................................9 8. Informative References ..........................................9 1. Introduction The IETF has specified two approaches for the computation of inter- domain (Generalized) Multi-Protocol Label Switching ((G)MPLS) Traffic Engineering (TE) Label Switched Paths (LSP): the per-domain path computation approach defined in [RFC5152] and the PCE-based approach specified in [RFC4655]. More specifically, we study the PCE-based path computation model that makes use of the BRPC method outlined in [RFC5441]. In the rest of this document, we will call PD and PCE the per-domain path computation approach and the PCE path computation approach, respectively. In the per-domain path computation approach, each path segment within a domain is computed during the signaling process by each entry node of the domain up to the next-hop exit node of that same domain. In contrast, the PCE-based approach and, in particular, the BRPC method defined in [RFC5441] rely on the collaboration between a set of PCEs to find to shortest inter-domain path after the computation of which the corresponding TE LSP is signaled: path computation is undertaken using multiple PCEs in a backward recursive fashion from the destination domain to the source domain. The notion of a Virtual Shortest Path Tree (VSPT) is introduced. Each link of a VSPT represents the shortest path satisfying the set of required constraints between the border nodes of a domain and the destination LSR. The VSPT of each domain is returned by the corresponding PCE to create a new VSPT by PCEs present in other domains. [RFC5441] discusses the BRPC procedure in complete detail. Dasgupta, et al. Informational [Page 2] RFC 5468 Analysis of Inter-Domain Path Computation April 2009 This document presents some simulation results and analysis to compare the performance of the above two inter-domain path computation approaches. Two realistic topologies with accompanying traffic matrices are used to undertake the simulations. Note that although the simulation results discussed in this document have used inter-area networks, they also apply to Inter-AS cases. Disclaimer: although simulations have been made on different and realistic topologies showing consistent results, the metrics shown below may vary with the network topology. Note that this document refers to multiple figures that are only available in the PDF version. 2. Terminology Terminology used in this document: TE LSP: Traffic Engineered Label Switched Path. CSPF: Constrained Shortest Path First. PCE: Path Computation Element. BRPC: Backward Recursive PCE-based Computation. AS: Autonomous System. ABR: Routers used to connect two IGP areas (areas in OSPF or levels in IS-IS). ASBR: Routers used to connect together ASes of a different or the same Service Provider via one or more Inter-AS links. Border LSR: A border LSR is either an ABR in the context of inter- area TE or an ASBR in the context of Inter-AS TE. VSPT: Virtual Shortest Path Tree. LSA: Link State Advertisement. LSR: Label Switching Router. IGP: Interior Gateway Protocol. TED: Traffic Engineering Database. Dasgupta, et al. Informational [Page 3] RFC 5468 Analysis of Inter-Domain Path Computation April 2009 PD: Per-Domain 3. Evaluation Metrics This section discusses the metrics that are used to quantify and compare the performance of the two approaches. o Path Cost. The maximum and average path costs are observed for each TE LSP. The distributions for the maximum and average path costs are then compared for the two path computation approaches. o Signaling Failures. Signaling failures may occur in various circumstances. With PD, the head-end LSR chooses the downstream border router (ABR, ASBR) according to some selection criteria (IGP shortest path, ....) based on the information in its TED. This ABR then selects the next ABR using its TED, continuing the process till the destination is reached. At each step, the TED information could be out of date, potentially resulting in a signaling failure during setup. In the BRPC procedure, the PCEs are the ABRs that cooperate to form the VSPT based on the information in their respective TEDs. As in the case of the PD approach, information in the TED could be out of date, potentially resulting in signaling failures during setup. Also, only with the PD approach, another situation that leads to a signaling failure is when the selected exit ABR does not have any path obeying the set of constraints toward a downstream exit node or the TE LSP destination. This situation does not occur with the BRPC. The signaling failure metric captures the total number of signaling failures that occur during initial setup and re-route (on link failure) of a TE LSP. The distribution of the number of signaling failures encountered for all TE LSPs is then compared for the PD and BRPC methods. o Crankback Signaling. In this document, we made the assumption that in the case of PD, when an entry border node fails to find a route in the corresponding domain, boundary re-routing crankback [RFC4920] signaling was used. A crankback signaling message propagates to the entry border node of the domain and a new exit border node is chosen. After this, path computation takes place to find a path segment to a new entry border node of the next domain. This causes an additional delay in setup time. This metric captures the distribution of the number of crankback signals and the corresponding delay in setup time for a TE LSP when using PD. The total delay arising from the crankback signaling is proportional to the costs of the links over which the signal travels, i.e., the path that is setup from the entry border node of a domain to its exit border node (the assumption was made Dasgupta, et al. Informational [Page 4] RFC 5468 Analysis of Inter-Domain Path Computation April 2009 that link metrics reflect propagation delays). Similar to the above metrics, the distribution of total crankback signaling and corresponding proportional delay across all TE LSPs is compared. o TE LSPs/Bandwidth Setup Capacity. Due to the different path computation techniques, there is a significant difference in the amount of TE LSPs/bandwidth that can be set up. This metric captures the difference in the number of TE LSPs and corresponding bandwidth that can be set up using the two path computation techniques. The traffic matrix is continuously scaled and stopped when the first TE LSP cannot be set up for both the methods. The difference in the scaling factor gives the extra bandwidth that can be set up using the corresponding path computation technique. o Failed TE LSPs/Bandwidth on Link Failure. Link failures are induced in the network during the course of the simulations conducted. This metric captures the number of TE LSPs and the corresponding bandwidth that failed to find a route when one or more links lying on its path failed. 4. Simulation Setup A very detailed simulator has been developed to replicate a real-life network scenario accurately. Following is the set of entities used in the simulation with a brief description of their behavior. +------------+-------+-------+--------+--------+---------+----------+ | Domain | # of | # of | OC48 | OC192 | [0,20) | [20,100] | | Name | nodes | links | links | links | Mbps | Mbps | +------------+-------+-------+--------+--------+---------+----------+ | D1 | 17 | 24 | 18 | 6 | 125 | 368 | | D2 | 14 | 17 | 12 | 5 | 76 | 186 | | D3 | 19 | 26 | 20 | 6 | 14 | 20 | | D4 | 9 | 12 | 9 | 3 | 7 | 18 | | MESH-CORE | 83 | 167 | 132 | 35 | 0 | 0 | | (backbone) | | | | | | | | SYM-CORE | 29 | 377 | 26 | 11 | 0 | 0 | | (backbone) | | | | | | | +------------+-------+-------+--------+--------+---------+----------+ Table 1. Domain Details and TE LSP Size Distribution o Topology Description. To obtain meaningful results applicable to present-day Service Provider topologies, simulations have been run on two representative topologies. They consists of a large backbone area to which four smaller areas are connected. For the first topology named MESH-CORE, a densely connected backbone was obtained from RocketFuel [ROCKETFUEL]. The second topology has a Dasgupta, et al. Informational [Page 5] RFC 5468 Analysis of Inter-Domain Path Computation April 2009 symmetrical backbone and is called SYM-CORE. The four connected smaller areas are obtained from [DEF-DES]. Details of the topologies are shown in Table 1 along with their layout in Figure 1. All TE LSPs set up on this network have their source and destinations in different areas and all of them need to traverse the backbone network. Table 1 also shows the number of TE LSPs that have their sources in the corresponding areas along with their size distribution. o Node Behavior. Every node in the topology represents a router that maintains states for all the TE LSPs passing through it. Each node in a domain is a source for TE LSPs to all the other nodes in the other domains. As in a real-life scenario, where routers boot up at random points in time, the nodes in the topologies also start sending traffic on the TE LSPs originating from them at a random start time (to take into account the different boot-up times). All nodes are up within an hour of the start of simulation. All nodes maintain a TED that is updated using LSA updates as outlined in [RFC3630]. The flooding scope of the Traffic Engineering IGP updates are restricted only to the domain in which they originate in compliance with [RFC3630] and [RFC5305]. o TE LSP Setup. When a node boots up, it sets up all TE LSPs that originate from it in descending order of size. The network is dimensioned such that all TE LSPs can find a path. Once set up, all TE LSPs stay in the network for the complete duration of the simulation unless they fail due to a link failure. Even though the TE LSPs are set up in descending order of size from a head-end router, from the network perspective, TE LSPs are set up in random fashion as the routers boot up at random times. o Inducing Failures. For thorough performance analysis and comparison, link failures are induced in all the areas. Each link in a domain can fail independently with a mean failure time of 24 hours and be restored with a mean restore time of 15 minutes. Both inter-failure and inter-restore times are uniformly distributed. No attempt to re-optimize the path of a TE LSP is made when a link is restored. The links that join two domains never fail. This step has been taken to concentrate only on how link failures within domains affect the performance. 5. Results and Analysis Simulations were carried out on the two topologies previously described. The results are presented and discussed in this section. All figures are from the PDF version of this document. In the figures, "PD-Setup" and "PCE-Setup" represent results corresponding Dasgupta, et al. Informational [Page 6] RFC 5468 Analysis of Inter-Domain Path Computation April 2009 to the initial setting up of TE LSPs on an empty network using the per-domain and the PCE approach, respectively. Similarly, "PD- Failure" and "PCE-Failure" denote the results under the link failure scenario. A period of one week was simulated and results were collected after the transient period. Figure 2 and Figure 3 illustrate the behavior of the metrics for topologies MESH-CORE and SYM-CORE, respectively. 5.1. Path Cost Figures 2a and 3a show the distribution of the average path cost of the TE LSPs for MESH-CORE and SYM-CORE, respectively. During the initial setup, roughly 40% of TE LSPs for MESH-CORE and 70% of TE LSPs for SYM-CORE have path costs greater with PD (PD-Setup) than with the PCE approach (PCE-Setup). This is due to the ability of the BRPC procedure to select the inter-domain shortest constrained paths that satisfy the constraints. Since the per-domain approach to path computation is undertaken in stages where every entry border router to a domain computes the path in the corresponding domain, the most optimal (shortest constrained inter-domain) route is not always found. When failures start to take place in the network, TE LSPs are re-routed over different paths resulting in path costs that are different from the initial costs. PD-Failure and PCE-Failure in Figures 2a and 3a show the distribution of the average path costs that the TE LSPs have over the duration of the simulation with link failures occurring. Similarly, the average path costs with the PD approach are much higher than the PCE approach when link failures occur. Figures 2b and 3b show similar trends and present the maximum path costs for a TE LSP for the two topologies, respectively. It can be seen that with per-domain path computation, the maximum path costs are larger for 30% and 100% of the TE LSPs for MESH-CORE and SYM- CORE, respectively. 5.2. Crankback/Setup Delay Due to crankbacks that take place in the per-domain approach of path computation, TE LSP setup time is significantly increased. This could lead to Quality-of-Service (QoS) requirements not being met, especially during failures when re-routing needs to be quick in order to keep traffic disruption to a minimum (for example in the absence of local repair mechanisms such as defined in [RFC4090]). Since crankbacks do not take place during path computation with a PCE, setup delays are significantly reduced. Figures 2c and 3c show the distributions of the number of crankbacks that took place during the setup of the corresponding TE LSPs for MESH-CORE and SYM-CORE, respectively. It can be seen that all crankbacks occurred when failures were taking place in the networks. Figures 2d and 3d illustrate the "proportional" setup delays experienced by the TE LSPs Dasgupta, et al. Informational [Page 7] RFC 5468 Analysis of Inter-Domain Path Computation April 2009 due to crankbacks for the two topologies. It can be observed that for a large proportion of the TE LSPs, the setup delays arising out of crankbacks are very large, possibly proving to be very detrimental to QoS requirements. The large delays arise out of the crankback signaling that needs to propagate back and forth from the exit border router of a domain to its entry border router. More crankbacks occur for SYM-CORE as compared to MESH-CORE as it is a very "restricted" and "constrained" network in terms of connectivity. This causes a lack of routes and often several cycles of crankback signaling are required to find a constrained path. 5.3. Signaling Failures As discussed in the previous sections, signaling failures occur either due to an outdated TED or when a path cannot be found from the selected entry border router. Figures 2e and 3e show the distribution of the total number of signaling failures experienced by the TE LSPs during setup. About 38% and 55% of TE LSPs for MESH-CORE and SYM-CORE, respectively, experience a signaling failures with per- domain path computation when link failures take place in the network. In contrast, only about 3% of the TE LSPs experience signaling failures with the PCE method. It should be noted that the signaling failures experienced with the PCE correspond only to the TEDs being out of date. 5.4. Failed TE-LSPs/Bandwidth on Link Failures Figures 2f and 3f show the number of TE LSPs and the associated required bandwidth that fail to find a route when link failures are taking place in the topologies. For MESH-CORE, with the per-domain approach, 395 TE LSPs failed to find a path corresponding to 1612 Mbps of bandwidth. For PCE, this number is lesser at 374 corresponding to 1546 Mbps of bandwidth. For SYM-CORE, with the per- domain approach, 434 TE LSPs fail to find a route corresponding to 1893 Mbps of bandwidth. With the PCE approach, only 192 TE LSPs fail to find a route, corresponding to 895 Mbps of bandwidth. It is clearly visible that the PCE allows more TE LSPs to find a route thus leading to better performance during link failures. 5.5. TE LSP/Bandwidth Setup Capacity Since PCE and the per-domain path computation approach differ in how path computation takes place, more bandwidth can be set up with PCE. This is primarily due to the way in which BRPC functions. To observe the extra bandwidth that can fit into the network, the traffic matrix was scaled. Scaling was stopped when the first TE LSP failed to set up with PCE. This metric, like all the others discussed above, is topology dependent (therefore, the choice of two topologies for this Dasgupta, et al. Informational [Page 8] RFC 5468 Analysis of Inter-Domain Path Computation April 2009 study). This metric highlights the ability of PCE to fit more bandwidth in the network. For MESH-CORE, on scaling, 1556 Mbps more could be set up with PCE. In comparison, for SYM-CORE, this value is 986 Mbps. The amount of extra bandwidth that can be set up on SYM- CORE is lesser due to its restricted nature and limited capacity. 6. Security Considerations This document does not raise any security issues. 7. Acknowledgment The authors would like to acknowledge Dimitri Papadimitriou for his helpful comments to clarify the text. 8. Informative References [DEF-DES] J. Guichard, F. Le Faucheur, and J.-P. Vasseur, "Definitive MPLS Network Designs", Cisco Press, 2005. [RFC5152] Vasseur, JP., Ed., Ayyangar, A., Ed., and R. Zhang, "A Per-Domain Path Computation Method for Establishing Inter-Domain Traffic Engineering (TE) Label Switched Paths (LSPs)", RFC 5152, February 2008. [RFC5441] Vasseur, JP., Zhang, R., Bitar, N., and JL. Le Roux, "A Backward Recursive PCE-Based Computation (BRPC) Procedure to Compute Shortest Constrained Inter-Domain Traffic Engineering Label Switched Paths", RFC 5441, April 2009. [RFC3630] Katz, D., Kompella, K., and D. Yeung, "Traffic Engineering (TE) Extensions to OSPF Version 2", RFC 3630, September 2003. [RFC5305] Li, T. and H. Smit, "IS-IS Extensions for Traffic Engineering", RFC 5305, October 2008. [RFC4090] Pan, P., Ed., Swallow, G., Ed., and A. Atlas, Ed., "Fast Reroute Extensions to RSVP-TE for LSP Tunnels", RFC 4090, May 2005. [RFC4655] Farrel, A., Vasseur, J.-P., and J. Ash, "A Path Computation Element (PCE)-Based Architecture", RFC 4655, August 2006. Dasgupta, et al. Informational [Page 9] RFC 5468 Analysis of Inter-Domain Path Computation April 2009 [RFC4920] Farrel, A., Ed., Satyanarayana, A., Iwata, A., Fujita, N., and G. Ash, "Crankback Signaling Extensions for MPLS and GMPLS RSVP-TE", RFC 4920, July 2007. [ROCKETFUEL] N. Spring, R. Mahajan, and D. Wehterall, "Measuring ISP Topologies with Rocketfuel", Proceedings of ACM SIGCOMM, 2002. Authors' Addresses Sukrit Dasgupta Drexel University Dept of ECE, 3141 Chestnut Street Philadelphia, PA 19104 USA Phone: 215-895-1862 EMail: sukrit@ece.drexel.edu URI: www.pages.drexel.edu/~sd88 Jaudelice C. de Oliveira Drexel University Dept. of ECE, 3141 Chestnut Street Philadelphia, PA 19104 USA Phone: 215-895-2248 EMail: jau@ece.drexel.edu URI: www.ece.drexel.edu/faculty/deoliveira JP Vasseur Cisco Systems 1414 Massachussetts Avenue Boxborough, MA 01719 USA EMail: jpv@cisco.com Dasgupta, et al. Informational [Page 10]