DRES: Dynamic Range Encoding Scheme for TCAM Coprocessors 2008 YU-ANTL Lab Seminar June 11, 2008 JeongKi Park Advanced Networking Technology Lab. (YU-ANTL)


Dept. of Information & Comm. Eng., Graduate School, Yeungnam University, KOREA
Hao Che, Zhijun Wang, Kai Zheng, Bin Liu
IEEE Transactions on Computers, Vol. 57, No. 7, July 2008

Outline
• Introduction
• Rule Implementation in TCAM
• Range Encoding
  - Structures of Encoded Rule and Encoded Search Key
  - TCAM-Based Search Key Encoding Process
  - Dynamic Range Selection Algorithm
  - Code Vector and Index Vector Encoding Algorithms
• Encoded Range Update Process
  - Encoding a Newly Selected Range & Releasing Encoded Ranges
• Performance Evaluation Based on Real-world Databases
• Analytical Performance Evaluation
• Conclusion and Future Work

Introduction (1)
• Fast packet classification algorithms are needed for longest prefix matching and general policy/firewall filtering
  - Most approaches cannot provide lookup performance that matches multigigabit line rates
• A TCAM coprocessor works as a look-aside processor for packet classification
  - The NPU generates a search key
  - The TCAM finds a matched rule in a small, constant number of clock cycles
• Range matching has low storage efficiency
  - Range preprocessing/encoding (bitmapping) is needed
    - Rule encoding
    - Search key encoding
    - Range selection algorithm
    - Database update algorithm

Introduction (2)
• Range encoding schemes
  - Top-down approach
    - Is designed for the most efficient rule encoding
    - Is subject to the hardware constraints of existing TCAM-based packet classification solutions
  - Bottom-up approach
    - May not offer the highest encoding gain
    - But works with only software upgrades
• Dynamic Range Encoding Scheme (DRES)
  - Bottom-up approach that improves TCAM storage efficiency
  - Comprehensive solution that includes all four algorithms
  - Encoded rule structure allows any subset of ranges in any subset of rule fields to be encoded
  - General range encoding framework that allows any existing range encoding algorithm to be incorporated

Rule Implementation in TCAM
• A TCAM coprocessor has a TCAM and an associated memory (SRAM)
  - The TCAM is organized in slots (64, 72, 128, 192, or 256 bits)
  - Each bit in a slot can take one of three values: 0, 1, or * (don't care)
• When a packet arrives, the NPU generates a search key based on the packet header information and passes it to the TCAM coprocessor
• A policy/firewall filtering (PF) rule typically uses a 104-bit five-tuple (for IPv4)
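The ternary matching described on this slide can be illustrated with a minimal Python sketch. It models a TCAM as an ordered list of strings over '0', '1', and '*', and assumes the usual convention that the first (lowest-address) matching entry wins. The function names and the 4-bit table are hypothetical, chosen only for illustration; the 104-bit figure is the sum of the IPv4 five-tuple field widths.

```python
def ternary_match(entry: str, key: str) -> bool:
    """Return True if a binary key matches a ternary entry of '0', '1', '*'."""
    if len(entry) != len(key):
        raise ValueError("entry and key must have the same width")
    # A '*' position matches either bit value; other positions must be equal.
    return all(e == "*" or e == k for e, k in zip(entry, key))


def tcam_lookup(entries, key):
    """Return the index of the first matching entry, or None if nothing matches."""
    for index, entry in enumerate(entries):
        if ternary_match(entry, key):
            return index
    return None


# IPv4 five-tuple: source IP, destination IP, source port, destination port, protocol.
FIVE_TUPLE_BITS = 32 + 32 + 16 + 16 + 8

# Hypothetical 4-bit table, most specific entry first.
table = ["1010", "10**", "****"]
```

In this toy table the key "1011" skips the exact entry and hits the "10**" entry at index 1, while "0000" falls through to the catch-all entry.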

Rule Implementation in TCAM
• Example [figure not included in the transcript]

Structures of Encoded Rule and Encoded Search Key
• Hybrid encoding approach
  - Retains all of the fields in a rule and appends a sequence of code bits of length c_t, called the code vector (for a rule) or the index vector (for a search key), to form an encoded rule or an encoded search key
  - Uses some of the free bits in the TCAM slots for these vectors

TCAM-Based Search Key Encoding Process
• The search key encoding process
  - Assume that m_k ranges from the kth rule field in a rule database are selected for encoding. Then K search key fields must be matched against the corresponding K range tables to generate an index vector and, hence, an encoded search key
  - Must be performed at wire speed by the NPU
• DRES uses the TCAM coprocessor itself for sequential search key encoding
  - One encoded rule table
  - K range tables
  - Process
    - The NPU updates the index vector by performing an OR operation between the index vector and the returned intermediate index vector
    - The encoded search key is formed by appending the final index vector to the original search key
    - This encoded search key is used to match against the encoded rule table
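The sequential encoding loop can be sketched as follows. This is a simplified model, not the paper's implementation: each range table is represented as a list of (low, high, bit) tuples, and `lookup_range_table` stands in for one TCAM lookup that returns an intermediate index vector. The table contents, field names, and bit assignments are hypothetical.

```python
def lookup_range_table(range_table, value):
    """Stand-in for one TCAM lookup on a range table.

    Returns the intermediate index vector as an integer bitmask with one
    bit set for every encoded range that contains `value`.
    """
    vector = 0
    for low, high, bit in range_table:
        if low <= value <= high:
            vector |= 1 << bit
    return vector


def encode_search_key(fields, range_tables):
    """OR together the intermediate index vectors from the K range tables."""
    index_vector = 0
    for field_name, table in range_tables.items():
        index_vector |= lookup_range_table(table, fields[field_name])
    # The encoded search key is the original key plus the final index vector.
    return fields, index_vector


# Hypothetical range tables for K = 2 encoded fields.
range_tables = {
    "src_port": [(256, 512, 0), (1024, 65535, 1)],
    "dst_port": [(768, 2047, 2)],
}
```

Each additional encoded field adds one lookup to the loop, which is the TCAM access cost that has to be weighed against the storage saved.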

TCAM-Based Search Key Encoding Process
• This scheme may make heavy use of the TCAM coprocessor
  - It allows 4.28 TCAM lookups per packet
    - TCAM at 133 MHz (133 million lookups per second)
    - 31.3 million packets per second at a 10 Gbps line rate in the worst case
  - At a 2.5 Gbps line rate, each packet is allowed 17 lookups
  - Encoding more fields leads to heavier TCAM access contention but better TCAM storage efficiency
  - The ability of DRES to allow any subset of rule fields to be selected for encoding makes it possible to fully exploit this trade-off
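The lookup budget can be reproduced with a short calculation. The sketch below assumes worst-case traffic of back-to-back 40-byte packets, an assumption chosen because it yields the 31.3 million packets per second quoted on the slide. Under that assumption the budget at 10 Gbps comes out at roughly 4.26 lookups per packet, close to the 4.28 on the slide (the slide does not state its exact inputs), and at about 17 for 2.5 Gbps.

```python
def lookups_per_packet(tcam_lookups_per_sec, line_rate_bps, min_packet_bytes=40):
    """Lookup budget per packet for back-to-back minimum-size packets."""
    packets_per_sec = line_rate_bps / (min_packet_bytes * 8)
    return tcam_lookups_per_sec / packets_per_sec


TCAM_RATE = 133e6  # 133 MHz TCAM, one lookup per cycle

budget_10g = lookups_per_packet(TCAM_RATE, 10e9)
budget_2_5g = lookups_per_packet(TCAM_RATE, 2.5e9)
```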

Dynamic Range Selection Algorithm
• S_i is the number of subranges needed to exactly implement range R_i in a TCAM
• E_i is the number of rule entries needed to implement all of the rules that contain range R_i
• G_i is the encoding gain for R_i, defined as the number of rule entries that can be eliminated if R_i is encoded
• Example ranges shown on the slide: {256-512}, {768-2047}, {6000-6064}, {>1023}, {512-1536}, {>1023}, {256-512}
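To make S_i concrete, the sketch below counts the subranges for the example ranges using the standard range-to-prefix expansion on a 16-bit port field. This is a generic illustration of why unencoded ranges are expensive, not the paper's selection algorithm; the function name is hypothetical, and {>1023} is taken to mean 1024-65535.

```python
def range_to_prefixes(low, high, width=16):
    """Split [low, high] into the minimal list of (start, prefix_length) blocks."""
    prefixes = []
    while low <= high:
        # Largest power-of-two block that is aligned at `low`.
        size = low & -low if low else 1 << width
        # Shrink the block until it fits inside the remaining range.
        while size > high - low + 1:
            size >>= 1
        prefix_len = width - (size.bit_length() - 1)
        prefixes.append((low, prefix_len))
        low += size
    return prefixes


example_ranges = {
    "256-512": (256, 512),
    "768-2047": (768, 2047),
    "6000-6064": (6000, 6064),
    ">1023": (1024, 65535),
    "512-1536": (512, 1536),
}
subrange_counts = {name: len(range_to_prefixes(lo, hi))
                   for name, (lo, hi) in example_ranges.items()}
```

A rule with ranges in both port fields needs the product of the two counts in TCAM entries, so a rule with {>1023} in both ports would expand to 6 x 6 = 36 entries without encoding.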

Code Vector and Index Vector Encoding Algorithms
• The P2C algorithm is the most efficient
  - N ranges can be encoded using between log2(N+1) bits (best case) and N bits (worst case)
  - But it can be shown that the dynamic range selection problem can be translated in polynomial time to a weighted knapsack problem, which is NP-complete
• Bit-map intersection algorithm for DRES
  - For example, L3 has R6 in its source port and R2 in its destination port: *1***1***
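The bit-map intersection example can be reproduced with a small sketch. It assumes one bit per encoded range, with range R_i mapped to the ith position from the left of a 9-bit vector; that mapping is inferred from the slide's example string and is an assumption. A rule's code vector has '1' for each range it uses and '*' elsewhere; a search key's index vector has '1' for each range its fields fall in and '0' elsewhere.

```python
def code_vector(rule_ranges, num_bits):
    """Code vector for a rule: '1' for each encoded range it uses, '*' elsewhere."""
    bits = ["*"] * num_bits
    for r in rule_ranges:
        bits[r - 1] = "1"
    return "".join(bits)


def index_vector(matched_ranges, num_bits):
    """Index vector for a search key: '1' for each range it falls in, '0' elsewhere."""
    bits = ["0"] * num_bits
    for r in matched_ranges:
        bits[r - 1] = "1"
    return "".join(bits)


def vectors_match(code, index):
    """Ternary comparison of a rule's code vector against a key's index vector."""
    return all(c == "*" or c == i for c, i in zip(code, index))


# Rule L3 from the slide: R6 in the source port, R2 in the destination port.
l3_code = code_vector([6, 2], num_bits=9)
```

A key whose fields fall in R2, R6, and R7 still matches L3, because the extra R7 bit lines up with a don't-care position; a key that falls only in R2 does not.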

Encoded Range Update Process
• New rules may be added and old rules may be deleted from time to time
• The paper proposes a lock-free encoded range update algorithm
  - Allows the encoded range update and the search key encoding/PF table lookup processes to occur simultaneously without impacting lookup performance
• Updating a TCAM database without TCAM locking may generate erroneous or inconsistent lookups
  - An erroneous lookup may occur if a TCAM rule gets a match while the rule or its corresponding action is partially updated
  - An inconsistent lookup means that a search key does not match the best matched rule. It may occur when a match takes place in the middle of a database update process, and there is no guarantee of table consistency until the process finishes
• Write process, using a valid bit field in each TCAM slot
  1) inactivate the rule
  2) write the rule/action
  3) activate the rule again
• Move process, an operation that moves a rule-action pair to a new location in the TCAM and associated memory
  1) use a write process to write the pair to the new location
  2) inactivate the rule at the old location
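The write and move processes can be sketched with a toy in-memory model. The `Slot` class and function names are hypothetical, and the model runs single-threaded, so it only shows the ordering of the steps: a lookup that ignores slots whose valid bit is cleared never sees a half-written rule or action.

```python
from dataclasses import dataclass


@dataclass
class Slot:
    valid: bool = False  # models the valid bit field in a TCAM slot
    rule: str = ""
    action: str = ""


def write_rule(tcam, index, rule, action):
    slot = tcam[index]
    slot.valid = False    # 1) inactivate the rule
    slot.rule = rule      # 2) write the rule and its action
    slot.action = action
    slot.valid = True     # 3) activate the rule again


def move_rule(tcam, old, new):
    # 1) write the pair to the new location, 2) inactivate the old location.
    write_rule(tcam, new, tcam[old].rule, tcam[old].action)
    tcam[old].valid = False


def lookup(tcam, key):
    """Return the action of the first valid slot whose ternary rule matches."""
    for slot in tcam:
        if slot.valid and all(r == "*" or r == k for r, k in zip(slot.rule, key)):
            return slot.action
    return None
```

During a move the pair briefly exists at both locations, so a concurrent lookup still finds the rule at one of them.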

Encoding a Newly Selected Range & Releasing Encoded Ranges

Performance Evaluation Based on Real-world Databases
• Four real-world PF databases
• Storage expansion ratio = number of rule entries / number of rules in a TCAM
• FSP: the range frequency in the source port
• FDP: the range frequency in the destination port
• TF: the range frequency in both ports
• NSUB: the number of subranges
• DB1 and DB2 have RN1 and RN4 in both source and destination ports
• DB3 has RN1, RN4, and RN6 in the source port and RN1, RN7, and RN8 in the destination port
• DB4 has RN1, RN2, and RN3 in both ports and RN10 in the destination port

Performance Evaluation Based on Real-world Databases
• DRES is applied to all four databases for a TCAM with 64-bit slots
  - Five-tuple rule: 104 bits
  - Each rule entry takes two slots: 128 bits
  - This leaves 24 free bits, far more than 7, the maximum number of unique ranges
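The free-bit count follows from rounding the rule width up to a whole number of slots. The helper below is a hypothetical illustration of that arithmetic for the 64-bit case on the slide; the same function can be applied to the other slot widths mentioned earlier.

```python
import math


def free_bits(rule_bits, slot_bits):
    """Return (slots needed per rule entry, unused bits left for code vectors)."""
    slots = math.ceil(rule_bits / slot_bits)
    return slots, slots * slot_bits - rule_bits


slots_64, free_64 = free_bits(104, 64)
```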

Analytical Performance Evaluation
• A probabilistic model is used to analyze the performance of DRES over a wide range of parameters

Analytical Performance Evaluation
• The popularity distribution closely follows a Zipf-like distribution, with z = 3
  - z is the Zipf coefficient
  - The popularity of the jth range is determined by its popularity rank
  - c is a normalization factor
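The slide's formula did not survive in the transcript, so the sketch below uses the textbook Zipf form, in which the range with popularity rank j has probability c / j^z and c normalizes the probabilities to sum to 1. This form is an assumption and may differ in notation from the paper; the function name and the choice of 7 ranges are illustrative.

```python
def zipf_popularity(num_ranges, z=3.0):
    """Popularity of each range by rank, p_j = c / j**z, normalized to sum to 1."""
    weights = [1.0 / (rank ** z) for rank in range(1, num_ranges + 1)]
    c = 1.0 / sum(weights)  # normalization factor
    return [c * w for w in weights]


popularity = zipf_popularity(7, z=3.0)
```

With z = 3 the distribution is heavily skewed: the most popular range is eight times as frequent as the second.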

Impact of the Number of Unique Ranges

Conclusion
• DRES is proposed to improve TCAM storage efficiency in support of range matching
  - Bottom-up approach
  - Works with an existing TCAM coprocessor with only a software upgrade
  - The TCAM storage expansion ratio is reduced from 6.2 to as low as 1.23
• A key feature of DRES is its ability to have full control over the encoded rule size
  - Exploits the TCAM structure to maximize the encoding gains
  - Adopts the bit-map intersection encoding scheme
  - Includes a range selection algorithm
  - Uses a lock-free encoded range update algorithm

Reference
[1] Hao Che, Zhijun Wang, Kai Zheng, Bin Liu, “DRES: Dynamic Range Encoding Scheme for TCAM Coprocessors,” IEEE Transactions on Computers, Vol. 57, No. 7, July 2008.