IPv6-Oriented 4 OC768 Packet Classification with Deriving-Merging Partition and Field- Variable Encoding Scheme Mr. Xin Zhang Undergrad. in Tsinghua University,

Slides:



Advertisements
Similar presentations
A Search Memory Substrate for High Throughput and Low Power Packet Processing Sangyeun Cho, Michel Hanna and Rami Melhem Dept. of Computer Science University.
Advertisements

August 17, 2000 Hot Interconnects 8 Devavrat Shah and Pankaj Gupta
Ion Stoica, Robert Morris, David Karger, M. Frans Kaashoek, Hari Balakrishnan MIT and Berkeley presented by Daniel Figueiredo Chord: A Scalable Peer-to-peer.
Packet Classification using Hierarchical Intelligent Cuttings
Multi-dimensional Packet Classification on FPGA: 100Gbps and Beyond
Balajee Vamanan, Gwendolyn Voskuilen, and T. N. Vijaykumar School of Electrical & Computer Engineering SIGCOMM 2010.
NetFPGA Project: 4-Port Layer 2/3 Switch Ankur Singla Gene Juknevicius
A Scalable and Reconfigurable Search Memory Substrate for High Throughput Packet Processing Sangyeun Cho and Rami Melhem Dept. of Computer Science University.
1 An Efficient, Hardware-based Multi-Hash Scheme for High Speed IP Lookup Hot Interconnects 2008 Socrates Demetriades, Michel Hanna, Sangyeun Cho and Rami.
HybridCuts: A Scheme Combining Decomposition and Cutting for Packet Classification Author: Wenjun Li, Xianfeng Li Publisher: 2013 IEEE 21 st Annual Symposium.
Outline Introduction Related work on packet classification Grouper Performance Empirical Evaluation Conclusions.
A Ternary Unification Framework for Optimizing TCAM-Based Packet Classification Systems Author: Eric Norige, Alex X. Liu, and Eric Torng Publisher: ANCS.
1 TCAM Razor: A Systematic Approach Towards Minimizing Packet Classifiers in TCAMs Department of Computer Science and Information Engineering National.
ClassBench: A Packet Classification Benchmark
1 High-performance TCAM- based IP Lookup Engines Authors: Hui Yu, Jing Chenm Jianpian Wang and S.Q. Zheng Publisher: IEEE INFOCOM 2008 Present: 林呈俞 Date:
A Dynamic Binary Hash Scheme for IPv6 Lookup Q. Sun 1, X. Huang 1, X. Zhou 1, and Y. Ma 1,2 1. School of Computer Science and Technology 2. Beijing Key.
Fast Filter Updates for Packet Classification using TCAM Authors: Haoyu Song, Jonathan Turner. Publisher: GLOBECOM 2006, IEEE Present: Chen-Yu Lin Date:
Making Parallel Packet Switches Practical Sundar Iyer, Nick McKeown Departments of Electrical Engineering & Computer Science,
1 A Tree Based Router Search Engine Architecture With Single Port Memories Author: Baboescu, F.Baboescu, F. Tullsen, D.M. Rosu, G. Singh, S. Tullsen, D.M.Rosu,
Packet Classification on Multiple Fields Pankaj Gupta and Nick McKeown Stanford University {pankaj, September 2, 1999.
1 Energy Efficient Multi-match Packet Classification with TCAM Fang Yu
Efficient Multidimensional Packet Classification with Fast Updates Author: Yeim-Kuan Chang Publisher: IEEE TRANSACTIONS ON COMPUTERS, VOL. 58, NO. 4, APRIL.
1 Energy Efficient Packet Classification Hardware Accelerator Alan Kennedy, Xiaojun Wang HDL Lab, School of Electronic Engineering, Dublin City University.
SSA: A Power and Memory Efficient Scheme to Multi-Match Packet Classification Fang Yu 1 T. V. Lakshman 2 Martin Austin Motoyama 1 Randy H. Katz 1 1 EECS.
EaseCAM: An Energy And Storage Efficient TCAM-based IP-Lookup Architecture Rabi Mahapatra Texas A&M University;
Worst-Case TCAM Rule Expansion Ori Rottenstreich (Technion, Israel) Joint work with Isaac Keslassy (Technion, Israel)
1 Efficient packet classification using TCAMs Authors: Derek Pao, Yiu Keung Li and Peng Zhou Publisher: Computer Networks 2006 Present: Chen-Yu Lin Date:
1 Route Table Partitioning and Load Balancing for Parallel Searching with TCAMs Department of Computer Science and Information Engineering National Cheng.
Sarang Dharmapurikar With contributions from : Praveen Krishnamurthy,
Presented by Group 2: Presented by Group 2: Shan Gao ( ) Shan Gao ( ) Dayang Yu ( ) Dayang Yu ( ) Jiayu Zhou ( ) Jiayu Zhou.
Approximate Load Balance Based on ID/Locator Split Routing Architecture 1 Sanqi Zhou, Jia Chen, Hongbin Luo, Hongke Zhang Beijing JiaoTong University
PARALLEL TABLE LOOKUP FOR NEXT GENERATION INTERNET
Fast and deterministic hash table lookup using discriminative bloom filters  Author: Kun Huang, Gaogang Xie,  Publisher: 2013 ELSEVIER Journal of Network.
(TPDS) A Scalable and Modular Architecture for High-Performance Packet Classification Authors: Thilan Ganegedara, Weirong Jiang, and Viktor K. Prasanna.
Timothy Whelan Supervisor: Mr Barry Irwin Security and Networks Research Group Department of Computer Science Rhodes University Hardware based packet filtering.
Author: Haoyu Song, Fang Hao, Murali Kodialam, T.V. Lakshman Publisher: IEEE INFOCOM 2009 Presenter: Chin-Chung Pan Date: 2009/12/09.
On Energy-Efficient Trap Coverage in Wireless Sensor Networks Junkun Li, Jiming Chen, Shibo He, Tian He, Yu Gu, Youxian Sun Zhejiang University, China.
Wire Speed Packet Classification Without TCAMs ACM SIGMETRICS 2007 Qunfeng Dong (University of Wisconsin-Madison) Suman Banerjee (University of Wisconsin-Madison)
Packet Classification on Multiple Fields 참고 논문 : Pankaj Gupta and Nick McKeown SigComm 1999.
Firewall Fingerprinting Amir R. Khakpour 1, Joshua W. Hulst 1, Zhihui Ge 2, Alex X. Liu 1, Dan Pei 2, Jia Wang 2 1 Michigan State University 2 AT&T Labs.
Packet Classifiers In Ternary CAMs Can Be Smaller Qunfeng Dong (University of Wisconsin-Madison) Suman Banerjee (University of Wisconsin-Madison) Jia Wang.
Multi-Field Range Encoding for Packet Classification in TCAM Author: Yeim-Kuan Chang, Chun-I Lee and Cheng-Chien Su Publisher: INFOCOM 2011 Presenter:
1. Outline Introduction Related work on packet classification Grouper Performance Analysis Empirical Evaluation Conclusions 2/42.
StrideBV: Single chip 400G+ packet classification Author: Thilan Ganegedara, Viktor K. Prasanna Publisher: HPSR 2012 Presenter: Chun-Sheng Hsueh Date:
1 Fast packet classification for two-dimensional conflict-free filters Department of Computer Science and Information Engineering National Cheng Kung University,
SCALABLE PACKET CLASSIFICATION USING INTERPRETING A CROSS-PLATFORM MULTI-CORE SOLUTION Author: Haipeng Cheng, Zheng Chen, Bei Hua and Xinan Tang Publisher/Conf.:
Efficient Cache Structures of IP Routers to Provide Policy-Based Services Graduate School of Engineering Osaka City University
A Smart Pre-Classifier to Reduce Power Consumption of TCAMs for Multi-dimensional Packet Classification Yadi Ma, Suman Banerjee University of Wisconsin-Madison.
High-Speed Policy-Based Packet Forwarding Using Efficient Multi-dimensional Range Matching Lakshman and Stiliadis ACM SIGCOMM 98.
Bit Weaving: A Non-Prefix Approach to Compressing Packet Classifiers in TCAMs Author: Chad R. Meiners, Alex X. Liu, and Eric Torng Publisher: 2012 IEEE/ACM.
Cross-Product Packet Classification in GNIFS based on Non-overlapping Areas and Equivalence Class Author: Mohua Zhang, Ge Li Publisher: AISS 2012 Presenter:
CS 740: Advanced Computer Networks IP Lookup and classification Supplemental material 02/05/2007.
Tracking Millions of Flows In High Speed Networks for Application Identification Tian Pan, Xiaoyu Guo, Chenhui Zhang, Junchen Jiang, Hao Wu and Bin Liut.
Lightweight Traffic-Aware Packet Classification for Continuous Operation Author: Shariful Hasan Shaikot, Min Sik Kim Presenter: Yen-Chun Tseng Date: 2014/11/26.
Parallel tree search: An algorithmic approach for multi- field packet classification Authors: Derek Pao and Cutson Liu. Publisher: Computer communications.
Packet Classification Using Multidimensional Cutting Sumeet Singh (UCSD) Florin Baboescu (UCSD) George Varghese (UCSD) Jia Wang (AT&T Labs-Research) Reviewed.
On-Chip Logic Minimization Roman Lysecky & Frank Vahid* Department of Computer Science and Engineering University of California, Riverside *Also with the.
IP Routing table compaction and sampling schemes to enhance TCAM cache performance Author: Ruirui Guo a, Jose G. Delgado-Frias Publisher: Journal of Systems.
1 Space-Efficient TCAM-based Classification Using Gray Coding Authors: Anat Bremler-Barr and Danny Hendler Publisher: IEEE INFOCOM 2007 Present: Chen-Yu.
1 IP Routing table compaction and sampling schemes to enhance TCAM cache performance Author: Ruirui Guo, Jose G. Delgado-Frias Publisher: Journal of Systems.
Stochastic Pre-Classification for SDN Data Plane Matching Author : Luke McHale, C. Jasson Casey, Paul V. Gratz, Alex Sprintson Conference: 2014 IEEE 22nd.
DRES: Dynamic Range Encoding Scheme for TCAM Coprocessors 2008 YU-ANTL Lab Seminar June 11, 2008 JeongKi Park Advanced Networking Technology Lab. (YU-ANTL)
IP Address Lookup Masoud Sabaei Assistant professor Computer Engineering and Information Technology Department, Amirkabir University of Technology.
Ethernet Packet Filtering – Part 2 Øyvind Holmeide 10/28/2014 by.
Toward Advocacy-Free Evaluation of Packet Classification Algorithms
Efficient Document Analytics on Compressed Data: Method, Challenges, Algorithms, Insights Feng Zhang †⋄, Jidong Zhai ⋄, Xipeng Shen #, Onur Mutlu ⋆, Wenguang.
Packet Classification Using Coarse-Grained Tuple Spaces
Worst-Case TCAM Rule Expansion
Authors: Ding-Yuan Lee, Ching-Che Wang, An-Yeu Wu Publisher: 2019 VLSI
Hazem Hamed, Adel El-Atawy, Ehab Al-Shaer
Presentation transcript:

IPv6-Oriented 4 OC768 Packet Classification with Deriving-Merging Partition and Field- Variable Encoding Scheme Mr. Xin Zhang Undergrad. in Tsinghua University, Beijing, P. R. China Presentation at IEEE INFOCOM’06 Apr. 26, 2006, Barcelona, Spain X. Zhang, B. Liu, W. Li, Y. Xi, D. Bermingham, X. Wang

Mr. Xin Zhang, IEEE INFOCOM’06, Barcelona, Spain 2/20 Outline V. Conclusion IV. Performance Analysis III. Detailed Ideas & Solutions II. Related Works & Motivation I. Brief Background

Mr. Xin Zhang, IEEE INFOCOM’06, Barcelona, Spain 3/20 Sour. IP (SIP) Dest. IP (DIP)Sour. Port (SP)Dest. Port (DP)Prot. I. Brief Background A Typical IPv6 5-tuple Rule TCAM and Range Encoding Prefix format bits Range format bits Exact value 8 bits TCAM Partition Y Partition B Partition G Coded Ranges Encoding Table Search Key Codes

Mr. Xin Zhang, IEEE INFOCOM’06, Barcelona, Spain 4/20 II. Related Works & Motivation Parallel distributed schemes fail to provide excellent balancing ( K. Zheng, infocom’04, ’05) Encoding speed becomes the bottleneck for even higher throughput (J.V. L-P 2 C, D. Pao-PIC, C. Hao-DRES) When scalable to 4*OC-768 or IPv6, previous schemes call for prohibitive cost Our Goals?

Mr. Xin Zhang, IEEE INFOCOM’06, Barcelona, Spain 5/20 Robust- ness Low Storage IPv6 Oriented 4*OC-768 long prefix unknown features Maximum Parallelism High Utilization Load Balancing 4*OC-768 Goals Minimum Parallelism II. Related Works & Motivation

Mr. Xin Zhang, IEEE INFOCOM’06, Barcelona, Spain 6/20 Fast, robust economical III. 3-level load balancing High speed low cost IPv6 Long Prefix Save storage Adaptive algorithms Practical I. IPv6 5-tuple rule encoding II. 3-plane parallel encoding Encoding Performance Speed & Reliability Unknown Features Overview of Solutions II. Related Works & Motivation

Mr. Xin Zhang, IEEE INFOCOM’06, Barcelona, Spain 7/ tuple IPv6 Rule Encoding For SIP & DIP IPv6 Addresses MulticastUnicastAnycast Can be directly removed Encode the first 16 bits to 8 bits III. Detailed Ideas & Solutions For Protocol Field Prot. 8 bitsProt. 4 bits

Mr. Xin Zhang, IEEE INFOCOM’06, Barcelona, Spain 8/20 Layer #1 [1-1023] (LO)[ ] (HI) Layer #2 EM Layer #3 AR Layer #n … AR well known (LO), ephemeral user (HI), arbitrary range (AR) and exact match (EM) port have different properties Pre-defined Layers tuple IPv6 Rule Encoding For SP & DP III. Detailed Ideas & Solutions To prevent from running out of bits at certain layer resulting in the increasing update complexity Pre-defined bits SP (12): 1/6/2  1/6/2/2/1 DP (24): 1/9/6/2  1/9/6/2/2/2/2

Mr. Xin Zhang, IEEE INFOCOM’06, Barcelona, Spain 9/20 III. Detailed Ideas & Solutions Principles of Encoding High Speed Low Costs Field-Variable Processes moderate parallelism On-chip RAM Plane Parallel Encoding SIP + DIP + SP + DP + Prot

Mr. Xin Zhang, IEEE INFOCOM’06, Barcelona, Spain 10/ Plane Parallel Encoding 5-tuple rule S&DIPSPDP Plane #1 HI/LO EM SP or DP AR HI/LO EMAR Plane #2 III. Detailed Ideas & Solutions Prot. Plane #1 and #2 still fail to match 4*OC 768

Mr. Xin Zhang, IEEE INFOCOM’06, Barcelona, Spain 11/ Plane Parallel Encoding 5-tuple rule S&DIPSP-EM Plane #1 SP-ARDP-EMDP-AR Plane #2 S&DIP SP-EM SP-AR DP-EM SP-AR DP-AR Plane #3: Field-Variable Parallelism III. Detailed Ideas & Solutions S&DIPSPDP Plane #3? Prot. Inefficient and costly! Note that different fields have different processing speeds

Mr. Xin Zhang, IEEE INFOCOM’06, Barcelona, Spain 12/ Level Load Balancing III. Detailed Ideas & Solutions Problem Statement of Load Balancing Distributed Storage Complete Policy Table + 2 Parallel TCAMs Balanced TrafficBalanced StorageLow Redundancy packets

Mr. Xin Zhang, IEEE INFOCOM’06, Barcelona, Spain 13/ Level Load Balancing Size Threshold Sub tables Deriving-Merging Policy Table Partition Remove “bad” bits through some heuristic standards III. Detailed Ideas & Solutions Deriving-Merging Adjustment (DMA) Preliminary Partition Bits (PPB)

Mr. Xin Zhang, IEEE INFOCOM’06, Barcelona, Spain 14/ Level Load Balancing TCAM#1 No. of rules Sub tables TCAM#2 #1 busier than #2? Yes No Packet III. Detailed Ideas & Solutions Policy Table Partition Distribution among TCAMs Redundancy Based Dynamic Balancing

Mr. Xin Zhang, IEEE INFOCOM’06, Barcelona, Spain 15/20 IV. Performance Analysis Implemental Results for 3-Plane Encoding SIP& DIP SPDP Protocol AREMAREM Original speed (RAM cycle) Parallel Num. of Encoding Units Parallel speed (RAM cycle) RAM Costs (Block) 8* M4K 56* M * M512 2* M512 Storage Req. (Kbit RAM)

Mr. Xin Zhang, IEEE INFOCOM’06, Barcelona, Spain 16/20 Fig. 1 Sizes of sub tables after PPB For IPv4, the original 5-tuple rule is 104-bit long. The final number of candidate bits is decreased dramatically to 15. The redundancy, max group size, average group size are all smaller than those in similar research Experimental Results for Policy Table Partition Fig.2 Sizes of sub tables after DMA IV. Performance Analysis

Mr. Xin Zhang, IEEE INFOCOM’06, Barcelona, Spain 17/20 Experimental Results for Policy Table Partition Rule Set#4 (264 rules) PID Case1 (rules) Case2 (rules) Case3 (rules) PID Case1 (rules) Case2 (rules) Case3 (rules) SumCase1: 852 rules; Case2: 264 rules; Case3: 510 rules IV. Performance Analysis

Mr. Xin Zhang, IEEE INFOCOM’06, Barcelona, Spain 18/20 Experimental Results for Distribution Algorithms TCAMSub Group ID (Excluding GR) Num. of Rules Traffic Load Ratio # % # % # % # % IV. Performance Analysis

Mr. Xin Zhang, IEEE INFOCOM’06, Barcelona, Spain 19/20 Throughput Storage Worst-Case Loss Probability & Processing Delay Classify 266Mpps with TCAM working at 133MSPS (double-data rate I/O), RAM at 266Mhz Employ 4 TCAM chips with 1.78 times of the original table size, and 125 Kbit on-chip RAMs compared to 8 TCAM chips in C. Hao-DRES and K.Zheng-infocom’05 The loss probability is well close to zero when buffer depth>=5, and delay is only 12T c (TCAM cycle) compared to 54T c in K. Zheng-infocom’05 Update TCAM can be easily updated with “CoPTUA”; inserting a new range does not bother existing codes in the pre-defined bits manner. compared to P 2 C IV. Performance Analysis

Mr. Xin Zhang, IEEE INFOCOM’06, Barcelona, Spain 20/20 V. Conclusion Achieved ultra high throughput matching 4*OC768 line rate with the TCAM storage 1.7 times the policy table Propose the Deriving-Merging Partition and 3- level balancing resulting in a guaranteed worst case performance We designed the 3-plane IPv6 rule encoding scheme matching 4*OC768 line rate with 125 Kbits on-chip RAM We proposed a set of adaptive algorithms to deal with different IPv6 policy table characteristics