Scalable Memory-Less Architecture for String Matching With FPGAs

Slides:



Advertisements
Similar presentations
A Search Memory Substrate for High Throughput and Low Power Packet Processing Sangyeun Cho, Michel Hanna and Rami Melhem Dept. of Computer Science University.
Advertisements

A Scalable and Reconfigurable Search Memory Substrate for High Throughput Packet Processing Sangyeun Cho and Rami Melhem Dept. of Computer Science University.
1 An Efficient, Hardware-based Multi-Hash Scheme for High Speed IP Lookup Hot Interconnects 2008 Socrates Demetriades, Michel Hanna, Sangyeun Cho and Rami.
Bio Michel Hanna M.S. in E.E., Cairo University, Egypt B.S. in E.E., Cairo University at Fayoum, Egypt Currently is a Ph.D. Student in Computer Engineering.
Hybrid Data Structure for IP Lookup in Virtual Routers Using FPGAs Authors: Oĝuzhan Erdem, Hoang Le, Viktor K. Prasanna, Cüneyt F. Bazlamaçcı Publisher:
A Memory-Efficient Reconfigurable Aho-Corasick FSM Implementation for Intrusion Detection Systems Authors: Seongwook Youn and Dennis McLeod Presenter:
Low Power TCAM Forwarding Engine for IP Packets Authors: Alireza Mahini, Reza Berangi, Seyedeh Fatemeh and Hamidreza Mahini Presenter: Yi-Sheng, Lin (
Pipelined Parallel AC-based Approach for Multi-String Matching Department of Computer Science and Information Engineering National Cheng Kung University,
1 Author: Ioannis Sourdis, Sri Harsha Katamaneni Publisher: IEEE ASAP,2011 Presenter: Jia-Wei Yo Date: 2011/11/16 Longest prefix Match and Updates in Range.
Power Efficient IP Lookup with Supernode Caching Lu Peng, Wencheng Lu*, and Lide Duan Dept. of Electrical & Computer Engineering Louisiana State University.
Scalable IPv6 Lookup/Update Design for High-Throughput Routers Authors: Chung-Ho Chen, Chao-Hsien Hsu, Chen -Chieh Wang Presenter: Yi-Sheng, Lin ( 林意勝.
An Efficient Hardware-based Multi-hash Scheme for High Speed IP Lookup Department of Computer Science and Information Engineering National Cheng Kung University,
Parallel IP Lookup using Multiple SRAM-based Pipelines Authors: Weirong Jiang and Viktor K. Prasanna Presenter: Yi-Sheng, Lin ( 林意勝 ) Date:
Parallel-Search Trie-based Scheme for Fast IP Lookup
Pipelined Architecture For Multi-String Match Department of Computer Science and Information Engineering National Cheng Kung University, Taiwan R.O.C.
Study of IP address lookup Schemes
An Efficient IP Lookup Architecture with Fast Update Using Single-Match TCAMs Author: Jinsoo Kim, Junghwan Kim Publisher: WWIC 2008 Presenter: Chen-Yu.
1 Performing packet content inspection by longest prefix matching technology Authors: Nen-Fu Huang, Yen-Ming Chu, Yen-Min Wu and Chia- Wen Ho Publisher:
1 Efficient packet classification using TCAMs Authors: Derek Pao, Yiu Keung Li and Peng Zhou Publisher: Computer Networks 2006 Present: Chen-Yu Lin Date:
1 Route Table Partitioning and Load Balancing for Parallel Searching with TCAMs Department of Computer Science and Information Engineering National Cheng.
Packet Classification Using Multi-Iteration RFC Author: Chun-Hui Tsai, Hung-Mao Chu, Pi-Chung Wang Publisher: COMPSACW, 2013 IEEE 37th Annual (Computer.
LayeredTrees: Most Specific Prefix based Pipelined Design for On-Chip IP Address Lookups Author: Yeim-Kuau Chang, Fang-Chen Kuo, Han-Jhen Guo and Cheng-Chien.
GLOBECOM (Global Communications Conference), 2012
A Regular Expression Matching Algorithm Using Transition Merging Department of Computer Science and Information Engineering National Cheng Kung University,
A Hybrid IP Lookup Architecture with Fast Updates Author : Layong Luo, Gaogang Xie, Yingke Xie, Laurent Mathy, Kavé Salamatian Conference: IEEE INFOCOM,
1 Optimization of Regular Expression Pattern Matching Circuits on FPGA Department of Computer Science and Information Engineering National Cheng Kung University,
IP Routing Processing with Graphic Processors Author: Shuai Mu, Xinya Zhang, Nairen Zhang, Jiaxin Lu, Yangdong Steve Deng, Shu Zhang Publisher: IEEE Conference.
Regular Expression Matching for Reconfigurable Packet Inspection Authors: Jo˜ao Bispo, Ioannis Sourdis, Jo˜ao M.P. Cardoso and Stamatis Vassiliadis Publisher:
StrideBV: Single chip 400G+ packet classification Author: Thilan Ganegedara, Viktor K. Prasanna Publisher: HPSR 2012 Presenter: Chun-Sheng Hsueh Date:
1 Power-Efficient TCAM Partitioning for IP Lookups with Incremental Updates Author: Yeim-Kuan Chang Publisher: ICOIN 2005 Presenter: Po Ting Huang Date:
PARALLEL-SEARCH TRIE- BASED SCHEME FOR FAST IP LOOKUP Author: Roberto Rojas-Cessa, Lakshmi Ramesh, Ziqian Dong, Lin Cai Nirwan Ansari Publisher: IEEE GLOBECOM.
A Dynamic Longest Prefix Matching Content Addressable Memory for IP Routing Author: Satendra Kumar Maurya, Lawrence T. Clark Publisher: IEEE TRANSACTIONS.
Memory-Efficient Regular Expression Search Using State Merging Author: Michela Becchi, Srihari Cadambi Publisher: INFOCOM th IEEE International.
Research on TCAM-based OpenFlow Switch Author: Fei Long, Zhigang Sun, Ziwen Zhang, Hui Chen, Longgen Liao Conference: 2012 International Conference on.
Memory-Efficient and Scalable Virtual Routers Using FPGA Department of Computer Science and Information Engineering, National Cheng Kung University, Tainan,
Updating Designed for Fast IP Lookup Author : Natasa Maksic, Zoran Chicha and Aleksandra Smiljani´c Conference: IEEE High Performance Switching and Routing.
A Fast Regular Expression Matching Engine for NIDS Applying Prediction Scheme Author: Lei Jiang, Qiong Dai, Qiu Tang, Jianlong Tan and Binxing Fang Publisher:
Range Enhanced Packet Classification Design on FPGA Author: Yeim-Kuan Chang, Chun-sheng Hsueh Publisher: IEEE Transactions on Emerging Topics in Computing.
PC-TRIO: A Power Efficient TACM Architecture for Packet Classifiers Author: Tania Banerjee, Sartaj Sahni, Gunasekaran Seetharaman Publisher: IEEE Computer.
Packet Classification Using Dynamically Generated Decision Trees
Hierarchical Hybrid Search Structure for High Performance Packet Classification Authors : O˜guzhan Erdem, Hoang Le, Viktor K. Prasanna Publisher : INFOCOM,
Scalable Multi-match Packet Classification Using TCAM and SRAM Author: Yu-Chieh Cheng, Pi-Chung Wang Publisher: IEEE Transactions on Computers (2015) Presenter:
Author: Yun R. Qu, Shijie Zhou, and Viktor K. Prasanna Publisher:
Reorganized and Compact DFA for Efficient Regular Expression Matching
2018/4/27 PiDFA : A Practical Multi-stride Regular Expression Matching Engine Based On FPGA Author: Jiajia Yang, Lei Jiang, Qiu Tang, Qiong Dai, Jianlong.
High-throughput Online Hash Table on FPGA
2018/6/26 An Energy-efficient TCAM-based Packet Classification with Decision-tree Mapping Author: Zhao Ruan, Xianfeng Li , Wenjun Li Publisher: 2013.
Statistical Optimal Hash-based Longest Prefix Match
P4-to-VHDL: Automatic Generation of 100 Gbps Packet Parsers
Parallel Processing Priority Trie-based IP Lookup Approach
2018/12/10 Energy Efficient SDN Commodity Switch based Practical Flow Forwarding Method Author: Amer AlGhadhban and Basem Shihada Publisher: 2016 IEEE/IFIP.
2018/12/29 A Novel Approach for Prefix Minimization using Ternary trie (PMTT) for Packet Classification Author: Sanchita Saha Ray, Abhishek Chatterjee,
Binary Prefix Search Author: Yeim-Kuan Chang
Memory-Efficient Regular Expression Search Using State Merging
Virtual TCAM for Data Center Switches
A Small and Fast IP Forwarding Table Using Hashing
Scalable Multi-Match Packet Classification Using TCAM and SRAM
A New String Matching Algorithm Based on Logical Indexing
EMOMA- Exact Match in One Memory Access
Compact DFA Structure for Multiple Regular Expressions Matching
2019/5/5 A Flexible Wildcard-Pattern Matching Accelerator via Simultaneous Discrete Finite Automata Author: Hsiang-Jen Tsai, Chien-Chih Chen, Yin-Chi Peng,
Power-efficient range-match-based packet classification on FPGA
Large-scale Packet Classification on FPGA
Authors: A. Rasmussen, A. Kragelund, M. Berger, H. Wessing, S. Ruepp
A Hybrid IP Lookup Architecture with Fast Updates
A SRAM-based Architecture for Trie-based IP Lookup Using FPGA
Authors: Ding-Yuan Lee, Ching-Che Wang, An-Yeu Wu Publisher: 2019 VLSI
MEET-IP Memory and Energy Efficient TCAM-based IP Lookup
Towards TCAM-based Scalable Virtual Routers
Packet Classification Using Binary Content Addressable Memory
Presentation transcript:

Scalable Memory-Less Architecture for String Matching With FPGAs Authors: Ideh Sarbishei, Shervin Vakili, J.M. Pierre Langlois, Yvon Savaria Publisher: 2017 IEEE International Symposium on Circuits and Systems (ISCAS) Presenter: Tzu-Chieh, Lin Date: 106/11/22 National Cheng Kung University CSIE Computer & Internet Architecture Lab

INTRODUCTION A string matching engine searches for all occurrences of a given string inside a predetermined target string set. When a match is found, its location and associated information are returned. Many of applications require real-time processing, such as solving the Longest Prefix Match (LPM) problem, which is a crucial part of the IP lookup process. Existing approaches for string matching can be categorized into Ternary Content Addressable Memory (TCAM)-based , trie-based techniques, and TCAM-emulation. Computer & Internet Architecture Lab CSIE, National Cheng Kung University

INTRODUCTION TCAM-based techniques exploit the TCAM capability to perform high-speed parallel searches in the target string set, but have limited access speed, high power consumption, and high cost. TCAM-emulation techniques aim at reducing TCAM cost and power consumption using components that replicate TCAM functionality. Trie-based techniques create search trees from the given target string set and store the tree information in memory using a trie data structure. For applications such as LPM using large prefix sets, trie information may not fit in on-chip memories, thus increasing search latency. Computer & Internet Architecture Lab CSIE, National Cheng Kung University

RELATED WORK Clark and Schimmel suggested an FPGA implementation of a scalable string match that allows adjusting the trade-off between capacity and throughput according to application requirements. They applied multi-character decoders to their design in order to improve performance and eliminate redundant comparisons. Xilinx has developed an IP core that can be configured to utilize either SRLs or BRAM resources to emulate the functionality of different TCAM core sizes. Computer & Internet Architecture Lab CSIE, National Cheng Kung University

SPLIT-BUCKET ARCHITECTURE A parallel comparison with all the strings in the target string set is performed in the MB using the following three pipelined sub-blocks: the Decoder Block (DB), the Comparator Block (CB) and the Bucket Selection Block (BSB). Computer & Internet Architecture Lab CSIE, National Cheng Kung University

Decoder Block (DB) The input string is thus split into v segments of k bits, each one decoded by a k-to-2k binary decoder. For non-binary alphabets, symbols should first be encoded in binary to allow the utilization of binary decoders. Computer & Internet Architecture Lab CSIE, National Cheng Kung University

Comparator Block (CB) The Split-Bucket architecture employs a string partitioning scheme to improve string matching efficiency It partitions the target string set into a predefined number of buckets using a bucket identifier. The bucket identifier is an n-bit portion of the string (String(Bis : BIe)). The BIs, BIe indicate the starting and ending indexes of the bucket identifier, respectively. Strings with the same bucket identifier share a bucket. Computer & Internet Architecture Lab CSIE, National Cheng Kung University

The CB consists of parallel components, called CBucket. Each CBucket compares the decoded input string with the target strings of the corresponding bucket using several AND operations, each corresponding to a target string. When a string is added to the target string set, the corresponding AND operation must be appended to the appropriate CBucket. Computer & Internet Architecture Lab CSIE, National Cheng Kung University

Computer & Internet Architecture Lab CSIE, National Cheng Kung University

Computer & Internet Architecture Lab CSIE, National Cheng Kung University

AND operation The AND operation inputs are the outputs of the decoder for the corresponding segment. The number of input ports of the ath AND operation ( 𝐼𝑛𝑃𝑜𝑟𝑡𝑠 𝑎 ) depends on the bit-width of the a 𝑡ℎ string in the target string set ( 𝐵𝑊 𝑎 ). If k divides 𝐵𝑊 𝑎 , the 𝐼𝑛𝑃𝑜𝑟𝑡𝑠 𝑎 is equal to 𝐵𝑊 𝑎 𝑘 . Otherwise, the number of input ports of the a 𝑡ℎ AND operation for this case is calculated by (1): Computer & Internet Architecture Lab CSIE, National Cheng Kung University

Bucket Selection Block (BSB) The BSB consists of a multiplexer that selects the valid comparison results and passes the outputs of the appropriate CBucket according to the input string bucket identifier Computer & Internet Architecture Lab CSIE, National Cheng Kung University

Encoder Block (EB) The EB is composed of two pipelined stages. The first stage encodes the MB output using a binary encoder while the second stage calculates the global match address. The encoder passes the local address of the match address (AddrLocal) in the appropriate CBucket. Computer & Internet Architecture Lab CSIE, National Cheng Kung University

IP ADDRESS LOOKUP WITH THE SPLIT-BUCKET An IP address lookup engine based on the Split-Bucket was implemented on a Virtex-7 FPGA using the Xilinx ISE 13.4 tool. We chose n = 8, which partitions the routing table into 256 buckets, bits 16 to 23 were found to be the most appropriate bucket identifier. In this experiment, the important parameters of the proposed architecture have the following values: w = 32, k = 8, v = 4, BIs = 9, and BIe = 16. Computer & Internet Architecture Lab CSIE, National Cheng Kung University

Computer & Internet Architecture Lab CSIE, National Cheng Kung University

Computer & Internet Architecture Lab CSIE, National Cheng Kung University

Computer & Internet Architecture Lab CSIE, National Cheng Kung University

[4] Memory efficient IP lookup in 100 GBPS networks [2] High performance IP lookup on FPGA with combined length-infix pipelined search [3] Automatic synthesis of efficient intrusion detection systems on FPGA [4] Memory efficient IP lookup in 100 GBPS networks [5] Scalable Pattern Matching for High Speed Networks [6] Parameterizable Content-Addressable Computer & Internet Architecture Lab CSIE, National Cheng Kung University