Author : Masanori Bando and H. Jonathan Chao Publisher : INFOCOM, 2010 Presenter : Jo-Ning Yu Date : 2011/02/16.

Slides:



Advertisements
Similar presentations
An On-Chip IP Address Lookup Algorithm Author: Xuehong Sun and Yiqiang Q. Zhao Publisher: IEEE TRANSACTIONS ON COMPUTERS, 2005 Presenter: Yu Hao, Tseng.
Advertisements

A Fast and Memory Efficient Dynamic IP Lookup Algorithm Based on B-Tree Author:Yeim-Kuan Chang and Yung-Chieh Lin Publisher: 2009 International Conference.
IP Routing Lookups Scalable High Speed IP Routing Lookups.
Exploiting Graphics Processors for High- performance IP Lookup in Software Routers Author: Jin Zhao, Xinya Zhang, Xin Wang, Yangdong Deng, Xiaoming Fu.
An Efficient IP Address Lookup Algorithm Using a Priority Trie Authors: Hyesook Lim and Ju Hyoung Mun Presenter: Yi-Sheng, Lin ( 林意勝 ) Date: Mar. 11, 2008.
1 Author: Ioannis Sourdis, Sri Harsha Katamaneni Publisher: IEEE ASAP,2011 Presenter: Jia-Wei Yo Date: 2011/11/16 Longest prefix Match and Updates in Range.
IP Address Lookup for Internet Routers Using Balanced Binary Search with Prefix Vector Author: Hyesook Lim, Hyeong-gee Kim, Changhoon Publisher: IEEE TRANSACTIONS.
1 MIPS Extension for a TCAM Based Parallel Architecture for Fast IP Lookup Author: Oğuzhan ERDEM Cüneyt F. BAZLAMAÇCI Publisher: ISCIS 2009 Presenter:
Higher Order Tries Key = Social Security Number.   9 decimal digits. 10-way trie (order 10 trie) Height
1 Searching Very Large Routing Tables in Wide Embedded Memory Author: Jan van Lunteren Publisher: GLOBECOM 2001 Presenter: Han-Chen Chen Date: 2010/01/06.
1 Greedy Prefix Cache for IP Routing Lookups Author: Zhuo Huang, Gang Liu, Jih-Kwon Peir Publisher: I-SPAN 2009 Presenter: Hsin-Mao Chen Date:2010/03/10.
Beyond TCAMs: An SRAM-based Parallel Multi-Pipeline Architecture for Terabit IP Lookup Author: Weirong Jiang ViktorK.Prasanna Publisher: Infocom 08 Present:
1 Multi-Prefix Trie: a New Data Structure for Designing Dynamic Router-Tables Author: Sun-Yuan Hsieh Member, IEEE, Yi-Ling Huang, and Ying- Chi Yang Publisher:
1 Towards Green Routers: Depth- Bounded Multi-Pipeline Architecture for Power-Efficient IP Lookup Author: Weirong Jiang Viktor K. Prasanna Publisher: Performance,
Efficient IP-Address Lookup with a Shared Forwarding Table for Multiple Virtual Routers Author: Jing Fu, Jennifer Rexford Publisher: ACM CoNEXT 2008 Presenter:
1 Multi-Terabit IP Lookup Using Parallel Bidirectional Pipelines Author: Weirong Jiang Viktor K. Prasanna Publisher: ACM 2008 Presenter: Po Ting Huang.
Scalable IPv6 Lookup/Update Design for High-Throughput Routers Authors: Chung-Ho Chen, Chao-Hsien Hsu, Chen -Chieh Wang Presenter: Yi-Sheng, Lin ( 林意勝.
1 A Novel Scalable IPv6 Lookup Scheme Using Compressed Pipelined Tries Author: Michel Hanna, Sangyeun Cho, and Rami Melhem Publisher: NETWORKING 2011 Presenter:
An Efficient Hardware-based Multi-hash Scheme for High Speed IP Lookup Department of Computer Science and Information Engineering National Cheng Kung University,
Parallel IP Lookup using Multiple SRAM-based Pipelines Authors: Weirong Jiang and Viktor K. Prasanna Presenter: Yi-Sheng, Lin ( 林意勝 ) Date:
Performance Evaluation of IPv6 Packet Classification with Caching Author: Kai-Yuan Ho, Yaw-Chung Chen Publisher: ChinaCom 2008 Presenter: Chen-Yu Chaug.
Study of IP address lookup Schemes
1 A Fast IP Lookup Scheme for Longest-Matching Prefix Authors: Lih-Chyau Wuu, Shou-Yu Pin Reporter: Chen-Nien Tsai.
An Efficient IP Lookup Architecture with Fast Update Using Single-Match TCAMs Author: Jinsoo Kim, Junghwan Kim Publisher: WWIC 2008 Presenter: Chen-Yu.
1 HEXA: Compact Data Structures or Faster Packet Processing Author: Sailesh Kumar, Jonathan Turner, Patrick Crowley, Michael Mitzenmacher. Publisher: ICNP.
Fast binary and multiway prefix searches for pachet forwarding Author: Yeim-Kuan Chang Publisher: COMPUTER NETWORKS, Volume 51, Issue 3, pp , February.
Address Lookup in IP Routers. 2 Routing Table Lookup Routing Decision Forwarding Decision Forwarding Decision Routing Table Routing Table Routing Table.
1 Efficient packet classification using TCAMs Authors: Derek Pao, Yiu Keung Li and Peng Zhou Publisher: Computer Networks 2006 Present: Chen-Yu Lin Date:
PARALLEL TABLE LOOKUP FOR NEXT GENERATION INTERNET
IP Address Lookup Masoud Sabaei Assistant professor
LayeredTrees: Most Specific Prefix based Pipelined Design for On-Chip IP Address Lookups Author: Yeim-Kuau Chang, Fang-Chen Kuo, Han-Jhen Guo and Cheng-Chien.
Author: Haoyu Song, Fang Hao, Murali Kodialam, T.V. Lakshman Publisher: IEEE INFOCOM 2009 Presenter: Chin-Chung Pan Date: 2009/12/09.
CAMP: Fast and Efficient IP Lookup Architecture Sailesh Kumar, Michela Becchi, Patrick Crowley, Jonathan Turner Washington University in St. Louis.
1 Towards Practical Architectures for SRAM-based Pipelined Lookup Engines Author: Weirong Jiang, Viktor K. Prasanna Publisher: INFOCOM 2010 Presenter:
Author : Guangdeng Liao, Heeyeol Yu, Laxmi Bhuyan Publisher : Publisher : DAC'10 Presenter : Jo-Ning Yu Date : 2010/10/06.
1 Dynamic Pipelining: Making IP- Lookup Truly Scalable Jahangir Hasan T. N. Vijaykumar School of Electrical and Computer Engineering, Purdue University.
Higher Order Tries Key = Social Security Number.   9 decimal digits. 10-way trie (order 10 trie) Height
Compact Trie Forest: Scalable architecture for IP Lookup on FPGAs Author: O˘guzhan Erdem, Aydin Carus and Hoang Le Publisher: ReConFig 2012 Presenter:
1 Power-Efficient TCAM Partitioning for IP Lookups with Incremental Updates Author: Yeim-Kuan Chang Publisher: ICOIN 2005 Presenter: Po Ting Huang Date:
A Small IP Forwarding Table Using Hashing Yeim-Kuan Chang and Wen-Hsin Cheng Dept. of Computer Science and Information Engineering National Cheng Kung.
PARALLEL-SEARCH TRIE- BASED SCHEME FOR FAST IP LOOKUP Author: Roberto Rojas-Cessa, Lakshmi Ramesh, Ziqian Dong, Lin Cai Nirwan Ansari Publisher: IEEE GLOBECOM.
Author: Heeyeol Yu and Rabi Mahapatra
Author : N. Sertac Artan, Haowei Yuan, and H. Jonathan Chao Publisher/Conf : IEEE GLOBECOM 2008 Speaker : Chen Deyu Data :
Memory-Efficient IPv4/v6 Lookup on FPGAs Using Distance-Bounded Path Compression Author: Hoang Le, Weirong Jiang and Viktor K. Prasanna Publisher: IEEE.
Author : Yang Xu, Lei Ma, Zhaobo Liu, H. Jonathan Chao Publisher : ANCS 2011 Presenter : Jo-Ning Yu Date : 2011/12/28.
Author: Haoyu Song, Murali Kodialam, Fang Hao and T.V. Lakshman Publisher/Conf. : IEEE International Conference on Network Protocols (ICNP), 2009 Speaker:
Memory-Efficient and Scalable Virtual Routers Using FPGA Department of Computer Science and Information Engineering, National Cheng Kung University, Tainan,
Updating Designed for Fast IP Lookup Author : Natasa Maksic, Zoran Chicha and Aleksandra Smiljani´c Conference: IEEE High Performance Switching and Routing.
HIGH-PERFORMANCE LONGEST PREFIX MATCH LOGIC SUPPORTING FAST UPDATES FOR IP FORWARDING DEVICES Author: Arun Kumar S P Publisher/Conf.: 2009 IEEE International.
Parallel tree search: An algorithmic approach for multi- field packet classification Authors: Derek Pao and Cutson Liu. Publisher: Computer communications.
Evaluating and Optimizing IP Lookup on Many Core Processors Author: Peng He, Hongtao Guan, Gaogang Xie and Kav´e Salamatian Publisher: International Conference.
IP Routing table compaction and sampling schemes to enhance TCAM cache performance Author: Ruirui Guo a, Jose G. Delgado-Frias Publisher: Journal of Systems.
1 IP Routing table compaction and sampling schemes to enhance TCAM cache performance Author: Ruirui Guo, Jose G. Delgado-Frias Publisher: Journal of Systems.
Hierarchical packet classification using a Bloom filter and rule-priority tries Source : Computer Communications Authors : A. G. Alagu Priya 、 Hyesook.
Author : Masanori Bando, N. Sertac Artan and H. Jonathan Chao Publisher : Parallel & Distributed Processing, Workshops and Phd Forum (IPDPSW), 2010 Presenter.
Ofir Luzon Supervisor: Prof. Michael Segal Longest Prefix Match For IP Lookup.
SCALAR PREFIX SEARCH: A NEW ROUTE LOOKUP ALGORITHM FOR NEXT GENERATION INTERNET Author: Mohammad Behdadfar, Hossein Saidi, Hamid Alaei and Babak Samari.
IP Address Lookup Masoud Sabaei Assistant professor Computer Engineering and Information Technology Department, Amirkabir University of Technology.
Range Hash for Regular Expression Pre-Filtering Publisher : ANCS’ 10 Author : Masanori Bando, N. Sertac Artan, Rihua Wei, Xiangyi Guo and H. Jonathan Chao.
IP Routers – internal view
A Scalable Routing Architecture for Prefix Tries
Parallel Processing Priority Trie-based IP Lookup Approach
Packet Classification Using Coarse-Grained Tuple Spaces
Higher Order Tries Key = Social Security Number.
A Small and Fast IP Forwarding Table Using Hashing
A Semantic Peer-to-Peer Overlay for Web Services Discovery
A Trie Merging Approach with Incremental Updates for Virtual Routers
A SRAM-based Architecture for Trie-based IP Lookup Using FPGA
Towards TCAM-based Scalable Virtual Routers
Presentation transcript:

Author : Masanori Bando and H. Jonathan Chao Publisher : INFOCOM, 2010 Presenter : Jo-Ning Yu Date : 2011/02/16

  Introduction  FlashTrie  Prefix-Compressed Trie  HashTune  Membership Query Module  Lookup operation  Architecture  Update  Performance evaluation 2 Outline

  Tree Bitmap have been used in today’s high-end routers.  However, their large data structure often requires multiple external memory accesses for each route lookup.  In this paper, we propose a new IP route lookup architecture called FlashTrie that overcomes the shortcomings of the multibit-trie based approach.  We also develop a new data structure called Prefix-Compressed Trie that reduces the size of a bitmap by more than 80%.  We use a hash-based membership query to limit off-chip memory accesses per lookup to one and to balance memory utilization among the memory modules. 3 Introduction

  overview 4 FlashTrie Architecture

  Construction of the PC-Trie has two rules: 1.All sibling nodes must be filled with NHI if at least one node in the node set contains NHI. 2.The parents node set can be deleted if two child node sets exist. 5 Prefix-Compressed Trie (PC-Trie)

  The main difference between Tree Bitmap and our data structure is that a bit in the Tree Bitmap represents only one node, while a bit represents more than one node in PC-Trie. 6 Prefix-Compressed Trie (PC-Trie)

  Advantage  Tree Bitmap, the internal bitmap : 2 stride+1 −1 bits.  PC-Trie n, 2 (s+1−log(n)) −1 bits, s is the stride size (in bits).  For example, a PC-Trie 8 and 8-bit stride (n = 8, s = 8) :  Tree Bitmap = −1 = 2 9 −1 = 511 bits  PC-Trie = 2 (8+1−log(8)) −1 = 2 (9−3) −1 = 2 6 −1 = 63 bits  Drawback  NHI may need to be duplicated as the figure shows.  However, the number of memory slots needed for the NHI table is reduced.  Because the size of a NHI is so small that multiple NHI can be stored into a single memory slot. 7 Prefix-Compressed Trie (PC-Trie)

  The hash table has two different entries: 1.no-collision :  Least-Significant Bit of the hash table entry is set to “0”. 2.collision :  Least-Significant Bit of the hash table entry is set to “1”.  The collided items are stored in Black Sheep (BS) memory. 8 Membership Query Module

 1.no-collision :  If the Verify Bits are matched with the input IP address, then the Verify Bits plus the hash result becomes the PC-Trie address for the input IP address. 2.collision :  We have multiple on-chip BS memory modules that are accessed in parallel to void access one BS memory multiple times.  Based on simulation results, in the worst case  IPv4 : five BS memory modules are needed.  IPv6 : seven BS memory modules are needed. 9 Membership Query Module

 10 Membership Query Module

  We use HashTune as the hash function because it has a compact data structure and better memory utilization.  Advantages: 1.Key distribution is more uniform over the entire hash table. 2.The size of Verify Bits can be reduced. 11 HashTune

  The entire hash table is segmented into multiple small hash tables called groups and all groups have the same number of bins.  Each group is allowed to select a different hash function from the pool of hash functions.  The selected hash function ID is stored in a Hash ID Table, which is also stored in on-chip memory and used for query operations. 12 HashTune

  Each group is assigned an ID called Group ID and the ID is selected from LSBs of root nodes in each sub-trie.  For example :  Resolving 17 bits input and choosing 8 LSBs as the group ID gives us the remaining 9 bits to be stored as Verify Bits.  As a result, the Verify Bit size and on-chip memory requirements are reduced. 13 HashTune

  IPv4 Prefix Distribution from 2002 to  Sources from the RouteViews project, Oregon, USA. 14 Prefix Distribution  We select three levels based on the prefix distribution of figure.  They are IPv4/16 (16 Most-Significant Bits of an IPv4 address), IPv4/17 (MSB 17 bits to 24 bits), and IPv4/25 (MSB 25 bits to 32 bits).

  IPv6 Prefix Distribution.  Sources from the RouteViews project, Oregon, USA and expected future IPv6 routing tables. 15 Prefix Distribution  The number of routes increases every 8 bits, and we can observe major prefix lengths in that 8-bit region (e.g., /32, /40, and /48).  Thus, we also select stride size 8 for IPv6.

  The input 32-bit IPv4 address is categorized in IPv4/16, IPv4/17, and IPv4/25.  IPv4/16 is resolved using Direct Lookup (on-chip), and IPv4/17 and IPv4/25 are resolved using the Membership Query (on-chip) and PC-Trie (off-chip). 16 Lookup Operation

  FlashTrie Architecture for IPv4 17 Architecture Suppose “100,” is the input to this PC-Trie The contents of the bitmap is “1,” which means NHI exists for the input “100”. Since the LSB of the input is “0”, E is selected as the final NHI.  For PC-Trie n, more than one (log 2 ( n )) bit from the LSB of the input destination IP address is used to find the final NHI.

 18 Performance Evaluation 279, 117 routes stride size = 8 bits

 19 Performance Evaluation  Memory Requirements  On-Chip Memory: 1.Direct Lookup for up to /16. 2.Hash table for membership queries. 3.Hash ID table (storing a hash ID for each group). 4.Black Sheep memories for collided items in the hash table.

  Memory Requirements  Off-Chip Memory: 1.PC-Trie  sub-trie size : (stride size = 8 bits)  Tree Bitmap = 1063 bits (internal bitmap + external bitmap + pointers)  PC-Trie8 = 83 bits (PC-Trie bitmap + pointer) 20 Performance Evaluation

  Memory Requirements  Off-Chip Memory: 2.NHI 21 Performance Evaluation

  Memory Requirements  It is evident from the result that the reduction in bitmap size is more than 80% (for higher compression degree of PC-Trie). 22 Performance Evaluation

  Lookup Speed and Timing Analysis 23 Performance Evaluation