1 of 20 Smart-NICs: Power Proxying for Reduced Power Consumption in Network Edge Devices Karthikeyan Sabhanatarajan, Ann Gordon-Ross +, Mark Oden, Mukund.

Slides:



Advertisements
Similar presentations
1 Ports and IPv6. 2 Ports Transmission Control Protocol (TCP) or the User Datagram Protocol (UDP), used for communication Generally speaking, a computer.
Advertisements

Somniloquy*: Augmenting Network Interfaces to Reduce PC Energy Usage Ranveer Chandra (Microsoft Research) Yuvraj Agarwal, Rajesh Gupta (UCSD), Victor Bahl,
Traffic Management - OpenFlow Switch on the NetFPGA platform Chun-Jen Chung( ) SriramGopinath( )
Chapter 14 TCP/IP and Routing Part #1 Unix System Administration.
10 - Network Layer. Network layer r transport segment from sending to receiving host r on sending side encapsulates segments into datagrams r on rcving.
EE 122: Router Design Kevin Lai September 25, 2002.
Introduction To Networking
Introduction to Management Information Systems Chapter 5 Data Communications and Internet Technology HTM 304 Fall 07.
Chapter 2 Internet Protocol DoD Model Four layers: – Process/Application layer – Host-to-Host layer – Internet layer – Network Access layer.
Review on Networking Technologies Linda Wu (CMPT )
Embedded Transport Acceleration Intel Xeon Processor as a Packet Processing Engine Abhishek Mitra Professor: Dr. Bhuyan.
Chapter 9 Classification And Forwarding. Outline.
CECS 474 Computer Network Interoperability Tracy Bradley Maples, Ph.D. Computer Engineering & Computer Science Cal ifornia State University, Long Beach.
Chapter 4 Queuing, Datagrams, and Addressing
Support Protocols and Technologies. Topics Filling in the gaps we need to make for IP forwarding work in practice – Getting IP addresses (DHCP) – Mapping.
Semester 1 Module 8 Ethernet Switching Andres, Wen-Yuan Liao Department of Computer Science and Engineering De Lin Institute of Technology
Computer Science and Engineering Computer System Security CSE 5339/7339 Session 24 November 11, 2004.
Networking Virtualization Using FPGAs Russell Tessier, Deepak Unnikrishnan, Dong Yin, and Lixin Gao Reconfigurable Computing Group Department of Electrical.
1 Internetworking : Internet architecture and TCP/IP Protocol TK3133 Computer Networking Technology.
TCP/IP Reference Model For more notes and topics visit: eITnotes.com.
Introduction1-1 Data Communications and Computer Networks Chapter 5 CS 3830 Lecture 27 Omar Meqdadi Department of Computer Science and Software Engineering.
Introduction to Networking. Key Terms packet  envelope of data sent between computers server  provides services to the network client  requests actions.
Lecture 3 Review of Internet Protocols Transport Layer.
Traffic Management - OpenFlow Switch on the NetFPGA platform Chun-Jen Chung( ) Sriram Gopinath( )
Lect1..ppt - 01/06/05 CDA 6505 Network Architecture and Client/Server Computing Lecture 2 Protocols and the TCP/IP Suite by Zornitza Genova Prodanoff.
I. Basic Network Concepts. I.1 Networks Network Node Address Packet Protocol.
RiceNIC: A Reconfigurable and Programmable Gigabit Network Interface Card Jeff Shafer, Dr. Scott Rixner Rice Computer Architecture:
Fall 2005Computer Networks20-1 Chapter 20. Network Layer Protocols: ARP, IPv4, ICMPv4, IPv6, and ICMPv ARP 20.2 IP 20.3 ICMP 20.4 IPv6.
High-Level Interconnect Architectures for FPGAs Nick Barrow-Williams.
NETWORKING COMPONENTS AN OVERVIEW OF COMMONLY USED HARDWARE Christopher Johnson LTEC 4550.
Vladimír Smotlacha CESNET Full Packet Monitoring Sensors: Hardware and Software Challenges.
© 2007 Cisco Systems, Inc. All rights reserved.Cisco Public ITE PC v4.0 Chapter 1 1 Ethernet Network Fundamentals – Chapter 9.
CHAPTER 5 TCP/IP PROTOCOLS. P ROTOCOL STANDARDS Protocols are formal rules of behavior When computers communicate, it is necessary to define a set of.
Sem1 - Module 8 Ethernet Switching. Shared media environments Shared media environment: –Occurs when multiple hosts have access to the same medium. –For.
Chapter 2 Protocols and the TCP/IP Suite 1 Chapter 2 Protocols and the TCP/IP Suite.
StrideBV: Single chip 400G+ packet classification Author: Thilan Ganegedara, Viktor K. Prasanna Publisher: HPSR 2012 Presenter: Chun-Sheng Hsueh Date:
4/19/20021 TCPSplitter: A Reconfigurable Hardware Based TCP Flow Monitor David V. Schuehler.
BZUPAGES.COM Presentation on TCP/IP Presented to: Sir Taimoor Presented by: Jamila BB Roll no Nudrat Rehman Roll no
Department of Computer Science and Engineering Applied Research Laboratory Architecture for a Hardware Based, TCP/IP Content Scanning System David V. Schuehler.
A Resource Efficient Content Inspection System for Next Generation Smart NICs Karthikeyan Sabhanatarajan, Ann Gordon-Ross* The Energy Efficient Internet.
Computer Science and Engineering Copyright by Hesham El-Rewini Advanced Computer Architecture CSE 8383 April 11, 2006 Session 23.
Real-Time Performance Analysis of Adaptive Link Rate Baoke Zhang, Karthikeyan Sabhanatarajan, Ann Gordon-Ross*, Alan D. George* This work was supported.
High-Speed Policy-Based Packet Forwarding Using Efficient Multi-dimensional Range Matching Lakshman and Stiliadis ACM SIGCOMM 98.
An Efficient Gigabit Ethernet Switch Model for Large-Scale Simulation Dong (Kevin) Jin.
Linux Operations and Administration Chapter Eight Network Communications.
1 of 20 Low Power and Dynamic Optimization Techniques for Power-Constrained Domains Ann Gordon-Ross Department of Electrical and Computer Engineering University.
An Efficient Gigabit Ethernet Switch Model for Large-Scale Simulation Dong (Kevin) Jin.
Cisco Confidential © 2013 Cisco and/or its affiliates. All rights reserved. 1 Cisco Networking Training (CCENT/CCT/CCNA R&S) Rick Rowe Ron Giannetti.
Data Communications and Computer Networks Chapter 4 CS 3830 Lecture 19 Omar Meqdadi Department of Computer Science and Software Engineering University.
The OSI Model. Understanding the OSI Model In early 1980s, manufacturers began to standardize networking so that networks from different manufacturers.
Address Resolution Protocol Yasir Jan 20 th March 2008 Future Internet.
Wrapping up subnetting, mapping IPs to physical ports BSAD 146 Dave Novak Sources: Network+ Guide to Networks, Dean 2013.
IP - Internet Protocol No. 1  Seattle Pacific University IP: The Internet Protocol Kevin Bolding Electrical Engineering Seattle Pacific University.
1 Computer Networks Chapter 5. Network layer The network layer is concerned with getting packets from the source all the way to the destination. Getting.
CIS 700-5: The Design and Implementation of Cloud Networks
Scaling the Network: The Internet Protocol
Part I. Overview of Data Communications and Networking
What the OSI Protocol Layers Do
ISO/OSI Model and Collision Domain
Reference Router on NetFPGA 1G
Net 431: ADVANCED COMPUTER NETWORKS
Net 323: NETWORK Protocols
I. Basic Network Concepts
Networking Theory (part 2)
Scaling the Network: The Internet Protocol
IP Control Gateway (IPCG)
Reference Router on NetFPGA 1G
ECE 671 – Lecture 8 Network Adapters.
Network Basics and Architectures Neil Tang 09/05/2008
Networking Theory (part 2)
Presentation transcript:

1 of 20 Smart-NICs: Power Proxying for Reduced Power Consumption in Network Edge Devices Karthikeyan Sabhanatarajan, Ann Gordon-Ross +, Mark Oden, Mukund Navada, Alan D. George + High Performance Computing and Simulation Research Laboratory Department of Electrical and Computer Engineering University of Florida, Gainesville This work was supported by the U.S. National Science Foundation + Also Affiliated with NSF Center for High- Performance Reconfigurable Computing

2 of 20 2 Introduction INTERNET

3 of 20 3 Introduction Connected edge devices account for 2% of the total power consumed in the US [EPA- 06] –130 TWh/Year This is $1.3 $.10 per kWh 1 single-unit nuclear power plant outputs 8 TWh/Year Translates to 16 single-unit nuclear power plants! Why so much power? –PCs can consume up to 200 W –1 billion PCs worldwide by 2010 [Kanellos-04] What can we do? –PCs are idle 75% of the time [Purushothaman-06] –But only 10% of PCs are allowed to sleep during that time [EPA-06] –Sleeping reduces power consumption by 80% or more –If PCs were allowed to sleep, only 3 single-unit nuclear power plants would be required Question: Why aren’t these PCs asleep?!?!

4 of 20 4 Maintaining Network Connectivity INTERNET IDLE GNUTELLA FILE SHARING APPLICATION FILE QUERY PACKET FILE RESPONSE PACKET Bob Alice Alice checks to see if Bob has a file needed for p2p file sharing Z Z z z FILE QUERY PACKET Problem: PC must be awake to maintain network connectivity

5 of 20 5 A Solution – Power Proxying Primary challenge is to maintain network connectivity while the PC is power down to standby mode - sleeping Some packets do not require a complex response –Automated responses are sufficient –Network Interface Card (NIC) can act as proxy for the PC –Allow the PC to sleep while NIC services packets with automated responses –A technique known as power proxying –We call such a NIC a “Smart”-NIC - SNIC

6 of 20 6 Power Proxying INTERNET IDLE GNUTELLA FILE SHARING APPLICATION Alice Bob Z Z z z PC delegates power to the SNIC to handle to network traffic FILE QUERY PACKET FILE RESPONSE PACKET

7 of 20 7 Power Proxying INTERNET IDLE Proxiable Packet Response Z Z z z Chatter Packet Non-Proxiable/Wake up Packet SNIC Response Bob

8 of 20 8 What to Proxy? - Proxiable Protocols Proxiable protocols - Network protocols amenable to proxying –Responses may be automated –Keep alive packets, IP conflict avoidance, etc. Z Z z z IDLE FOUR Categories of Proxiable Packets ARP QUERY ARP RESPONSE PING PING RESPONSE P2P FILE QUERY P2P RESPONSE Mail Notification ARP (Address Resolution Protocol) ICMP (Internet Control Message Protocol) TCP (Transmission Control Protocol) UDP (User Datagram Protocol)

9 of 20 9 Response Power Proxying Operation z z z IDLE SNIC Packet Classifier Application Handler 1. PC decides to sleep 2. PC offloads power proxy rules to the SNIC 3. PC sleeps and SNIC proxy is activated Rules 4. Packet Arrives Rules source addr source port dest port ?= Match? No (not chatter) 7(a) Wake up PC 7(b) Discard No (chatter) Yes 7(c) Invoke app handler Payload Header 6. Rule checking 5. Header inspection Payload Header App ID 8. Determine response ? 9. Proxyied Response SW HW or SW? source addr source port dest port

10 of Packet Classifier Requirements PC-BASED CLASSIFIER ROUTER-BASED CLASSIFIER 3) Operates only during system inactivity 3) Continual operation 4) Process packets addressed only to a particular destination and Broad/MultiCast packets 4) Process packets to any destination 5) Limited processing resources - processors clocked in MHz 5) Processors clocked in GHz range 1) Must sustain link rates of 10/100/1000/10000 Mbps 1) Must sustain link rates of 10/100/1000/10000 Mbps 2) No packet loss allowed 6) Limited number of rules directly depend on number of proxiable applications running 6) Larger number rules with a wide complexity range 7) Packets match only one rule - rules are disjoint 7) Packets can match multiple rules

11 of Packet Classifier - SW vs. HW Software Classifier Hardware Classifier 1) Limited operating frequency between 66 MHz to 400 MHz 1) Custom hardware can be designed for the required frequency 2) Cannot meet the network throughput demands even for the fastest packet classification algorithms 2) Can easily meet the network throughput demands 3) High power even during idle period 3) Comparatively lower power

12 of Rules Header Processor Header Processor Incoming Packet (From MAC Core) Packet Class Application ID Source Port CAM Dest Port CAM Match Match Address Address Match ID MultiMatch Source Port Source IPDest Port Custom HW Packet Classification Source IP Address CAM Source Port Source Port CAM Source Port CAM Dest Port CAM Dest Port CAM Invokes application handler OR MultiMatch Source IP Address CAM Source IP Address CAM

13 of Packet Classifier Placement From PHY Packet Classifier Packet Descriptor FIFO Tx FIFO Rx FIFO MAC Core uP Application Handler Response No change to critical path

14 of Experimental Setup Software packet classifier –Implemented on RiceNIC platform using PowerPC405 RiceNIC is a programmable NIC –PowerPC clocked at 300 MHz and 100 MHz Hardware packet classifier –Xilinx IP cores to generate CAMs as block memory –Prototyped in Verilog HDL –System implemented and simulated using Xilinx ISE 9.1 and ModelSIM –Clocked at 1.25 MHz, 12.5 MHz, and 125 MHz corresponding to 10 Mbps, 100 Mbps, and 1000 Mbps –Power calculated using Xilinx XPower

15 of Results – Packet Classification Time Hardware classification outperforms software classification at 300 MHz and 100 MHz Worst-case packet classification time for each protocol class with 100 rules Increasing packet classification complexity

16 of Results – Classification Time Vs Rules As more applications are identified as proxyiable, rule set sizes will increase Thus scalability is important Software Logarithmic Constant Hardware

17 of Results – Packet Throughput Throughput is measured in Millions of Packets Per Second (MPPS) Minimum throughput for 1Gpbs Software cannot meet requirements! Hardware exceeds Gbps throughput

18 of Results – HW Speedup vs. SW 9x speedup to 2.5x speedup

19 of Results – Power Consumption SW classifier is 2.4x more power than HW –SW = mW and 441 mW for 100 MHz and 300 MHz respectively –HW = 180 mW for 100 rules. Link rate scalability –For SW to meet 1 Gpbs throughput Clocked at 500 MHz Require an additional 294 mW of power Resulting in 4x more power than HW

20 of Conclusions PCs consume a lot of power –Left powered on to maintain network connectivity Introduced power proxying –SNIC maintains network connectivity so PC can sleep –Can increase sleep time by 85% [Purushothamom-06] Low-power hardware-based packet classifier to enable power proxying –Exceeds Gigabit Ethernet throughput requirement –Up to 9x speedup in packet classification time over a software packet classifier –75% less power than a software packet classifier –Better scalability with respect to future rule set size and link rates than a software packet classifier