Www.openfabrics.org InfiniBand Routing in OFA Jason Gunthorpe – Obsidian Sean Hefty – Intel Hal Rosenstock – Voltaire.

Slides:



Advertisements
Similar presentations
The subnet /28 has been selected to be further subnetted to support point-to-point serial links. What is the maximum number of serial links.
Advertisements

Neighbor Discovery for IPv6 Mangesh Kaushikkar. Overview Introduction Terminology Protocol Overview Message Formats Conceptual Model of a Host.
Transitioning to IPv6 April 15,2005 Presented By: Richard Moore PBS Enterprise Technology.
1 IPv6. 2 Problem: 32-bit address space will be completely allocated by Solution: Design a new IP with a larger address space, called the IP version.
IPv4 & IPv6 Coexistence & Migration Joe Zhao SW2 Great China R&D Center ZyXEL Communications, Inc.
© 2007 Cisco Systems, Inc. All rights reserved.Cisco Public 1 Addressing the Network – IPv4 Network Fundamentals – Chapter 6.
© 2007 Cisco Systems, Inc. All rights reserved.Cisco Public ITE PC v4.0 Chapter 1 1 Addressing the Network – IPv4 Network Fundamentals – Chapter 6.
1 © 2004, Cisco Systems, Inc. All rights reserved. Chapter 3 Ethernet Technologies/ Ethernet Switching/ TCP/IP Protocol Suite and IP Addressing.
Multi-Layer Switching Layers 1, 2, and 3. Cisco Hierarchical Model Access Layer –Workgroup –Access layer aggregation and L3/L4 services Distribution Layer.
RoCEv2 Update from the IBTA
Oct 21, 2004CS573: Network Protocols and Standards1 IP: Addressing, ARP, Routing Network Protocols and Standards Autumn
1 CCNA 2 v3.1 Module 8. 2 TCP/IP Suite Error and Control Messages CCNA 2 Module 8.
Layer 2: Redundancy and High Availability Part 1: General Overview on Assignment 1.
1 Chapter Overview IP (v4) Address IPv6. 2 IPv4 Addresses Internet Protocol (IP) is the only network layer protocol with its own addressing system and.
IB ACM InfiniBand Communication Management Assistant (for Scaling) Sean Hefty.
資 管 Lee Lesson 11 Coexistence and Migration. 資 管 Lee Lesson Objectives Coexistence and migration overview Coexistence mechanisms ◦ Dual Stack ◦ Tunneling.
© 2007 Cisco Systems, Inc. All rights reserved.Cisco Public 1 Addressing the Network – IPv4 Network Fundamentals – Chapter 6.
Copyright © 2007 InfiniBand ® Trade Association. Other names and brands are properties of their respective owners. IB Cross-Subnet Communication OpenFabrics.
Network Redundancy Multiple paths may exist between systems. Redundancy is not a requirement of a packet switching network. Redundancy was part of the.
OFA-IWG - March 2010 OFA Interoperability Working Group Update Authors: Mikkel Hagen, Rupert Dance Date: 3/15/2010.
Chapter 4: Managing LAN Traffic
1/29/2002 CS Distributed Systems 1 Infiniband Architecture Aniruddha Bohra.
© 2008 Cisco Systems, Inc. All rights reserved.Cisco ConfidentialPresentation_ID 1 Chapter 6: Static Routing Routing Protocols.
InfiniBand Routing Solution Approach Yaron Haviv, CTO, Voltaire
Scalable name and address resolution infrastructure -- Ira Weiny/John Fleck #OFADevWorkshop.
ECE 526 – Network Processing Systems Design Networking: protocols and packet format Chapter 3: D. E. Comer Fall 2008.
High Availability through the Linux bonding driver
Addressing IP v4 W.Lilakiatsakun. Anatomy of IPv4 (1) Dotted Decimal Address Network Address Host Address.
Scalable RDMA Software Solution Sean Hefty Intel Corporation.
Campus Networking Best Practices Hervey Allen NSRC & University of Oregon Dale Smith University of Oregon & NSRC
CCNP Network Route IPV-6 Part-I IPV6 Addressing: IPV-4 is 32-BIT, IPV-6 is 128-BIT IPV-6 are divided into 8 groups. Each is 4 Hex characters. Each group.
1 RFC Transmission of IPv6 Packets over IEEE Networks Speaker: Li-Wen Chen Date:
© 1999, Cisco Systems, Inc. 1-1 Chapter 2 Overview of a Campus Network © 1999, Cisco Systems, Inc.
CSC 600 Internetworking with TCP/IP Unit 7: IPv6 (ch. 33) Dr. Cheer-Sun Yang Spring 2001.
Björn Landfeldt School of Information Technologies NETS 3303 Networked Systems Multicast.
© 2006 Cisco Systems, Inc. All rights reserved. BSCI v3.0—8-1 Implementing IPv6 Implementing Dynamic IPv6 Addresses.
Network Layer4-1 Datagram networks r no call setup at network layer r routers: no state about end-to-end connections m no network-level concept of “connection”
InfiniBand Routers Ian Colloff : QLogic LWG Co-Chair.
CCNA 1 v3.0 Module 9 TCP/IP Protocol Suite and IP Addressing
OFED 1.3 InfiniBand Management Update Hal Rosenstock.
W&L Page 1 CCNA CCNA Training 2.7 Configure and verify trunking on Cisco switches Jose Luis Flores / Amel Walkinshaw Aug, 2015.
OFED 1.2 Management Update Hal Rosenstock.
+ Routing Concepts 1 st semester Objectives  Describe the primary functions and features of a router.  Explain how routers use information.
IP Over InfiniBand Working Group Management Information Bases 55th IETF Atlanta Sean Harnedy InfiniSwitch Corporation
Range Extension Sonoma’06 David Southwell, Jason Gunthorpe – Obsidan Linden Mercer – Naval Research Labs Bill Boas – OpenIB/ SC|05.
OpenFabrics Developers Summit SC06 QoS Update and Implementation RFC Eitan Zahavi, Mellanox Technologies Nov 2006.
Chapter 8: IP Addressing
Quality of Service Support Dror Goldenberg - Mellanox Sean Hefty – Intel.
TCP/IP Protocol Suite and IP Addressing Presented By : Dupien AMS.
Network Layer IP Address.
Mobility support in IP v4. Internet Computing (CS-413) 2.
Atrium Router Project Proposal Subhas Mondal, Manoj Nair, Subhash Singh.
1 CMPT 471 Networking II Multicasting © Janice Regan,
IP Over InfiniBand Working Group Management Information Bases
IP: Addressing, ARP, Routing
Infiniband Architecture
Chapter 6 Exploring IPv6.
Support for Flow bindings in MIPv6 and NEMO
Fabric Interfaces Architecture – v4
Simple Connectivity Between InfiniBand Subnets
ECSE-6600: Internet Protocols
Mobility And IP Addressing
RoCEE in OFED Update Liran Liss, Mellanox Technologies March 15, 2010
شبکه هاي کامپيوتري فصل پنجم: لايه شبکه (NetworkLayer)
Routing and Switching Essentials v6.0
Anup K.Talukdar B.R.Badrinath Arup Acharya
16EC Computer networks unit II Mr.M.Jagadesh
Computer Networks Protocols
OSI Network Layer Presented By Dr. Waleed Alseat Mutah University.
Multicasting Unicast.
Presentation transcript:

InfiniBand Routing in OFA Jason Gunthorpe – Obsidian Sean Hefty – Intel Hal Rosenstock – Voltaire

What Works Prototype wire-speed 2 port Obsidian router:  SC|06 XNET demo with Qlogic and Mellanox  Non-CM RDMA flows AFCEA|07 Obsidian demo with Rackable:  Unicast IPoIB traffic between two subnets Longbow XR Optical Fiber Host Subnet B Host Subnet A Two Port Router

Problem Areas QP Lid Matching IB CM Multipath / APM IPoIB Multicast Scalability RDMA CM Addressing Router / SA Communication Link Flow Control

QP LID Matching C9-57 requires QP to verify LRH:SLID/DLID Mixes OSI layers 2 (LID), 3 (GID) & 4 (QPN) Major problem for LMC > 0 or multiple routers Eliminate matching?  May break existing HW/FW Most Pressing Issue

QP LID Matching Router LMC=1 CA DLID=2 DLID=3 QP3 Forward DLID=3,SLID=1 DGID=B QP2 Forward DLID=2,SLID=1 DGID=B Router LID=3 Router LID=4 CA A CA B Return path with mismatched router SLID Return path requires SLID=3 for QP3 and SLID=2 for QP2 QP4 Forward DLID=3,SLID=1 DGID=B QP4 Return DLID=1,SLID=4 DGID=A

IB CM Spec requires active side select paths  Must learn passive side path  Specify active & passive side LIDs  4 paths in total Passive side path carried in REQ Requires inter-subnet coordination May require protocol changes to avoid How does the passive side obtain LIDs?

Multipath / APM Routers required to produce different LRHs to same port  Must be predictable and based on GRH  Use DGID, FL, TC fields to select LRH CM / SA must know GRH to LRH mappings APM must select paths that are independent  Harder if APM failover is between routers Needs Specification

Multipath / APM Router CA Good Primary/Secondary Bad Primary Router CA Bad Path uses all switches/routers Fails completely if any link fails

IPoIB Currently uses link local scope for multicast groups  Prevents crossing routers Need this configurable per interface Inter-subnet multicast groups need to agree on parameters Scalability issues  IPv6 solicited node multicast  IPv4 ARP broadcast IB routers likely to provide IP routing for scalability

Multicast Scalability Which MC groups must an SA know about? RFC 4391 (sec 10) solution for IPoIB scalability  Interaction with native IB apps? All routers MC group not native IB concept How can this be optimized? Uncertainty on SA, router & IPoIB MC interaction

RDMA CM Addressing Unscalable to span IPoIB across routers RDMA CM uses ARP to learn remote GID  Limited to single IPoIB subnet Expand RDMA CM beyond IPoIB subnet Use GID addressing with IPv6 DNS/etc? Discover GIDs without using ARP?

Router / SA Communication Unicast and multicast routing protocols Router to host or SA prefix advertisement Inter-subnet coordination  PKey, TClass (QoS), SA services Multicast memberships Least Pressing Issues Needs Specification

Link Flow Control Implementing in routers can lead to dead lock  Depends on per-subnet routing, not routers No flow control leads to packet loss  Even small loss affects IB RC performance Need Solution Router Traffic to router and traffic from router on same VL/Link Form half a network Cycle.

Final Thoughts IB intra-subnet traffic has centralized control within the SM IB inter-subnet needs to be decentralized to scale well Retaining the unique features of IB will require different approaches from Ethernet/IP

Go Forward Work-arounds to allow more testing Software router for experimentation?  Linux, commodity HCAs Device Implementers: Follow Specs More IBTA Specifications Needed GMPs can have GRHs Path records can return global paths