OFED 1.2 Management Update, Hal Rosenstock (www.openfabrics.org)


OFED 1.2 Management Update Hal Rosenstock

2 OpenSM for OFED 1.2
- Release Info
  - git://git.openfabrics.org/~ofed_1_2/management.git
  - openib (OFED 1.2 rc3)
  - Currently used as basis for Peloton cluster
- New Functionality
- Bug Fixes

3 New Functionality
- Routing improvements
- SA optional record support “virtually” complete
- IB router enablement
- SA database dump/restore

4 Routing Improvements
- Performance improvements of over an order of magnitude
  - Min hop
  - Up/down
- New routing (pathing) algorithms
  - Fat Tree (Mellanox contribution)
  - LASH (Simula contribution)

5 Fat Tree Routing
- Optimizes routing for congestion free “Shift” communication pattern
- Deals with Fat Trees of various types
  - Symmetrical
  - Not just K-Ary-N-Trees
    - Non constant K
    - Not fully staffed
  - Any CBB (constant bisectional bandwidth) ratio
- Automatically detects whether the topology is a Fat Tree
- Provides
  - LFT tables assignment
  - MPI “rank” file of hosts
    - Can be used for creating topology-aware communication patterns
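The “Shift” pattern named on this slide is usually understood as: in stage k, every rank i sends to rank (i + k) mod N. The sketch below only generates that pattern and checks that each stage is a permutation (every rank sends once and receives once). The function names are hypothetical and are not part of OpenSM; whether a stage is actually congestion free on a fabric depends on the routing tables and on ranks being ordered to match the topology, which is what the generated rank file is meant to help with.

```python
def shift_stage(num_ranks, k):
    """Return the (source, destination) pairs of shift stage k."""
    return [(src, (src + k) % num_ranks) for src in range(num_ranks)]


def all_shift_stages(num_ranks):
    """Return every non-trivial shift stage, k = 1 .. num_ranks - 1."""
    return [shift_stage(num_ranks, k) for k in range(1, num_ranks)]


def is_permutation(pairs):
    """True if every rank appears exactly once as source and once as destination."""
    sources = sorted(src for src, _ in pairs)
    destinations = sorted(dst for _, dst in pairs)
    expected = list(range(len(pairs)))
    return sources == expected and destinations == expected


if __name__ == "__main__":
    for stage in all_shift_stages(4):
        print(stage, is_permutation(stage))
```

Because each stage is a permutation, no host is the target of two senders at once; any remaining contention can only come from links shared inside the fabric, which is the part the routing engine addresses.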

6 LASH – LAyered SHortest path
- All dependency cycles found over the physical links are broken by separating the involved routes using “virtual layers”.
- Within each layer, the routing function is deadlock free, but incomplete.
- By restricting packets to one virtual layer, the complete routing function across all layers remains deadlock free.
- Layers are not just a QoS issue! LASH can also be implemented with QoS.
- Deterministic, all packets follow shortest paths (can be extended to also support multipath routing).
- Origin:
  - 2002, Simula Research Laboratory, Oslo, Norway.
  - Tor Skeie, Olav Lysne

7 LASH – the method (roughly)
1. Calculate shortest paths between all source/destination pairs.
2. For each path, find a virtual layer i that the path can be assigned to without closing a dependency cycle in the (current) routing function for layer i. If no such layer can be found, create a new layer.
3. Once complete, lower numbered layers tend to be over-represented with paths, so a balancing stage is carried out to distribute an equal number of paths between the layers.
- The resulting algorithm is a deadlock free minimal path routing algorithm.
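Steps 1 and 2 can be sketched in a few lines of Python. This is an illustrative sketch under stated assumptions, not the OpenSM implementation: the topology is a connected graph given as an adjacency dict, one BFS shortest path is used per ordered pair, a "channel" is a directed link (u, v), and the balancing stage (step 3) is left out. All function names are hypothetical.

```python
from collections import deque


def shortest_path(adj, src, dst):
    """One BFS shortest path from src to dst, as a list of nodes."""
    parent = {src: None}
    queue = deque([src])
    while queue:
        node = queue.popleft()
        if node == dst:
            break
        for nxt in adj[node]:
            if nxt not in parent:
                parent[nxt] = node
                queue.append(nxt)
    path = []
    node = dst
    while node is not None:
        path.append(node)
        node = parent[node]
    return path[::-1]


def path_dependencies(path):
    """Channel dependencies of a path: each channel depends on the next one."""
    channels = list(zip(path, path[1:]))
    return list(zip(channels, channels[1:]))


def has_cycle(dep_graph):
    """Depth-first search for a cycle in a channel dependency graph."""
    state = {}  # channel -> "open" (on the DFS stack) or "done"

    def visit(channel):
        state[channel] = "open"
        for nxt in dep_graph.get(channel, ()):
            if state.get(nxt) == "open":
                return True
            if nxt not in state and visit(nxt):
                return True
        state[channel] = "done"
        return False

    return any(ch not in state and visit(ch) for ch in list(dep_graph))


def assign_layers(adj):
    """Assign every (src, dst) path to the first layer that stays acyclic."""
    layers = []      # one channel dependency graph per virtual layer
    assignment = {}  # (src, dst) -> layer index
    nodes = sorted(adj)
    for src in nodes:
        for dst in nodes:
            if src == dst:
                continue
            deps = path_dependencies(shortest_path(adj, src, dst))
            for index, layer in enumerate(layers):
                trial = {ch: set(nxt) for ch, nxt in layer.items()}
                for first, second in deps:
                    trial.setdefault(first, set()).add(second)
                if not has_cycle(trial):
                    layers[index] = trial
                    assignment[(src, dst)] = index
                    break
            else:
                # No existing layer accepts the path: open a new one.
                fresh = {}
                for first, second in deps:
                    fresh.setdefault(first, set()).add(second)
                layers.append(fresh)
                assignment[(src, dst)] = len(layers) - 1
    return assignment, layers


if __name__ == "__main__":
    ring = {i: [(i + 1) % 5, (i - 1) % 5] for i in range(5)}
    assignment, layers = assign_layers(ring)
    print(len(layers), "layers for a 5-node ring")
```

In this sketch a three-node line needs a single layer, while the five-node ring needs two, because the two-hop clockwise paths would otherwise close a dependency cycle around the ring.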

8 LASH – Status in OpenFabrics
- Added to OFED 1.2 branch as experimental in January ’07. Now transitioned from experimental.
- One upcoming commercial offering using OpenFabrics will employ LASH
- Further improvements required to bring number of layers down.
  - Mesh (any size) requires only 1 layer.
  - Torus 10x10 requires 4 layers for independent paths and 8 layers for double paths (return path in the same layer).
  - This can be improved and will scale.
  - man page has details on layer requirements
- The need for virtual layers is independent of the number of end nodes (HCAs); HCA does not need to support more than 1 VL
- LASH resource web page under development at Simula

9 Performance LASH versus Up/Down
- LASH avoids the congestion problem associated with the root node that is prevalent in Up*/Down* and supports minimal routing
- LASH requires the use of Virtual Layers
- Up*/Down* does not
Figure: throughput plot comparing the performance of LASH and Up*/Down*; 128 switches were interconnected as a mesh for the experiments.
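The root congestion mentioned here follows from the Up*/Down* rule itself: a path may take "up" links (towards the root) and then "down" links, but never an up link after a down link. A minimal sketch of that rule, assuming BFS distance from the root as the rank and node id as the tie-breaker (a common textbook formulation, not necessarily what OpenSM does); the function names are hypothetical.

```python
from collections import deque


def bfs_ranks(adj, root):
    """Rank of every node = hop distance from the chosen root."""
    rank = {root: 0}
    queue = deque([root])
    while queue:
        node = queue.popleft()
        for nxt in adj[node]:
            if nxt not in rank:
                rank[nxt] = rank[node] + 1
                queue.append(nxt)
    return rank


def is_up(rank, u, v):
    """Link u -> v is 'up' if v is closer to the root (ties: lower node id)."""
    return (rank[v], v) < (rank[u], u)


def is_legal_updown(rank, path):
    """A path is legal if it never takes an up link after a down link."""
    gone_down = False
    for u, v in zip(path, path[1:]):
        if is_up(rank, u, v):
            if gone_down:
                return False
        else:
            gone_down = True
    return True


if __name__ == "__main__":
    ring = {i: [(i + 1) % 6, (i - 1) % 6] for i in range(6)}
    rank = bfs_ranks(ring, root=0)
    print(is_legal_updown(rank, [2, 3, 4]))        # minimal path, away from the root
    print(is_legal_updown(rank, [2, 1, 0, 5, 4]))  # detour through the root
```

On this six-node ring the two-hop path 2, 3, 4 is rejected and the four-hop path through the root is accepted, which illustrates both the non-minimal routes and the traffic concentration at the root that LASH avoids.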

10 SA Optional Record Support
- InformInfo improvements
- InformInfoRecord, MulticastForwardingTableRecord, and SwitchInfoRecord added
- SMInfoRecord now supports all SMs
  - Not just local SM
- Missing ServiceAssociationRecord
  - Also, TraceRecord

11 IB Router Enablement
- Experimental
  - ROUTER_EXP not enabled in build by default
  - Much of IBA missing for routers
- Fix handling of router ports
- Support for off subnet GIDs in SA PathRecord
- Support for non link-local scope in MGID in SA MCMemberRecord

12 SA Database Dump/Restore
- SA registrations can be dumped/restored
  - Multicast
  - Services
  - Events
- opensm-sa.dump in /var/log by default
- -S option with dump file restores SA database
- If restoration successful, no client reregister

13 Additional New Functionality
- Socket support for console
- Log rotation while running
- Scope support in partition configuration for IPoIB multicast groups
- Option to force SDR link speed

14 Bug Fixes (since OFED 1.1)
- See OFED 1.2 OpenSM release notes for details
  - Also, for non compliances

15 Upcoming (beyond OFED 1.2)
- More routing performance improvements
  - Even more speedups
- Better packaging/installation
- “Native” daemon mode
- Performance management
- Quality of Service manager
  - Based on IBTA annex soon to be released

16 Needed
- Better IPv6 solicited node multicast (SNM) handling
  - Multiple groups share same MLID
- NodeDescription changed trap handling
- “Selected” IBA enhancements
- Handle local events?
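Why SNM handling matters can be seen from how the addresses are formed. Per RFC 4291, the solicited-node multicast address is ff02::1:ff00:0/104 with the low 24 bits of the unicast address appended, so hosts with different low 24 bits each land in their own multicast group. The sketch below only computes those addresses with Python's standard `ipaddress` module; the function names are hypothetical, and how the groups are mapped to MLIDs is outside its scope.

```python
import ipaddress

# Fixed 104-bit prefix of every solicited-node multicast address (RFC 4291).
SNM_PREFIX = int(ipaddress.IPv6Address("ff02::1:ff00:0"))


def solicited_node_multicast(addr):
    """Return the solicited-node multicast address for a unicast IPv6 address."""
    unicast = int(ipaddress.IPv6Address(addr))
    return ipaddress.IPv6Address(SNM_PREFIX | (unicast & 0xFFFFFF))


def count_snm_groups(addresses):
    """Number of distinct solicited-node groups a set of hosts would join."""
    return len({solicited_node_multicast(addr) for addr in addresses})


if __name__ == "__main__":
    print(solicited_node_multicast("4037::1:800:200e:8c6c"))
    print(count_snm_groups(["fe80::1:1", "fe80::1:2", "fe80::2:1:2"]))
```

With one group per distinct low-24-bit value, a large fabric can end up with close to one multicast group per host, which is the motivation for letting multiple groups share the same MLID.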

17 Futures
- Many things
- More improvements
  - Core
  - Routing algorithms
- Continued improvements in Stability and Scalability
  - More tests and testing
  - Larger cluster experience
- What do you think is needed?
- What would you like to see added?

18 Diagnostics
- Many improvements since OFED 1.1
  - Covered in DoE tools talk
- ibdiagui
  - GUI for ibdiagnet
    - Used at SC06
  - Mellanox contribution
  - Part of ibutils package
    - git://git.openfabrics.org/ofed_1_2/ibutils.git

19 ibdiagui

20 Related
- ibsim
  - OpenSM and OpenIB diags work unmodified on this
  - Uses ibnetdiscover format for topology
  - Voltaire contribution
  - Not part of OFED 1.2
  - git://git.openfabrics.org/~sashak/ibsim.git

21 Thank You

22 Backup

23 Other technology from Simula
- MRoots
  - Use multiple Up*/Down* trees, each with its own root in a different layer. Reduces root congestion problem
- LASH-TOR
  - Transition Orientated LASH, an extension to reduce the number of virtual channels required for LASH by using transitions between virtual layers
- FRoots
  - Fault tolerant routing using layers to ensure fabric stays connected in the face of a fault. This works and could be implemented for InfiniBand
- Please contact Tor Skeie or Olav Lysne for further information
- Simula Research Laboratory is a state funded research lab that conducts basic research in the fields of communication technology, scientific computing and software engineering. Simula focuses on fundamental scientific problems with a large potential for important applications in society.