SC’13 BoF Discussion
Sean Hefty, Intel Corporation

SC’13 BoF Feedback
Most people I’ve spoken with are highly supportive of the concepts being discussed
  – General alignment
Active discussion at SC’13
  – Engagement is high
Good participation
Want even wider focus outside of MPI
  – Diverse viewpoints are helping to drive the conversation
More likely to produce a better product
Need to understand all concerns
Address as many concerns as reasonable

Main Concerns from BoF
Application-centric APIs may require new protocols
  – Must ensure wire compatibility
RDMA CM provides a simpler abstraction over verbs
  – How does this effort differ?
Some applications want low-level access
  – Technology-specific apps

Evolving OFA Software from the Viewpoint of libibverbs
Analyze libibverbs and librdmacm details
  – Selected issues
  – Previously covered fast path call overhead
Identify specific areas for improvement
Abstract the improvements to create a higher-performing, more extensible framework

Current libraries
[Diagram labels: librdmacm (verb helpers); libibverbs; verbs provider; InfiniBand verbs interface; user-to-kernel ‘command’ interface]

libibverbs
[Diagram labels: libibverbs; verbs provider; InfiniBand verbs interface; user-to-kernel ‘command’ interface]
Two interfaces are exported: one for applications and one for providers
Application interface is based on the IB-defined hardware interface
Fast path calls go directly to the provider (inline, not exported)
Data structures expose internal implementation details

libibverbs Structures
struct ibv_qp
  – pthread objects (mutex, cond)
  – internal counter
  – internal handle
  – qp data (state, qpn, type)
  – 5 data pointers
  – user context
struct ibv_cq
  – pthread objects (mutex, cond)
  – internal counters
  – internal handle
  – 2 data pointers
  – user context
Callouts:
  – Exposes internal implementation
  – IB-specific QP data: not useful for iWARP, not needed with RDMA CM
  – Data pointers may not be valid or needed by the app
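
One way to read the "exposes internal implementation" concern is as an argument for opaque handles. The following is a minimal sketch, not libibverbs or libfabric code: every name in it (`demo_cq`, `demo_cq_open`, `demo_cq_context`, `demo_cq_close`) is hypothetical. It shows an object whose layout is hidden behind a forward declaration, so counters and handles can change without affecting applications, while the user context stays reachable through an accessor.

```c
#include <assert.h>
#include <stdint.h>
#include <stdlib.h>

/* ---- Public header: what an application would see (hypothetical API) ---- */
struct demo_cq;                                   /* opaque: layout is not part of the API */
struct demo_cq *demo_cq_open(void *user_context);
void *demo_cq_context(const struct demo_cq *cq);
void demo_cq_close(struct demo_cq *cq);

/* ---- Library side: free to change without breaking applications ---- */
struct demo_cq {
    void *user_context;         /* the only field applications care about */
    uint32_t handle;            /* internal handle, hidden from applications */
    uint32_t events_completed;  /* internal counter, hidden from applications */
};

struct demo_cq *demo_cq_open(void *user_context)
{
    /* calloc zero-initializes the internal fields */
    struct demo_cq *cq = calloc(1, sizeof(*cq));
    if (cq)
        cq->user_context = user_context;
    return cq;
}

void *demo_cq_context(const struct demo_cq *cq)
{
    assert(cq != NULL);
    return cq->user_context;
}

void demo_cq_close(struct demo_cq *cq)
{
    free(cq);
}
```

In a real library the two halves would live in separate files; they share one listing here only to keep the sketch self-contained.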

libibverbs Structures
struct ibv_context
  – struct ibv_device *device
  – struct ibv_context_ops ops;
  – int cmd_fd;
  – …
Callouts:
  – ops: provider-implemented functions
  – Embedded structure – fixed size! Adding new calls is non-trivial; no mechanism to do so upstream
  – Provider functions are associated with an opened device: cannot optimize a call based on use; branches in the code path and larger data structures
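
The two callouts about the operations table can be illustrated with a small sketch. This is an assumption about what "associate functionality with specific objects" could look like, not the framework's actual design; all names (`demo_ep`, `demo_ep_ops`, `demo_send`, `demo_ep_init`) are hypothetical. The table is referenced by pointer rather than embedded, so it can grow, and each object can be given a function set chosen for how it will be used.

```c
#include <assert.h>
#include <stddef.h>

struct demo_ep;

/* Operations table referenced by pointer: it can grow or be swapped per object. */
struct demo_ep_ops {
    size_t size;  /* size of the table the provider filled in */
    int (*send)(struct demo_ep *ep, const void *buf, size_t len);
};

struct demo_ep {
    const struct demo_ep_ops *ops;  /* per-object, not per-device */
    size_t bytes_sent;
    int inline_sends;
};

/* General-purpose path. */
static int send_generic(struct demo_ep *ep, const void *buf, size_t len)
{
    (void)buf;
    ep->bytes_sent += len;
    return 0;
}

/* Path specialized for small messages; no run-time branch is needed to select it. */
static int send_inline(struct demo_ep *ep, const void *buf, size_t len)
{
    (void)buf;
    ep->bytes_sent += len;
    ep->inline_sends++;
    return 0;
}

static const struct demo_ep_ops generic_ops = { sizeof(struct demo_ep_ops), send_generic };
static const struct demo_ep_ops inline_ops  = { sizeof(struct demo_ep_ops), send_inline };

/* The provider picks the function set once, when the object is created. */
static void demo_ep_init(struct demo_ep *ep, int small_messages_only)
{
    ep->ops = small_messages_only ? &inline_ops : &generic_ops;
    ep->bytes_sent = 0;
    ep->inline_sends = 0;
}

/* Fast path wrapper: one indirect call, no per-call "which mode?" test. */
static inline int demo_send(struct demo_ep *ep, const void *buf, size_t len)
{
    return ep->ops->send(ep, buf, len);
}
```

The `size` field is one common way to let a caller detect which entries a given table provides; whether the proposed framework uses it is not stated in the slides.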

librdmacm
Original APIs
  – Create ID, resolve address, resolve route
  – Connect, listen, accept, reject
Newer APIs
  – Getaddrinfo, create endpoint
Verb helpers
  – Register memory
  – Send/receive, RDMA read/write, UD send
Callouts:
  – No-op for iWARP and RoCE
  – Scalability issues
  – Required for UD
  – Addresses scalability and ease of use
  – Simplified calls, but limited by verbs API

Evolving ‘Verbs’
Merge libibverbs and librdmacm
Eliminate duplicate functionality
Remove internal implementation details from the API
  – Abstract data structures
Associate functionality with specific objects
  – Enable optimized code paths
Abstract objects for extensibility
  – Enable other use cases
These are goals of the proposed framework

BoF Concern
Application-centric APIs may require new protocols
  – Must ensure wire compatibility
Transport protocol is exposed by the framework
API definitions and protocol definitions are separate
Vendor-specific protocols are already being defined outside of verbs
  – PSM, MXM, FCA
TBD: discuss related protocols
  – Address resolution, route lookup, connection setup

BoF Concern
RDMA CM provides a simpler abstraction over verbs
  – How does this effort differ?
RDMA CM abstractions are limited by the functions exported by verbs
  – Calls still filter down to a single post send/recv call
RDMA CM is highly dependent on verbs, but maintained separately
  – RDMA CM use is required for iWARP
RDMA CM functionality is carried forward

BoF Concern
Some applications want low-level access
  – Technology-specific apps
Low-level access will still be possible
Focus is on ease of use (librdmacm model) with high performance (libibverbs fast path)

(Scalable) Fabric Interfaces
Q: What is implied by incorporating interface sets under a single framework?
Objects exist that are usable between the interfaces
Isolated interfaces turn the framework into a complex dlopen
Interfaces are composable
May be used together
Cohesive interfaces, not merely a union
[Diagram labels: Fabric Interfaces; Control Interface; Message Queue; RDMA; Atomics; Active Messaging; Tag Matching; Collective Operations; CM Services]

Migrating Providers from Verbs to FI
[Diagram labels: libfabric; FI; CM Services; Message Queue; RDMA; RDMA CM; libibverbs; Verbs; ibverbs abstraction layer; Verbs Provider]
Providers can integrate into the framework with minimal changes
Providers leverage the abstraction layer and RDMA CM capabilities
Add optimized data transfer and completion processing calls to avoid translation overhead

Migrating Apps from Verbs to FI
Expose ‘verbs’ interfaces directly from FI
Use macros to convert ‘libibverbs’ exported calls to FI calls
Or layer libibverbs over libfabric
Applications must recompile
Minimal benefit to app
Dependent on accessed data structures and interfaces
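
The macro approach mentioned above might look roughly like the sketch below. It uses hypothetical stand-in types and names throughout (`demo_qp`, `demo_send_wr`, `demo_sge`, `demo_fi_ep`, `demo_fi_send`, `demo_post_send`); none of them are the real libibverbs or libfabric definitions. A legacy-style call is rewritten at compile time into a call on the new interface, which is why applications would need to recompile.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical stand-in for the new interface. */
struct demo_fi_ep {
    size_t posted;  /* number of sends accepted */
};

static int demo_fi_send(struct demo_fi_ep *ep, const void *buf, size_t len)
{
    (void)buf;
    (void)len;
    ep->posted++;
    return 0;
}

/* Hypothetical stand-ins for legacy verbs-style types. */
struct demo_sge {
    const void *addr;
    size_t length;
};

struct demo_send_wr {
    struct demo_sge *sg_list;
    int num_sge;
};

struct demo_qp {
    struct demo_fi_ep ep;  /* in this sketch the legacy QP simply wraps the new endpoint */
};

/*
 * Source-compatibility macro: the legacy-looking call compiles into the new
 * call. Only the single-SGE case is translated; anything else returns -1.
 */
#define demo_post_send(qp, wr)                                      \
    ((wr)->num_sge == 1                                             \
         ? demo_fi_send(&(qp)->ep, (wr)->sg_list[0].addr,           \
                        (wr)->sg_list[0].length)                    \
         : -1)
```

The translation only works when the application stays within what the macro covers, which mirrors the slide's point that the result depends on the data structures and interfaces the application accesses.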

Migrating Apps from Verbs to FI
Define ‘verbs’ compatibility mode
  – Allows mapping objects between interfaces
  – E.g. QP → fabric socket, CQ → EC
  – Restricts implementation
  – Mapping must be documented
Identified by protocol
Allow software to adopt new interfaces selectively
  – E.g. send/recv/ec_read
[Diagram labels: libfabric; FI; FI Provider; libibverbs; verbs; Verbs Provider; Dual-Provider Library]