OCP Accelerator Module

Slides:



Advertisements
Similar presentations
Flexible I/O in a Rigid World
Advertisements

Supermicro © 2009Confidential 06/01/2009 Supermicro GPU Server Solutions SYS-7046T-GRF SYS-6016T-GF-TM2 SYS-6016T-GF-TC2 SYS-6016T-GF SYS-6016T-XF.
1 of 24 The new way for FPGA & ASIC development © GE-Research.
Digital RF Stabilization System Based on MicroTCA Technology - Libera LLRF Robert Černe May 2010, RT10, Lisboa
ESODAC Study for a new ESO Detector Array Controller.
CS 213 Commercial Multiprocessors. Origin2000 System – Shared Memory Directory state in same or separate DRAMs, accessed in parallel Upto 512 nodes (1024.
1 Tell10 Guido Haefeli, Lausanne Electronics upgrade meeting 10.February 2011.
The Limits of Switch Bandwidth
New PSB Beam Control A. Blas, P. Leinonen, J. Sanchez-Quesada Working group meeting 27/08/ Clock distribution 2.Motherboard to daughter cards connectors.
Mahesh Wagh Intel Corporation Member, PCIe Protocol Workgroup.
HARDWARE OPAL-RT MARC PASTOR Real-Time 2009 Montreal, Quebec, Canada.
CSE 291 High Performance Interconnect Fall 2012 University of California, San Diego Course Information Instructor CK Cheng,
“ Analyzer for 40Gbit Ethernet “ (Bi-semestrial project) Executers: פריד מחאג ' נה Farid Mahajna Husam Kadan חוסאם קעדאן Instructor:
Silicon Building Blocks for Blade Server Designs accelerate your Innovation.
1 5. Application Examples 5.1. Programmable compensation for analog circuits (Optimal tuning) 5.2. Programmable delays in high-speed digital circuits (Clock.
LAN Switching and Wireless – Chapter 1 Vilina Hutter, Instructor
Looking Ahead: A New PSU Research Cloud Architecture Chuck Gilbert - Systems Architect and Systems Team Lead Research CI Coordinating Committee Meeting.
March 9, 2015 San Jose Compute Engineering Workshop.
Slide ‹Nr.› l © 2015 CommAgility & N.A.T. GmbH l All trademarks and logos are property of their respective holders CommAgility and N.A.T. CERN/HPC workshop.
Questions on IFPAC_SCHEMATIC. Signal Chain Preamplifier Compensation Capacitor should go to –Vs, not GND Where is resistor For compensation Network? Does.
Serial Data Link on Advanced TCA Back Plane M. Nomachi and S. Ajimura Osaka University, Japan CAMAC – FASTBUS – VME / Compact PCI What ’ s next?
2-PAD Digital Beamformer Chris Shenton11 th October PAD Digital Beamformer Chris Shenton 11 th October 2007.
Proposal for the assembly of the PHOBOS ring counters (H.P. 3/20/98) I would like to propose and discuss with you an alternative layout and assembly procedure.
Connector Differential Receiver 8 Channels 65 MHz 12 bits ADC FPGA Receive/buffer ADC data Format triggered Events Generate L1 Primitives Receive timing.
Correlator Options for 128T MWA Cambridge Meeting Roger Cappallo MIT Haystack Observatory
Trigger Hardware Development Modular Trigger Processing Architecture Matt Stettler, Magnus Hansen CERN Costas Foudas, Greg Iles, John Jones Imperial College.
Pickering’s Experience of LXI Switching Applications David Owen Business Development Manager, Pickering Interfaces
RapidIO based Low Latency Heterogeneous Supercomputing Devashish Paul, Director Strategic Marketing, Systems Solutions
29/05/09A. Salamon – TDAQ WG - CERN1 LKr calorimeter L0 trigger V. Bonaiuto, L. Cesaroni, A. Fucci, A. Salamon, G. Salina, F. Sargeni.
Seonyeong Heo Dept. of CSE, POSTECH
Oracle & HPE 3PAR.
Modular Electronics an overview of modern standards
LHCb and InfiniBand on FPGA
Flexible I/O in a Rigid World
PRAD DAQ System Overview
A Survey of Data Center Network Architectures By Obasuyi Edokpolor
Subject Name: Optical Fiber Communication Subject Code: 10EC72
OCP: High Performance Computing Project
HCAL Data Concentrator Production Status
OCP NIC 3.0 Discussion Updated New
SCSI over PCI Express (SOP) use cases
6 September 2017 OCP NIC subgroup teleconference
Design Package Contribution for Project Olympus US1-EPYC
OCP Mezz 3.0 Mechanical Recommendations
Low Latency Analytics HPC Clusters
Dr. Jeffrey M. Harris Director of Research and System Architecture
Thermal Model Description
Thermal Simulation Discussion
QCT Rackgo X Yosemite V2 2018/08/22.
OCP Mezzanine NIC 3.0 Discussion
NVMe SSD Form Factor Challenges
Anne Pratoomtong ECE734, Spring2002
OCP Mezzanine NIC 3.0 Discussion
OCP NIC 3.0 Discussion - Presentation to Server WG / IC
Comments on OCP Mezz v3 & Connector/Pinout Proposal
NIC v3.0 Mechanical Discussion
OCP Mezzanine NIC 3.0 Discussion
Challenges Implementing Complex Systems with FPGA Components
OCP NIC 3.0 Discussion - status update to community
OCP Mezzanine NIC 3.0 Discussion
Mechanical Discussion
Mechanical Discussion
Shenghsun Cho, Mrunal Patel, Han Chen, Michael Ferdman, Peter Milder
OCP Mezzanine NIC 3.0 Discussion
Updated Thermal Model Description
Mechanical Discussion
Dylan Lang VITA 57.4 FMC+ Tutorial Dylan Lang
Thermal Simulation Discussion
FIGURE 5-1 MOS Transistor, Symbols, and Switch Models
OCP Engineering Workshop Rack & Power, Advanced Cooling Solutions, Data Center Facility
Presentation transcript:

OCP Accelerator Module Whitney Zhao, Hardware Engineer, Facebook Siamak Tavallaei, Principal Architect, Microsoft (OCP Server Project co-Chair)

AI’s rapid evolution is producing an explosion of new types of hardware accelerators for ML and Deep Learning GPU FPGA ASIC NPU TPU NNP xPU…

Different Implementations targeting similar requirements!

Common Requirements Power & Cooling Robustness & Serviceability Configuration, Programming, & Management Inter-module Communication to Scale Up Input / Output Bandwidth to Scale Out

PCIe CEM Form Factor ?

PCIe CEM Form Factor is not it!

PCIe AIC Multi Cards in system Multi links High Interconnect BW needed between Card to Card Too much insertion loss from ASIC to HS Connectors in PCIe Form Factor Inter-Card Cabling is difficult and limited in supported topologies

Mezzanine Module High-density Connectors for input/output Links Low insertion loss  high-speed interconnect Enough space for Accelerators and associated local logic & power Flexible for heatsink design for air-cooled & liquid cooling Flexible inter-Module interconnect topologies

Mezzanine Module(accelerators) Baseboard (interconnect topology) Need Interoperable Mezzanine Module(accelerators) Baseboard (interconnect topology) Tray (serviceability) Chassis (scale-out deployment)

Module proposal Molex Mirror Mezz connector, 344 differential pairs for two connectors. Good for up to 8X16 links(Host + interconnect) Support up to 56Ghz NRZ Module width 104mm Module length could be flexible(>132mm). 165mm is recommended for both disaggregated design or HPC all in one server design

Module Power Support both 12V or 48V as input 12V to support up to 200w TDP 48V to support up to 500w TDP Power pin map: on Conn 0

Baseboard Design Examples Microsoft HGX-1

Baseboard Design Examples Facebook Big Basin

Inviting feedback to build the requirement set

Participate & Collaborate at OCP Server Project