Towards Hyperscale HPC & RDMA

Slides:



Advertisements
Similar presentations
Cloud computing is used to describe a variety of computing concepts that involve a large number of computers connected through a real-time communication.
Advertisements

The Development of Mellanox - NVIDIA GPUDirect over InfiniBand A New Model for GPU to GPU Communications Gilad Shainer.
Agile Infrastructure built on OpenStack Building The Next Generation Data Center with OpenStack John Griffith, Senior Software Engineer,
Module 1: Demystifying Software Defined Networking Module 2: Realizing SDN - Microsoft’s Software Defined Networking Solutions with Windows Server 2012.
RDS and Oracle 10g RAC Update Paul Tsien, Oracle.
IWARP Update #OFADevWorkshop.
Copyright © 2005 Department of Computer Science 1 Solving the TCP-incast Problem with Application-Level Scheduling Maxim Podlesny, University of Waterloo.
RoCEv2 Update from the IBTA
Dynamically Scaling Applications in the Cloud Presented by Paul.
Hybrid Hyper-scale Enterpris e Grade Azure compute regions.
Application Layer 2-1 Chapter 2 Application Layer Computer Networking: A Top Down Approach 6 th edition Jim Kurose, Keith Ross Application Layer – Lecture.
Congestion Control Tanenbaum 5.3, /12/2015Congestion Control (A Loss Based Technique: TCP)2 What? Why? Congestion occurs when –there is no reservation.
A Scalable, Commodity Data Center Network Architecture Mohammad Al-Fares, Alexander Loukissas, Amin Vahdat Presented by Gregory Peaker and Tyler Maclean.
Jennifer Rexford Princeton University MW 11:00am-12:20pm Data-Center Traffic Management COS 597E: Software Defined Networking.
Hybrid Hyper-scale Enterpris e Grade Azure compute regions.
WORKFLOWS IN CLOUD COMPUTING. CLOUD COMPUTING  Delivering applications or services in on-demand environment  Hundreds of thousands of users / applications.
Practical TDMA for Datacenter Ethernet
Barracuda Load Balancer Server Availability and Scalability.
Application Layer 2-1 Chapter 2 Application Layer Computer Networking: A Top Down Approach 6 th edition Jim Kurose, Keith Ross Addison-Wesley March 2012.
Copyright 2009 Fujitsu America, Inc. 0 Fujitsu PRIMERGY Servers “Next Generation HPC and Cloud Architecture” PRIMERGY CX1000 Tom Donnelly April
Detail: Reducing the Flow Completion Time Tail in Datacenter Networks SIGCOMM PIGGY.
CustomerSegment and workloads Your Datacenter Active Directory SharePoint SQL Server.
Towards a Common Communication Infrastructure for Clusters and Grids Darius Buntinas Argonne National Laboratory.
Distributed Visualization and Data Resources Enabling Remote Interaction and Analysis Scott A. Friedman UCLA Institute for Digital.
CS 381 Final Exam Study Guide Final Exam Date: Tuesday, May 12 th Time: 10:30am -12:30pm Room: SB 105 Exam aid: 8 ½ x 11 page of notes front and back.
Windows Azure Conference 2014 Deploy your Java workloads on Windows Azure.
Extreme-scale computing systems – High performance computing systems Current No. 1 supercomputer Tianhe-2 at petaflops Pushing toward exa-scale computing.
Liam Newcombe BCS Data Centre Specialist Group Secretary Modelling Data Centre Energy Efficiency and Cost.
Cloud Scale Performance & Diagnosability Comprehensive SDN Core Infrastructure Enhancements vRSS Remote Live Monitoring NIC Teaming Hyper-V Network.
CON Software-Defined Networking in a Hybrid, Open Data Center Krishna Srinivasan Senior Principal Product Strategy Manager Oracle Virtual Networking.
Example: Sorting on Distributed Computing Environment Apr 20,
March 9, 2015 San Jose Compute Engineering Workshop.
A.SATHEESH Department of Software Engineering Periyar Maniammai University Tamil Nadu.
© BLADE Network Technologies, © FASTER VIRTUAL PROVEN Cloud Ready Network Open Products, Shipping Now.
© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice. HP Update IDC HPC Forum.
(MRC) 2 These slides are not approved for public release Resilient high-dimensional datacenter 1 Control Plane: Controllers and Switches.
CLOUD COMPUTING AND LESSONS FROM THE PAST Presented By Sanjana Malhotra.
Presented by: Xianghan Pei
Ethernet Congestion Management Brad Booth, Intel September 2004.
Revisiting Transport Congestion Control Jian He UT Austin 1.
What is Flexpod? Flexpod is a reference architecture for server, storage and networking components that are pretested and validated to work together as.
ICTCP: Incast Congestion Control for TCP in Data Center Networks By: Hilfi Alkaff.
“Your application performance is only as good as your network” (4)
GENI Enabled Software Defined Exchange (SDX) and ScienceDMZ (SD-SDMZ)
Learn how the cloud is accelerating network transformation
Balazs Voneki CERN/EP/LHCb Online group
Latency and Communication Challenges in Automated Manufacturing
CIS 700-5: The Design and Implementation of Cloud Networks
VPN Extension Requirements for Private Clouds
5/3/2018 3:51 AM Memory Efficient Loss Recovery for Hardware-based Transport in Datacenter Yuanwei Lu1,2, Guo Chen2, Zhenyuan Ruan1,2, Wencong Xiao2,3,
HyGenICC: Hypervisor-based Generic IP Congestion Control for Virtualized Data Centers Conference Paper in Proceedings of ICC16 By Ahmed M. Abdelmoniem,
Microsoft SharePoint Server 2016
11/13/ :11 PM Memory Efficient Loss Recovery for Hardware-based Transport in Datacenter Yuanwei Lu1,2, Guo Chen2, Zhenyuan Ruan1,2, Wencong Xiao2,3,
Key concepts covered in Midterm III
File Transfer Issues with TCP Acceleration with FileCatalyst
CLUSTER COMPUTING.
AMP: A Better Multipath TCP for Data Center Networks
RDMA over Commodity Ethernet at Scale
Cloud Computing: Concepts
Beyond FTP & hard drives: Accelerating LAN file transfers
TCP Overview.
Computer Networks Topic :User datagram protocol Transmission Control Protocol -Hemashree S( )
In-network datacenter computing draft-he-coin-datacenter-00
Cloud Computing What is it ? Why use it ? Enablers Pros and Cons
Microsoft Virtual Academy
Testing LOSSLESS Network for RDMA
IETF Sidemeeting: Large Scale Data Center HPC/RDMA
Productive + Hybrid + Intelligent + Trusted
SQL Server 2016 High Performance Database Offer.
Presentation transcript:

Towards Hyperscale HPC & RDMA Paul Congdon (Tallac/Huawei) paul.congdon@tallac.com IETF-104 HotRFC

Current HPC/RDMA networks “Future datacenters of all kinds will be built like high performance computers,” said Nvidia CEO Jensen Huang  Traditionally, HPC runs over custom lossless technologies Infiniband Link Layer Credit-based Flow Control More recently designed to run over IP infrastructure iWARP (IETF RFC 5040 – RFC 5044, RFC 6580, RFC 6581, RFC 7306) RoCEv2 (https://www.infinibandta.org/) The results produced by these networks are mainstream through the integration of artificial intelligence, machine learning, data analytics and data science workloads

RoCEv3 ??? Separate Network, Not Ethernet/IP Not Route-able, L2 Data Center, Complex L2 Congestion Control (QCN) Incomplete Congestion Control, reliance on L2 PFC Unspecified TCP tweaks, TCP HW NIC, Slow Start

What does it mean to be Hyperscale The term “hyperscale” refers to a computer architecture’s ability to scale in order to respond to increasing demand. Goals Common cloud scale infrastructure Dynamic and automated provisioning Diverse workload mix Low latency, high throughput Suggestions have been made to scale RDMA/HPC RDMA over commodity Ethernet at scale, SIGCOMM 2016 iWARP Redefined: Scalable Connectionless Communication over High-Speed Ethernet, 2010 International Conference on High Performance Computing Tuning ECN for Data Center Networks, CoNEXT '12 Revisiting Network Support for RDMA, SIGCOMM 2018 https://datatracker.ietf.org/doc/draft-chen-iccrg-rocev3-cm-requirements/ RoCEv3 = Improved retransmission strategy Improved congestion control mechanism (RTT, credit, ECN) Finer grain load balancing with looser re-ordering requirements

What if scenarios for Hyperscale HPC What if networks didn’t have to be lossless, but just very low loss? What if iWARP was run over Enhanced UDP instead of TCP? What if congestion management was fully defined for RDMA? Can we hyperscale HPC? Side Meeting: Monday 10AM Room: Tyrolka