Presentation

Networking Technologies for High-Performance Computing: Principles and Solutions
Description
InfiniBand (IB), High-speed Ethernet (HSE), RoCE, Omni-Path, EFA, Tofu, and Slingshot technologies are generating considerable interest for building next-generation High-End Computing (HEC) systems, including clusters, datacenters, filesystems, storage, cloud computing, Big Data (Spark), and AI (Deep Learning and Machine Learning) environments. This tutorial will provide an overview of these emerging technologies, their architectural features, their current market standing, and their suitability for designing HEC systems. It will start with a brief overview of IB, HSE, RoCE, Omni-Path, EFA, Tofu, and Slingshot. An in-depth overview will then be presented of the architectural features of IB, HSE (including iWARP and RoCE), and Omni-Path, their similarities and differences, and the associated protocols. An overview of the emerging NVLink2, NVSwitch, AMD Infinity Fabric, Slingshot, and Tofu architectures will also be given. Next, the OpenFabrics and Libfabric software stacks, which support a range of different interconnects, will be introduced. Hardware/software solutions and the market trends behind these networking technologies will be highlighted. Sample performance numbers for these technologies and protocols in different environments will be presented. Finally, attendees will carry out hands-on exercises to gain first-hand experience of running experiments with high-performance networks.
Event Type
Tutorial
Time
Monday, 13 November 2023
8:30am - 12pm MST
Location
201
Tags
Architecture and Networks
Distributed Computing
Registration Categories
TUT