Presentation


ACCL: FPGA-Accelerated Collectives over 100 Gbps TCP-IP

Session

H2RC: Seventh International Workshop on Heterogeneous High-Performance Reconfigurable Computing

Event Type

Workshop

Time

Monday, 15 November 2021, 10:30am - 11am CST

Location

231-232

Description

Collective operations such as scatter, gather, and reduce are widely used to implement distributed HPC applications and are the target of extensive optimization in all MPI implementations, as well as in dedicated collective libraries from accelerator vendors (e.g., NCCL and RCCL by NVIDIA and AMD, respectively). We present ACCL, an open-source FPGA-accelerated collectives library designed to serve applications running primarily on Xilinx FPGAs. Compared to previous collective communication solutions for FPGAs, ACCL is flexible and extensible, easily portable, and fast. We evaluate ACCL on up to 8 nodes and demonstrate that ACCL outperforms OpenMPI over 100 Gbps TCP-IP for large messages.

Author/Presenters

Zhenhao He

ETH Zürich

Daniele Parravicini

Research Labs Xilinx, Ireland

Lucian Petrica

Research Labs Xilinx, Ireland

Kenneth O’Brien

Research Labs Xilinx, Ireland

Gustavo Alonso

ETH Zürich

Michaela Blott

Research Labs Xilinx, Ireland
