No Travel? No Problem.

Remote Participation
CAKE: Matrix Multiplication Using Constant-Bandwidth Blocks
Event Type
Paper
Tags
Algorithms
Parallel Programming Systems
Reproducibility Badges
Registration Categories
TP
TimeThursday, 18 November 20211:30pm - 2pm CST
Location227-228
DescriptionWe offer a novel approach to matrix-matrix multiplication computation on computing platforms with memory hierarchies. Constant-bandwidth (CB) blocks improve computation throughput for architectures limited by external memory bandwidth. Configuring the shape and size of CB blocks operating from within any memory hierarchy level (e.g., internal SRAM), we achieve high throughput while holding external bandwidth (e.g., with DRAM) constant. We explain how, surprisingly, CB blocks can maintain constant external bandwidth as computation throughput increases. Analogous to partitioning a cake into pieces, we dub our CB-partitioned system CAKE.

We show CAKE outperforms state-of-the-art libraries in computation time on real-world systems where external bandwidth represents a bottleneck, demonstrating CAKE's ability to address the memory wall. CAKE achieves superior performance by directly using theoretically optimal CB-partitioned blocks in tiling and scheduling, obviating the need for extensive design search.
Back To Top Button