SC21 Proceedings

The International Conference for High Performance Computing, Networking, Storage, and Analysis

LCCG: A Locality-Centric Hardware Accelerator for High Throughput of Concurrent Graph Processing


Authors: Jin Zhao, Yu Zhang, and Xiaofei Liao (Huazhong University of Science and Technology); Ligang He (University of Warwick); Bingsheng He (National University of Singapore); and Hai Jin and Haikun Liu (Huazhong University of Science and Technology)

Abstract: In modern data centers, massive concurrent graph processing jobs are being processed on large graphs. However, existing hardware/- software solutions suffer from irregular graph traversal and intense resource contention. In this paper, we propose LCCG, a Locality-Centric programmable accelerator that augments the many-core processor for achieving higher throughput of Concurrent Graph processing jobs. Specifically, we develop a novel topology-aware execution approach into the accelerator design to regularize the graph traversals for multiple jobs on-the-fly according to the graph topology, which is able to fully consolidate the graph data accesses from concurrent jobs. By reusing the same graph data among more jobs and coalescing the accesses of the vertices’ states for these jobs, LCCG can improve the core utilization. We conduct extensive experiments on a simulated 64-core processor. The results show that LCCG improves the throughput of the cutting-edge software system by 11.3-23.9 times with only 0.5% area cost.


Presentation: file


Back to Technical Papers Archive Listing