SC21 Proceedings

The International Conference for High Performance Computing, Networking, Storage, and Analysis

Exploiting User Activeness for Data Retention in HPC Systems

Authors: Wei Zhang (Texas Tech University), Suren Byna (Lawrence Berkeley National Laboratory (LBNL)), Hyogi Sim (Virginia Tech), SangKeun Lee (Oak Ridge National Laboratory (ORNL)), Sudharshan Vazhkudai (Micron Technology Inc), and Yong Chen (Texas Tech University)

Abstract: HPC systems typically rely on the fixed-lifetime (FLT) data retention strategy, which only considers temporal locality of data accesses to parallel file systems. Our extensive analysis based on the leadership-class HPC system traces, however, suggests that the FLT approach often fails to capture the dynamics in users' behavior and leads to undesired data purge. In this study, we propose an activeness-based data retention (ActiveDR) solution, which advocates considering the data retention approach from a holistic activeness-based perspective. By evaluating the frequency and impact of users' activities, ActiveDR prioritizes the file purge process for inactive users and rewards active users with extended file lifetime on parallel storage. Our extensive evaluations based on the traces of the prior Titan supercomputer show that, when reaching the same purge target, ActiveDR achieves up to 37% file miss reduction as compared to the current FLT retention methodology.

Presentation: file

Back to Technical Papers Archive Listing