SC21 Proceedings

The International Conference for High Performance Computing, Networking, Storage, and Analysis

Production Deployment of Machine-Learned Rotorcraft Surrogate Models on HPC


Workshop:7th Workshop on Machine Learning in High Performance Environment

Authors: Wesley Brewer (General Dynamics Information Technology)


Abstract: We explore how to optimally deploy different types of machine-learned surrogate models used in rotorcraft aerodynamics on HPC. We first developed three different rotorcraft models at three different orders of magnitude (2M, 44M, and 212M trainable parameters) to use as test models. We tested three different types of inference server deployments: (1) a Flask-based HTTP inference server, (2) TensorFlow Serving with gRPC protocol, and (3) RedisAI server with RESP protocol. We investigated deployments on both DoD HPCMP's SCOUT and DoE OLCF's Summit POWER9 supercomputers, demonstrated the ability to inference a million samples per second using 192 GPUs, and studied multiple scenarios on both Nvidia T4 and V100 GPUs. We studied a range of concurrency levels both on the client-side and the server-side, and provide optimal configuration advice based on the type of deployment. Finally, we provide a simple Python-based framework for benchmarking machine-learned surrogate models using the various inference servers.


Website:






Back to 7th Workshop on Machine Learning in High Performance Environment Archive Listing



Back to Full Workshop Archive Listing