High-Performance Computing in the Age of Machine Learning Interatomic Potentials: A Review of Optimization Strategies for Training and Inference
Abstract
As a prototypical AI-for-Science application, Machine Learning Interatomic Potentials (MLIPs) have revolutionized the representation of potential energy surfaces. MLIPs can be categorized into specialized MLIPs, which prioritize high accuracy for specific systems, and pretrained MLIPs, which emphasize generalizability across chemical spaces. The two categories differ in their training datasets, model capacity (parameter count), training workflows, and molecular dynamics workloads. We review the high-performance computing (HPC) techniques that specialized and pretrained MLIPs each tend to favor. For example, from the perspective of the training dataset, we investigate load-balancing strategies, which are critical for pretrained MLIPs to enhance scalability. From the perspective of model parameters, we show that specialized MLIPs can benefit from curvature-aware optimization algorithms given their moderate model size. We remark that advances in HPC are not merely engineering improvements but play a key role in faster MLIP iteration, broader applicability, and sustained progress in MLIP development.