HAppA: A Modular Platform for HPC Application Resilience Analysis with LLMs Embedded

Published in 2024 43rd International Symposium on Reliable Distributed Systems (SRDS), 2024

Recommended citation: Jiang, H., Zhu, J., Fang, B., & Guan, Q. (2024). HAppA: A Modular Platform for HPC Application Resilience Analysis with LLMs Embedded. In 2024 43rd International Symposium on Reliable Distributed Systems (SRDS).
Download Paper