Design and Operation of Shared Machine Learning Clusters on Campus
Kaiqiang Xu, Decang Sun, Hao Wang, Zhenghang Ren, Xinchen Wan, Xudong Liao, Zilong Wang, Junxue Zhang, Kai Chen
In Proceedings of the International Conference on Architectural Support for Programming Languages and Operating Systems, 2025
(to appear)