Full Publications

The publications are listed in reverse chronological order.

* indicates equal contribution and # indicates corresponding author.

    Year
    2025

  1.        arXiv       

    Enhancing Token Filtering Efficiency in Large Language Model Training with Collider

    Di Chai, Pengbo Li, Feiyuan Zhang, Yilun Jin, Han Tian, Junxue Zhang#, Kai Chen

    arXiv:2502.00340 [cs.LG], 2025

    paper | arXiv
  2.        arXiv       

    FLASH-FHE: A Heterogeneous Architecture for Fully Homomorphic Encryption Acceleration

    Junxue Zhang, Xiaodian Cheng, Gang Cao, Meng Dai, Yijun Sun, Han Tian, Dian Shen, Yong Wang, Kai Chen

    arXiv:2501.18371 [cs.AR], 2025

    paper | arXiv
  3.        arXiv       

    Swift: Rethinking RDMA Control Plane for Elastic Computing

    Junxue Zhang, Han Tian, Xinyang Huang, Wenxue Li, Kaiqiang Xu, Dian Shen, Yong Wang, Kai Chen

    arXiv:2501.19051 [cs.NI], 2025

    paper | arXiv
  4.    SIGCOMM   

    CEIO: A Cache-Efficient Network I/O Architecture for NIC-CPU Data Paths

    Bowen Liu, Xinyang Huang, Qijing Li, Zhuobin Huang, Yijun Sun, Wenxue Li, Junxue Zhang, Ping Yin, Kai Chen

    In Proceedings of the ACM SIGCOMM 2025 Conference, 2025

    (To appear)
  5.    SIGCOMM   

    Revisiting RDMA Reliability for Lossy Fabrics

    Wenxue Li, Xiangzhou Liu, Yunxuan Zhang, Zihao Wang, Wei Gu, Tao Qian, Gaoxiong Zeng, Shoushou Ren, Xinyang Huang, Zhenghang Ren, Bowen Liu, Junxue Zhang, Bingyang Liu, Kai Chen

    In Proceedings of the ACM SIGCOMM 2025 Conference, 2025

    (To appear)
  6.         ATC        

    Towards Optimal Rack-scale μs-level CPU Scheduling through In-Network Workload Shaping

    Xudong Liao, Han Tian, Xinchen Wan, Chaoliang Zeng, Hao Wang, Junxue Zhang, Mengyu Ma, Guyue Liu, Kai Chen

    In Proceedings of the USENIX Annual Technical Conference, 2025

    (To appear)
  7.         ATC        

    Accelerating Distributed Graph Learning by Using Collaborative In-Network Multicast and Aggregation

    Zhaoyi Li, Jiawei Huang, Yijun Li, Jingling Liu, Junxue Zhang, Hui Li, Xiaojun Zhu, Shengwen Zhou, Jing Shao, Xiaojuan Lu, Qichen Su, Jianxin Wang, Chee Wei Tan, Yong Cui, Kai Chen

    In Proceedings of the USENIX Annual Technical Conference, 2025

    (To appear)
  8.       NSDI      

    GREEN: Carbon-efficient Resource Scheduling for Machine Learning Clusters

    Kaiqiang Xu, Decang Sun, Han Tian, Junxue Zhang, Kai Chen

    In Proceedings of the 22nd USENIX Symposium on Networked Systems Design and Implementation, 2025

    paper | usenix page
  9.     ASPLOS    

    Design and Operation of Shared Machine Learning Clusters on Campus

    Kaiqiang Xu, Decang Sun, Hao Wang, Zhenghang Ren, Xinchen Wan, Xudong Liao, Zilong Wang, Junxue Zhang, Kai Chen

    In Proceedings of the International Conference on Architectural Support for Programming Languages and Operating Systems, 2025

    paper | acm dl
  10.     EuroSys    

    eNetSTL: Towards an In-kernel Library for High-Performance eBPF-based Network Functions

    Bin Yang, Dian Shen#, Junxue Zhang#, Hanlin Yang, Lunqi Zhao, Beilun Wang, Guyue Liu, Kai Chen

    In Proceedings of the 20th European Conference on Computer Systems, 2025

    paper | acm dl | supplementary materials | code

    Merged into Coolbpf of OpenAnolis (龙蜥社区)

  11.     EuroSys    

    Achieving Fairness Generalizability for Learning-based Congestion Control with Jury

    Han Tian, Xudong Liao, Decang Sun, Chaoliang Zeng, Yilun Jin, Junxue Zhang, Xinchen Wan, Zilong Wang, Yong Wang, Kai Chen

    In Proceedings of the 20th European Conference on Computer Systems, 2025

    paper | acm dl
  12.     SIGMOD    

    Sequoia: An Accessible and Extensible Framework for Privacy-Preserving Machine Learning over Distributed Data

    Kaiqiang Xu, Di Chai, Junxue Zhang, Fan Lai, Kai Chen

    In Proceedings of the International Conference on Management of Data, 2025

    paper | acm dl
  13.      APNet     

    RhyR: Cache-Aware Rate Control for RDMA I/O Congestion

    Qijing Li, Xinyang Huang, Bowen Liu, Pengbo Li, Junxue Zhang, Kai Chen

    In Proceedings of the 9th Asia-Pacific Workshop on Networking, 2025

    (To appear)
    Year
    2024

  15.    SIGCOMM   

    Fast, Scalable, and Accurate Rate Limiter for RDMA NICs

    Zilong Wang, Xinchen Wan, Luyang Li, Yijun Sun, Peng Xie, Xin Wei, Qingsong Ning, Junxue Zhang, Kai Chen

    In Proceedings of the ACM SIGCOMM 2024 Conference, 2024

    paper | acm dl
  16.         ATC        

    Efficient Decentralized Federated Singular Vector Decomposition

    Di Chai, Junxue Zhang#, Liu Yang, Yilun Jin, Leye Wang, Kai Chen#, Qiang Yang

    In Proceedings of the USENIX Annual Technical Conference, 2024

    paper | usenix page | code | docker

    Passed the Artifact Evaluation with all badges: Available, Functional, Reproduced

  17.     EuroSys    

    Accelerating Privacy-Preserving Machine Learning With GeniBatch

    Xinyang Huang, Junxue Zhang#, Xiaodian Cheng, Hong Zhang, Yilun Jin, Shuihai Hu, Han Tian, Kai Chen#

    In Proceedings of the 19th European Conference on Computer Systems, 2024

    paper | acm dl | code
  18.     Security    

    Accelerating Secure Collaborative Machine Learning with Protocol-Aware RDMA

    Zhenghang Ren, Mingxuan Fan, Zilong Wang, Junxue Zhang, Chaoliang Zeng, Zhicong Huang, Cheng Hong, Kai Chen

    In Proceedings of the 33rd USENIX Security Symposium, 2024

    paper | usenix page
  19.        CSUR       

    SoK: Fully Homomorphic Encryption Accelerators

    Junxue Zhang*, Xiaodian Cheng*, Liu Yang, Jinbin Hu, Ximeng Liu, Kai Chen

    ACM Computing Surveys, 2024, Volume: 56, Issue: 12

    paper | acm dl | docker | arXiv
  20.       TPDS      

    High-performance Hardware Acceleration Architecture for Cross-silo Federated Learning

    Junxue Zhang, Xiaodian Cheng, Liu Yang, Jinbin Hu, Han Tian, Kai Chen

    IEEE Transactions on Parallel and Distributed Systems, 2024, Volume: 35, Issue: 8

    paper | ieee dl | supplementary materials
  21.       TON      

    LiteFlow: Towards High-performance Adaptive Neural Networks for Kernel Datapath (extended version)

    Junxue Zhang, Chaoliang Zeng, Hong Zhang, Shuihai Hu, Kai Chen

    IEEE/ACM Transactions on Networking, 2024, Volume: 32, Issue: 1

    paper | ieee dl | code
  22.       TON      

    eMPTCP: A Framework to Fully Extend Multipath TCP

    Dian Shen, Bin Yang, Junxue Zhang, Fang Dong, John C.S. Lui

    IEEE/ACM Transactions on Networking, 2024, Volume: 32, Issue: 6

    paper | ieee dl
  23.       TKDE      

    A Survey for Federated Learning Evaluations: Goals and Measures

    Di Chai, Leye Wang, Liu Yang, Junxue Zhang, Kai Chen, Qiang Yang

    IEEE Transactions on Knowledge and Data Engineering, 2024, Volume: 36, Issue: 10

    paper | ieee dl | arXiv | code
  24.       TON      

    Load Balancing with Multi-level Signals for Lossless Datacenter Networks

    Jinbin Hu, Chaoliang Zeng, Zilong Wang, Junxue Zhang, Kun Guo, Hong Xu, Jiawei Huang, Kai Chen

    IEEE/ACM Transactions on Networking, 2024, Volume: 32, Issue: 3

    paper | ieee dl
  25.       TON      

    Efficient DRL-based Congestion Control with Ultra-low Overhead (extended version)

    Han Tian, Xudong Liao, Chaoliang Zeng, Decang Sun, Junxue Zhang, Kai Chen

    IEEE/ACM Transactions on Networking, 2024, Volume: 32, Issue: 3

    paper | ieee dl
    Year
    2023

  27.       ICNP      

    Enabling Load Balancing for Lossless Datacenters

    Jinbin Hu, Chaoliang Zeng, Zilong Wang, Junxue Zhang, Kun Guo, Hong Xu, Jiawei Huang, Kai Chen

    In Proceedings of the 31st IEEE International Conference on Network Protocols, 2023 (Best Paper Award)

    paper | ieee dl | award certificate | extended version
  28.       NSDI      

    FLASH: Towards a High-performance Hardware Acceleration Architecture for Cross-silo Federated Learning

    Junxue Zhang, Xiaodian Cheng, Wei Wang, Liu Yang, Jinbin Hu, Kai Chen

    In Proceedings of the 20th USENIX Symposium on Networked Systems Design and Implementation, 2023

    paper | usenix page | slides | video | extended version
  29.    INFOCOM   

    Communication Efficient Secret Sharing with Dynamic Communication-Computation Conversion

    Zhenghang Ren, Xiaodian Cheng, Mingxuan Fan, Junxue Zhang, Cheng Hong

    In Proceedings of IEEE International Conference on Computer Communications, 2023

    paper | ieee dl
  30.       TCC      

    Enabling ECN for Datacenter Networks with RTT Variations (extended version)

    Junxue Zhang, Wei Bai, Kai Chen

    IEEE Transactions on Cloud Computing, 2023, Volume: 11, Issue: 3

    paper | ieee dl | code
    Year
    2022

  32.    SIGCOMM   

    LiteFlow: Towards High-performance Adaptive Neural Networks for Kernel Datapath

    Junxue Zhang, Chaoliang Zeng, Hong Zhang, Shuihai Hu, Kai Chen

    In Proceedings of the ACM SIGCOMM 2022 Conference, 2022

    paper | acm dl | code | slides | video | extended version
  33.     CoNEXT    

    Spine: An Efficient DRL-based Congestion Control with Ultra-low Overhead

    Han Tian, Xudong Liao, Chaoliang Zeng, Junxue Zhang, Kai Chen

    In Proceedings of the 18th International Conference on emerging Networking EXperiments and Technologies, 2022

    paper | acm dl
  34.       ICNP      

    Towards the Full Extensibility of Multipath TCP with eMPTCP

    Bin Yang, Dian Shen, Junxue Zhang, Fang Dong, Junzhou Luo, John C.S. Lui

    In Proceedings of the 30th IEEE International Conference on Network Protocols, 2022

    paper | ieee dl | code | extended version
  35.       KDD      

    Practical Lossless Federated Singular Vector Decomposition Over Billion-Scale Data

    Di Chai, Leye Wang, Junxue Zhang, Liu Yang, Shuowei Cai, Kai Chen, Qiang Yang

    In Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2022

    paper | acm dl | code
  36.     EuroSys    

    Multi-Objective Congestion Control

    Yiqing Ma, Han Tian, Xudong Liao, Junxue Zhang, Weiyan Wang, Kai Chen, Xin Jin

    In Proceedings of the 17th European Conference on Computer Systems, 2022

    paper | acm dl
  37.     Oakland    

    Sphinx: Enabling Privacy-Preserving Online Learning over the Cloud

    Han Tian, Chaoliang Zeng, Zhenghang Ren, Di Chai, Junxue Zhang, Kai Chen, Qiang Yang

    In Proceedings of the 43rd IEEE Symposium on Security and Privacy, 2022

    paper | ieee dl
  38.     FL-IJCAI    

    Practical and Secure Federated Recommendation with Personalized Masks

    Liu Yang, Junxue Zhang, Di Chai, Leye Wang, Kun Guo, Kai Chen, Qiang Yang

    In International Workshop on Trustworthy Federated Learning in Conjunction with IJCAI, 2022

    paper | lecture notes
  39.     FL-IJCAI    

    Secure Forward Aggregation for Vertical Federated Neural Networks

    Shuowei Cai, Di Chai, Liu Yang, Junxue Zhang, Yilun Jin, Leye Wang, Kun Guo, Kai Chen

    In International Workshop on Trustworthy Federated Learning in Conjunction with IJCAI, 2022

    paper | lecture notes
    Before
    2022

  41.     ICDCS    

    Enabling Low Latency Edge Intelligence based on Multi-exit DNNs in the Wild

    Zhaowu Huang, Fang Dong, Dian Shen, Junxue Zhang, Huitian Wang, Guangxing Cai, Qiang He

    In Proceedings of the 41st IEEE International Conference on Distributed Computing Systems, 2021

    paper | ieee dl
  42.     FL-IJCAI    

    Aegis: A Trusted, Automatic and Accurate Verification Framework for Vertical Federated Learning

    Cengguang Zhang, Junxue Zhang, Di Chai, Kai Chen

    In International Workshop on Federated and Transfer Learning for Data Sparsity and Confidentiality in Conjunction with IJCAI, 2021 (Best Application Paper Award)

    paper | award certificate
  43.       TSC      

    Facilitating Application-aware Bandwidth Allocation in the Cloud with One-step-ahead Traffic Information

    Dian Shen, Junzhou Luo, Fang Dong, Jiahui Jin, Junxue Zhang, Jun Shen

    IEEE Transactions on Services Computing, 2020, Volume: 13, Issue: 2

    paper | ieee dl
  44.      APNet     

    RAT - Resilient Allreduce Tree for Distributed Machine Learning

    Xinchen Wan, Hong Zhang, Hao Wang, Shuihai Hu, Junxue Zhang, Kai Chen

    In Proceedings of the 4th Asia-Pacific Workshop on Networking, 2020

    paper | acm dl
  45.     CoNEXT    

    Enabling ECN for Datacenter Networks with RTT Variations

    Junxue Zhang, Wei Bai, Kai Chen

    In Proceedings of the 15th International Conference on emerging Networking EXperiments and Technologies, 2019

    paper | acm dl | code | slides | extended version
  46.     FL-IJCAI    

    Quantifying the Performance of Federated Transfer Learning

    Qinghe Jing, Weiyang Wang, Junxue Zhang, Han Tian, Kai Chen

    In International Workshop on Federated Learning for User Privacy and Data Confidentiality in Conjunction with IJCAI, 2019 (Best Student Paper Award)

    paper | award certificate
  47.      APNet     

    Rethinking Transport Layer Design for Distributed Machine Learning

    Jiachen Xia, Gaoxiong Zeng, Junxue Zhang, Weiyang Wang, Wei Bai, Junchen Jiang, Kai Chen

    In Proceedings of the 3rd Asia-Pacific Workshop on Networking, 2019

    paper | acm dl
  48.     HotCloud    

    Bridging the Edge-Cloud Barrier for Real-time Advanced Vision Analytics

    Yiding Wang, Weiyang Wang, Junxue Zhang, Junchen Jiang, Kai Chen

    In Proceedings of the 11th USENIX Workshop on Hot Topics in Cloud Computing, 2019

    paper | usenix page
  49.    SIGCOMM   

    Resilient Datacenter Load Balancing in the Wild

    Hong Zhang, Junxue Zhang, Wei Bai, Kai Chen, Mosharaf Chowdhury

    In Proceedings of the ACM SIGCOMM 2017 Conference, 2017

    paper | acm dl | code