Publications

You can also find my articles on my Google Scholar profile

Research Outlines

My main research interests lie in the following aspects. Large Language models and foundation models. i. Model editing and memory management for large language models (LLMs). ii. LLM agent and reasoning. iii. Parametric understanding of LLMs (localization, merging, scaling, pruning, stitching, unlearning, editing, and etc.) iv. Text-to-model generation by diffusion transformers (DiTs). v. Vision-Language (V-L) representation and understanding of multi-modal foundation models. Trustworthy deep learning. i. Privacy-preserving federated learning (FL), efficient & robust algorithm design, and generalization, personalization & training dynamics understanding. ii. Mechanistic interpretability of neural networks, weight decay, loss landscape, permutation invariance, linear mode connectivity, and etc. iii. Socio-technical issues brought by collaborative learning. iv. Responsible and trustworthy AI.

Main Publications

* indicates equal contributions, # indicates corresponding author, † indicates mentorship.

2025

Editing as Unlearning: Are Knowledge Editing Methods Strong Baselines for Large Language Model Unlearning?
Zexi Li*†, Xiangzhu Wang*, William F. Shen, Meghdad Kurmanji, Xinchi Qiu, Dongqi Cai, Chao Wu#, Nicholas D. Lane#
preprint. [arxiv]
FedGuCci: Making Local Models More Connected in Landscape for Federated Learning
Zexi Li*, Jie Lin*, Zhiqi Li*, Didi Zhu, Tao Shen, Tao Lin#, Chao Wu#, Nicholas D. Lane
ACM KDD 2025. [arxiv]
Towards Universal Personalization in Federated Learning via Collaborative Foundation Generative Models
Chenrui Wu*, Zexi Li*, Fangxin Wang, Hongyang Chen, Jiajun Bu, and Haishuai Wang#
IEEE Transactions on Mobile Computing. [paper]
FediOS: Decoupling Orthogonal Subspaces for Personalization in Feature-skew Federated Learning
Lingzhi Gao*, Zexi Li*#†, Yang Lu, and Chao Wu#
Machine Learning. [arxiv]
You Are Your Own Best Teacher: Achieving Centralized-level Performance in Federated Learning under Heterogeneous and Long-tailed Data
Shanshan Yan, Zexi Li†, Chao Wu, Meng Pang, Yang Lu#, Yan Yan, Hanzi Wang
ICCV 2025. [arxiv]
Text2Weight: Bridging Natural Language and Neural Network Weight Spaces
Bowen Tian*, Wenshuo Chen*, Zexi Li†, Songning Lai, Jiemin Wu, Yutao Yue
ACM MM 2025.
Merging LoRAs like Playing LEGO: Pushing the Modularity of LoRA to Extremes Through Rank-Wise Clustering
Ziyu Zhao, Tao Shen, Didi Zhu, Zexi Li, Jing Su, Xuwu Wang, Kun Kuang, Fei Wu
ICLR 2025. [arxiv]

2024

Text-to-Model: Text-Conditioned Neural Network Diffusion for Train-Once-for-All Personalization
Zexi Li*, Lingzhi Gao*, Chao Wu#
preprint. [arxiv]
WISE: Rethinking the Knowledge Memory for Lifelong Model Editing of Large Language Models
Peng Wang*, Zexi Li*, Ningyu Zhang#, Ziwen Xu, Yunzhi Yao, Yong Jiang, Pengjun Xie, Fei Huang, Huajun Chen#
NeurIPS 2024. [arxiv]
Model Tailor: Mitigating Catastrophic Forgetting in Multi-modal Large Language Models
Didi Zhu, Zhongyi Sun, Zexi Li, Tao Shen, Ke Yan, Shouhong Ding, Kun Kuang#, Chao Wu#
ICML 2024. [arxiv]
Neural Collapse Anchored Prompt Tuning for Generalizable Vision-Language Models
Didi Zhu, Zexi Li, Min Zhang, Junkun Yuan, Yunfeng Shao, Yinchuan Li, Jiashuo Liu, Kun Kuang, Chao Wu#
ACM KDD 2024. [arxiv]
OpenFedLLM: Training Large Language Models on Decentralized Private Data via Federated Learning
Rui Ye, Wenhao Wang, Jingyi Chai, Dihan Li, Zexi Li, Yinda Xu, Yaxin Du, Yanfeng Wang, Siheng Chen
ACM KDD 2024. [arxiv]

2023

Training-time Neuron Alignment through Permutation Subspace for Improving Linear Mode Connectivity and Model Fusion
Zexi Li, Zhiqi Li, Jie Lin, Tao Shen, Tao Lin#, and Chao Wu#
preprint. [arxiv]
Revisiting Weighted Aggregation in Federated Learning with Neural Networks
Zexi Li, Tao Lin#, Xinyi Shang, and Chao Wu#
ICML 2023. [paper][github][arxiv]
No Fear of Classifier Biases: Neural Collapse Inspired Federated Learning with Synthetic and Fixed Classifier
Zexi Li, Xinyi Shang, Rui He, Tao Lin#, and Chao Wu#
ICCV 2023. [paper][arxiv]
Universal Domain Adaptation via Compressive Attention Matching
Didi Zhu*, Yinchuan Li*, Junkun Yuan, Zexi Li, Kun Kuang#, Chao Wu#
ICCV 2023. [paper]
Edge-cloud Collaborative Learning with Federated and Centralized Features
Zexi Li*, Qunwei Li*, Yi Zhou, Wenliang Zhong#, Guannan Zhang, and Chao Wu#
SIGIR 2023. [paper][arxiv]
Learning Cautiously in Federated Learning with Noisy and Heterogeneous Clients
Chenrui Wu*, Zexi Li*†, Fangxin Wang#, and Chao Wu#
(Oral) ICME 2023. [paper][arxiv] –

2022

Towards Effective Clustered Federated Learning: A Peer-to-peer Framework with Adaptive Neighbor Matching
Zexi Li, Jiaxun Lu, Shuang Luo, Didi Zhu, Yunfeng Shao, Yinchuan Li, Zhimeng Zhang, Yongheng Wang#, and Chao Wu#
- Long version in IEEE Transactions on Big Data. (JCR Q1, IF: 7.2) [paper][arxiv]
- Short version in MLSys2022 Workshop. [poster]
Can We Share Models If Sharing Data Is Not an Option?
Zexi Li, Feng Mao#, and Chao Wu#
Patterns, Cell Press. (JCR Q1, IF: 6.5) [paper]

2021

Boosting the generalization ability of Vis-NIR-spectroscopy-based regression models through dimension reduction and transfer learning
Xiaoli Li*, Zexi Li*, Xufeng Yang, and Yong He#
Computers and Electronics in Agriculture. (JCR Q1, CAS Top, IF: 8.3) [paper]
An early assessment of the County Medical Community reform in China: a case study of Zhejiang province
Chao Wu, Yixin Tu, Zexi Li, Jianxing Yu#
Journal of Chinese Governance. (SSCI, JCR Q1, IF: 3.0) [paper]

Zexi Li 李則熹