Publications

You can also find my articles on my Google Scholar profile

Research Outlines

My main research interests lie in the following aspects. Large Language models and foundation models. i. Model editing and memory management for large language models (LLMs). ii. LLM agent and reasoning. iii. Parametric understanding of LLMs (localization, merging, scaling, pruning, stitching, unlearning, editing, and etc.) iv. Text-to-model generation by diffusion transformers (DiTs). v. Vision-Language (V-L) representation and understanding of multi-modal foundation models. Trustworthy deep learning. i. Privacy-preserving federated learning (FL), efficient & robust algorithm design, and generalization, personalization & training dynamics understanding. ii. Mechanistic interpretability of neural networks, weight decay, loss landscape, permutation invariance, linear mode connectivity, and etc. iii. Socio-technical issues brought by collaborative learning. iv. Responsible and trustworthy AI.


Main Publications

* indicates equal contributions, # indicates corresponding author.

2025

2024

2023

2022

2021