Publications

This is a collection of my publications.

2025

  1. ElaLoRA: Elastic & Learnable Low-Rank Adaptation for Efficient Model Fine-Tuning
    Huandong Chang, Zicheng Ma, Mingyuan Ma, and 4 more authors
    arXiv preprint, 2025
  2. Prism: Unleashing GPU Sharing for Cost-Efficient Multi-LLM Serving
    Shan Yu, Jiarong Xing, Yifan Qiao, and 4 more authors
    Under Review at OSDI 2026, 2025

2024

  1. Octopus: On-device language model for function calling of software APIs
    Wei Chen, Zhiyuan Li, and Mingyuan Ma
    NAACL 2025 Industry Track (Oral), 2024
  2. Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers
    Zhenting Qi, Mingyuan Ma, Jiahang Xu, and 3 more authors
    ICLR 2025, 2024

2023

  1. Preventing Zero-Shot Transfer Degradation in Continual Learning of Vision-Language Models
    Zangwei Zheng, Mingyuan Ma, Kai Wang, and 3 more authors
    ICCV 2023, 2023