S-LoRA studies concurrent LoRA adapter serving and is one of the core systems papers in the LMSYS serving line.
S-LoRA
Ying Sheng, Shiyi Cao, Dacheng Li, Coleman Hooper, Nicholas Lee, Shuo Yang, Christopher Chou, Banghua Zhu, Lianmin Zheng, Kurt Keutzer, Joseph E. Gonzalez, Ion Stoica
|
May 1, 2024
