Home
Research
People
Publications
Demos
News
Workshops
Gallery
Light
Dark
Automatic
Mell: Memory-Efficient Large Language Model Serving via Multi-GPU KV Cache Management
Qianli Liu
,
Zicong Hong
,
Peng Li
,
Fahao Chen
,
Song Guo
May 2025
Type
Conference paper
Publication
IEEE International Conference on Computer Communications (INFOCOM) (CCF-A)
Qianli Liu
PhD Student
Zicong Hong
Graduated PhD Student
Song Guo
Chair Professor
Cite
×