The last 5 uploaded publications
MoE-Lightning: High-Throughput MoE Inference on Memory-constrained GPUs
Shiyi Cao, Shu Liu, Tyler Griggs, Peter Schafhalter, Xiaoxuan Liu, Ying Sheng, Joseph E. Gonzalez, Matei Zaharia, Ion Stoica (2025). MoE-Lightning: High-Throughput MoE Inference on Memory-constrained GPUs. DOI: https://doi.org/10.1145/3669940.3707267.
Article, 26 days ago
LLMs Can Easily Learn to Reason from Demonstrations Structure, not content, is what matters!
Dacheng Li, Shiyi Cao, Tyler Griggs, Shu Liu, Xiangxi Mo, Shishir G. Patil, Matei Zaharia, Joseph E. Gonzalez, Ion Stoica (2025). LLMs Can Easily Learn to Reason from Demonstrations Structure, not content, is what matters! DOI: https://doi.org/10.48550/arxiv.2502.07374.
Preprint, 26 days ago
Prism: Unleashing GPU Sharing for Cost-Efficient Multi-LLM Serving
Shan Yu, Jiarong Xing, Yifan Qiao, Mingyuan Ma, Yangmin Li, Yang Wang, Shuo Yang, Zhiqiang Xie, Shiyi Cao, Ke Bao, Ion Stoica, Harry Xu, Ying Sheng (2025). Prism: Unleashing GPU Sharing for Cost-Efficient Multi-LLM Serving. DOI: https://doi.org/10.48550/arxiv.2505.04021.
Preprint, 26 days ago