Bio
We haven't found any bio for you yet.
Researcher Links
Loading links...
Publications by Type
Loading publications…
The last 5 uploaded publications
PrefillOnly: An Inference Engine for Prefill-only Workloads in Large Language Model Applications
Kuntai Du, Bowen Wang, Chen Zhang, Yi‐Ming Cheng, Qing Lan, H. S. Sang, Yihua Cheng, Jiayi Yao, Xiaoxuan Liu, Yong Qiao, Ion Stoica, Junchen Jiang (2025). PrefillOnly: An Inference Engine for Prefill-only Workloads in Large Language Model Applications. , DOI: https://doi.org/10.48550/arxiv.2505.07203.
Preprint30 days agoBarbarians at the Gate: How AI is Upending Systems Research
Audrey Cheng, Shu Liu, Margaret Pan, Zhifei Li, Bowen Wang, Alex Krentsel, Xia Tian, Mert Cemri, Jongseok Park, Shuo Yang, Jeff Chen, Aditya Desai, Jiarong Xing, Koushik Sen, Matei Zaharia, Ion Stoica (2025). Barbarians at the Gate: How AI is Upending Systems Research. , DOI: https://doi.org/10.48550/arxiv.2510.06189.
Preprint30 days agoPrefillOnly: An Inference Engine for Prefill-only Workloads in Large Language Model Applications
Kuntai Du, Bowen Wang, Chen Zhang, Yi‐Ming Cheng, Qing Lan, H. S. Sang, Yihua Cheng, Jiayi Yao, Xiaoxuan Liu, Yong Qiao, Ion Stoica, Junchen Jiang (2025). PrefillOnly: An Inference Engine for Prefill-only Workloads in Large Language Model Applications. , DOI: https://doi.org/10.1145/3731569.3764834.
Article30 days ago