Tyler Griggs, Xiaoxuan Liu, Jiaxiang Yu, Doyoung Kim, Wei-Lin Chiang, Alvin Cheung, Ion Stoica (2024). Mélange: Cost Efficient Large Language Model Serving by Exploiting GPU Heterogeneity. DOI: https://doi.org/10.48550/arxiv.2404.14527.