by Laks Lakshmanan Papers
3 papers found
BEST-Route: Adaptive LLM Routing with Test-Time Optimal Compute
Dujian Ding, Ankur Mallick, Shaokun Zhang et al.
ICML 2025posterarXiv:2506.22716
16
citations
OCCAM: Towards Cost-Efficient and Accuracy-Aware Classification Inference
Dujian Ding, Bicheng Xu, Laks Lakshmanan
ICLR 2025posterarXiv:2406.04508
Hybrid LLM: Cost-Efficient and Quality-Aware Query Routing
Dujian Ding, Ankur Mallick, Chi Wang et al.
ICLR 2024posterarXiv:2404.14618