ICLR 2025 "latency optimization" Papers
3 papers found
CONGO: Compressive Online Gradient Optimization
Jeremy Carleton, Prathik Vijaykumar, Divyanshu Saxena et al.
ICLR 2025posterarXiv:2407.06325
IFORMER: INTEGRATING CONVNET AND TRANSFORMER FOR MOBILE APPLICATION
Chuanyang Zheng
ICLR 2025posterarXiv:2501.15369
4
citations
Preble: Efficient Distributed Prompt Scheduling for LLM Serving
Vikranth Srivatsa, Zijian He, Reyna Abhyankar et al.
ICLR 2025posterarXiv:2407.00023
41
citations