NEURIPS "model deployment" Papers
3 papers found
Estimating Model Performance Under Covariate Shift Without Labels
Jakub Białek, Juhani Kivimäki, Wojciech Kuberski et al.
NEURIPS 2025posterarXiv:2401.08348
5
citations
FALQON: Accelerating LoRA Fine-tuning with Low-Bit Floating-Point Arithmetic
Kanghyun Choi, Hyeyoon Lee, Sunjong Park et al.
NEURIPS 2025arXiv:2510.24061
Learning Grouped Lattice Vector Quantizers for Low-Bit LLM Compression
Xi Zhang, Xiaolin Wu, Jiamang Wang et al.
NEURIPS 2025posterarXiv:2510.20984