"hyper-parameter optimization" Papers
3 papers found
How Does Critical Batch Size Scale in Pre-training?
Hanlin Zhang, Depen Morwani, Nikhil Vyas et al.
ICLR 2025, poster, arXiv:2410.21676
37 citations
Towards Effective Evaluations and Comparisons for LLM Unlearning Methods
Qizhou Wang, Bo Han, Puning Yang et al.
ICLR 2025, poster, arXiv:2406.09179
21 citations
Multi-Objective Bayesian Optimization with Active Preference Learning
Ryota Ozaki, Kazuki Ishikawa, Youhei Kanzaki et al.
AAAI 2024, paper, arXiv:2311.13460
14 citations