2025 "proximal policy optimization" Papers
3 papers found
Ada-K Routing: Boosting the Efficiency of MoE-based LLMs
Zijia Zhao, Longteng Guo, Jie Cheng et al.
ICLR 2025, poster, arXiv:2410.10456
8 citations
A Snapshot of Influence: A Local Data Attribution Framework for Online Reinforcement Learning
Yuzheng Hu, Fan Wu, Haotian Ye et al.
NeurIPS 2025, oral, arXiv:2505.19281
2 citations
AutoEdit: Automatic Hyperparameter Tuning for Image Editing
Chau Pham, Quan Dao, Mahesh Bhosale et al.
NeurIPS 2025, poster, arXiv:2509.15031
1 citation