2024 "reinforce algorithm" Papers
3 papers found
Finding Visual Task Vectors
Alberto Hojel, Yutong Bai, Trevor Darrell et al.
ECCV 2024posterarXiv:2404.05729
14
citations
ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models
Ziniu Li, Tian Xu, Yushun Zhang et al.
ICML 2024poster
Response Enhanced Semi-supervised Dialogue Query Generation
Jianheng Huang, Ante Wang, Linfeng Gao et al.
AAAI 2024paperarXiv:2312.12713