"ppo algorithm" Papers
2 papers found
MaFeRw: Query Rewriting with Multi-Aspect Feedbacks for Retrieval-Augmented Large Language Models
Yujing Wang, Hainan Zhang, Liang Pang et al.
AAAI 2025paperarXiv:2408.17072
8
citations
Breaking the Barrier: Enhanced Utility and Robustness in Smoothed DRL Agents
Chung-En Sun, Sicun Gao, Lily Weng
ICML 2024posterarXiv:2406.18062