Poster Papers by Ahmed H Awadallah
3 papers found
Exploratory Preference Optimization: Harnessing Implicit Q*-Approximation for Sample-Efficient RLHF
Tengyang Xie, Dylan Foster, Akshay Krishnamurthy et al.
ICLR 2025 | poster | arXiv:2405.21046
Hybrid LLM: Cost-Efficient and Quality-Aware Query Routing
Dujian Ding, Ankur Mallick, Chi Wang et al.
ICLR 2024 | poster | arXiv:2404.14618
Teaching Language Models to Hallucinate Less with Synthetic Tasks
Erik Jones, Hamid Palangi, Clarisse Ribeiro et al.
ICLR 2024 | poster | arXiv:2310.06827