2024 Poster "human preferences" Papers
2 papers found
Limited Preference Aided Imitation Learning from Imperfect Demonstrations
Xingchen Cao, Fan-Ming Luo, Junyin Ye et al.
ICML 2024 poster
Linear Alignment: A Closed-form Solution for Aligning Human Preferences without Tuning and Feedback
Songyang Gao, Qiming Ge, Wei Shen et al.
ICML 2024 poster, arXiv:2401.11458