Papers by Yiting He
2 papers found
Model Editing as a Robust and Denoised variant of DPO: A Case Study on Toxicity
Rheeya Uppaal, Apratim Dey, Yiting He et al.
ICLR 2025, poster, arXiv:2405.13967
Sample Complexity of Distributionally Robust Off-Dynamics Reinforcement Learning with Online Interaction
Yiting He, Zhishuai Liu, Weixin Wang et al.
ICML 2025, poster, arXiv:2511.05396