by Yiqiao Zhong Papers
2 papers found
Model Editing as a Robust and Denoised variant of DPO: A Case Study on Toxicity
Rheeya Uppaal, Apratim Dey, Yiting He et al.
ICLR 2025poster
Unifying Attention Heads and Task Vectors via Hidden State Geometry in In-Context Learning
Haolin Yang, Hakaze Cho, Yiqiao Zhong et al.
NeurIPS 2025posterarXiv:2505.18752
2
citations