2025 "inference-time alignment" Papers
3 papers found
ETA: Evaluating Then Aligning Safety of Vision Language Models at Inference Time
Yi Ding, Bolian Li, Ruqi Zhang
ICLR 2025posterarXiv:2410.06625
42
citations
Inference-Time Reward Hacking in Large Language Models
Hadi Khalaf, Claudio Mayrink Verdun, Alex Oesterling et al.
NeurIPS 2025spotlightarXiv:2506.19248
2
citations
Inference-Time Text-to-Video Alignment with Diffusion Latent Beam Search
Yuta Oshima, Masahiro Suzuki, Yutaka Matsuo et al.
NeurIPS 2025posterarXiv:2501.19252
20
citations