NEURIPS Poster "post-training alignment" Papers
2 papers found
Self Iterative Label Refinement via Robust Unlabeled Learning
Hikaru Asano, Tadashi Kozuno, Yukino Baba
NEURIPS 2025 poster arXiv:2502.12565
1 citation
Tracing the Representation Geometry of Language Models from Pretraining to Post-training
Melody Li, Kumar Krishna Agrawal, Arna Ghosh et al.
NEURIPS 2025 poster arXiv:2509.23024
6 citations