"post-training" Papers
4 papers found
Conference
Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining
Rosie Zhao, Alexandru Meterez, Sham M. Kakade et al.
COLM 2025paperarXiv:2504.07912
87
citations
MALT: Improving Reasoning with Multi-Agent LLM Training
Sumeet Ramesh Motwani, Chandler Smith, Rocktim Jyoti Das et al.
COLM 2025paperarXiv:2412.01928
37
citations
Modifying Large Language Model Post-Training for Diverse Creative Writing
John Joon Young Chung, Vishakh Padmakumar, Melissa Roemmele et al.
COLM 2025paperarXiv:2503.17126
25
citations
Tulu 3: Pushing Frontiers in Open Language Model Post-Training
Nathan Lambert, Jacob Morrison, Valentina Pyatkin et al.
COLM 2025paperarXiv:2411.15124
491
citations