by Min Cai Papers
2 papers found
Conference
How Post-Training Reshapes LLMs: A Mechanistic View on Knowledge, Truthfulness, Refusal, and Confidence
Hongzhe Du, Weikai Li, Min Cai et al.
COLM 2025paperarXiv:2504.02904
5
citations
Strategist: Self-improvement of LLM Decision Making via Bi-Level Tree Search
Jonathan Light, Min Cai, Weiqin Chen et al.
ICLR 2025posterarXiv:2408.10635