NEURIPS 2025 "training efficiency optimization" Papers
2 papers found
AdaSTaR: Adaptive Data Sampling for Training Self-Taught Reasoners
Reiss Koh, Wonbeen Oh, Jaein Jang et al.
NEURIPS 2025 | poster | arXiv:2505.16322
2 citations
Transferring Linear Features Across Language Models With Model Stitching
Alan Chen, Jack Merullo, Alessandro Stolfo et al.
NEURIPS 2025 | spotlight | arXiv:2506.06609
1 citation