"code execution" Papers
3 papers found
Beyond Single-Task: Robust Multi-Task Length Generalization for LLMs
Yi Hu, Shijia Kang, Haotong Yang et al.
NEURIPS 2025posterarXiv:2502.11525
4
citations
Chain of Execution Supervision Promotes General Reasoning in Large Language Models
Nuo Chen, Zehua Li, Keqin Bao et al.
NEURIPS 2025posterarXiv:2510.23629
CRUXEval: A Benchmark for Code Reasoning, Understanding and Execution
Alex Gu, Baptiste Roziere, Hugh Leather et al.
ICML 2024posterarXiv:2401.03065