2025 "solution space exploration" Papers
2 papers found
Generation as Search Operator for Test-Time Scaling of Diffusion-based Combinatorial Optimization
Yang Li, Lvda Chen, Haonan Wang et al.
NeurIPS 2025poster
ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models
Mingjie Liu, Shizhe Diao, Ximing Lu et al.
NeurIPS 2025posterarXiv:2505.24864
99
citations