"monte carlo tree search" Papers

24 papers found

AFlow: Automating Agentic Workflow Generation

Jiayi Zhang, Jinyu Xiang, Zhaoyang Yu et al.

ICLR 2025posterarXiv:2410.10762
135
citations

AutoDiscovery: Open-ended Scientific Discovery via Bayesian Surprise

Dhruv Agarwal, Bodhisattwa Prasad Majumder, Reece Adamson et al.

NeurIPS 2025oralarXiv:2507.00310
3
citations

Certifying Language Model Robustness with Fuzzed Randomized Smoothing: An Efficient Defense Against Backdoor Attacks

Bowei He, Lihao Yin, Huiling Zhen et al.

ICLR 2025posterarXiv:2502.06892
3
citations

CLDyB: Towards Dynamic Benchmarking for Continual Learning with Pre-trained Models

Shengzhuang Chen, Yikai Liao, Xiaoxiao Sun et al.

ICLR 2025posterarXiv:2503.04655
1
citations

Grammar Reinforcement Learning: path and cycle counting in graphs with a Context-Free Grammar and Transformer approach

Jason Piquenot, Maxime Berar, Romain Raveaux et al.

ICLR 2025poster

HumanoidGen: Data Generation for Bimanual Dexterous Manipulation via LLM Reasoning

Zhi Jing, Siyuan Yang, Jicong Ao et al.

NeurIPS 2025posterarXiv:2507.00833
6
citations

Improving Monte Carlo Tree Search for Symbolic Regression

Zhengyao Huang, Daniel Huang, Tiannan Xiao et al.

NeurIPS 2025posterarXiv:2509.15929

MALinZero: Efficient Low-Dimensional Search for Mastering Complex Multi-Agent Planning

Sizhe Tang, Jiayu Chen, Tian Lan

NeurIPS 2025posterarXiv:2511.06142
1
citations

PlanU: Large Language Model Reasoning through Planning under Uncertainty

Ziwei Deng, Mian Deng, Chenjing Liang et al.

NeurIPS 2025posterarXiv:2510.18442

RF-Agent: Automated Reward Function Design via Language Agent Tree Search

Ning Gao, Xiuhui Zhang, Xingyu Jiang et al.

NeurIPS 2025spotlight

Strength Estimation and Human-Like Strength Adjustment in Games

Chun Jung Chen, Chung-Chin Shih, Ti-Rong Wu

ICLR 2025posterarXiv:2502.17109
1
citations

Uncertainty-Guided Exploration for Efficient AlphaZero Training

Scott Cheng, Meng-Yu Tsai, Ding-Yong Hong et al.

NeurIPS 2025poster

Understanding Methods for Scalable MCTS

Will Knipe

ICLR 2025poster

Wider or Deeper? Scaling LLM Inference-Time Compute with Adaptive Branching Tree Search

Yuichi Inoue, Kou Misaki, Yuki Imajuku et al.

NeurIPS 2025spotlightarXiv:2503.04412
18
citations

A Bayesian Approach to Online Planning

Nir Greshler, David Ben Eli, Carmel Rabinovitz et al.

ICML 2024poster

DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models (Exemplified as A Video Agent)

Zongxin Yang, Guikun Chen, Xiaodi Li et al.

ICML 2024oral

Efficient Adaptation in Mixed-Motive Environments via Hierarchical Opponent Modeling and Planning

Yizhe Huang, Anji Liu, Fanqi Kong et al.

ICML 2024poster

Language Agent Tree Search Unifies Reasoning, Acting, and Planning in Language Models

Andy Zhou, Kai Yan, Michal Shlapentokh-Rothman et al.

ICML 2024poster

Monte Carlo Tree Search in the Presence of Transition Uncertainty

Farnaz Kohankhaki, Kiarash Aghakasiri, Hongming Zhang et al.

AAAI 2024paperarXiv:2312.11348
3
citations

Multi-Agent Reinforcement Learning with Hierarchical Coordination for Emergency Responder Stationing

Amutheezan Sivagnanam, Ava Pettet, Hunter Lee et al.

ICML 2024poster

Position: Rethinking Post-Hoc Search-Based Neural Approaches for Solving Large-Scale Traveling Salesman Problems

Yifan Xia, Xianliang Yang, Zichuan Liu et al.

ICML 2024poster

Provably Efficient Long-Horizon Exploration in Monte Carlo Tree Search through State Occupancy Regularization

Liam Schramm, Abdeslam Boularias

ICML 2024poster

Sample-and-Bound for Non-convex Optimization

Yaoguang Zhai, Zhizhen Qin, Sicun Gao

AAAI 2024paperarXiv:2401.04812
1
citations

Scalable Safe Policy Improvement for Factored Multi-Agent MDPs

Federico Bianchi, Edoardo Zorzi, Alberto Castellini et al.

ICML 2024poster