"sequential decision-making" Papers
16 papers found
Adaptive teachers for amortized samplers
Minsu Kim, Sanghyeok Choi, Taeyoung Yun et al.
DataEnvGym: Data Generation Agents in Teacher Environments with Student Feedback
Zaid Khan, Elias Stengel-Eskin, Jaemin Cho et al.
Learning from negative feedback, or positive feedback or both
Abbas Abdolmaleki, Bilal Piot, Bobak Shahriari et al.
Learning Robust Representations with Long-Term Information for Generalization in Visual Reinforcement Learning
Rui Yang, Jie Wang, Qijie Peng et al.
Prediction with expert advice under additive noise
Alankrita Bhatt, Victoria Kostina
Self-Generated In-Context Examples Improve LLM Agents for Sequential Decision-Making Tasks
Vishnu Sarukkai, Zhiqiang Xie, Kayvon Fatahalian
Adapting Static Fairness to Sequential Decision-Making: Bias Mitigation Strategies towards Equal Long-term Benefit Rate
Yuancheng Xu, Chenghao Deng, Yanchao Sun et al.
Agent-Specific Effects: A Causal Effect Propagation Analysis in Multi-Agent MDPs
Stelios Triantafyllou, Aleksa Sukovic, Debmalya Mandal et al.
ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL
Yifei Zhou, Andrea Zanette, Jiayi Pan et al.
Best Arm Identification for Stochastic Rising Bandits
Marco Mussi, Alessandro Montenegro, Francesco Trovò et al.
Imitation Learning from Purified Demonstrations
Yunke Wang, Minjing Dong, Yukun Zhao et al.
Limited Preference Aided Imitation Learning from Imperfect Demonstrations
Xingchen Cao, Fan-Ming Luo, Junyin Ye et al.
Offline Transition Modeling via Contrastive Energy Learning
Ruifeng Chen, Chengxing Jia, Zefang Huang et al.
Parameterized Projected Bellman Operator
Théo Vincent, Alberto Maria Metelli, Boris Belousov et al.
Reason2Drive: Towards Interpretable and Chain-based Reasoning for Autonomous Driving
Ming Nie, Renyuan Peng, Chunwei Wang et al.
Rethinking Transformers in Solving POMDPs
Chenhao Lu, Ruizhe Shi, Yuyao Liu et al.