Yaodong Yang

37
Papers
67
Total Citations

Papers (37)

A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning

AAAI 2024arXiv
25
citations

Amulet: ReAlignment During Test Time for Personalized Preference Adaptation of LLMs

ICLR 2025
20
citations

RAT: Adversarial Attacks on Deep Reinforcement Agents for Targeted Behaviors

AAAI 2025
12
citations

SAE-V: Interpreting Multimodal Models for Enhanced Alignment

ICML 2025
6
citations

Differentiable Information Enhanced Model-Based Reinforcement Learning

AAAI 2025
3
citations

InterMT: Multi-Turn Interleaved Preference Alignment with Human Feedback

NeurIPS 2025
1
citations

Sample-Efficient Multiagent Reinforcement Learning with Reset Replay

ICML 2024
0
citations

End-to-End Neuro-Symbolic Reinforcement Learning with Textual Explanations

ICML 2024
0
citations

Efficient Adaptation in Mixed-Motive Environments via Hierarchical Opponent Modeling and Planning

ICML 2024
0
citations

UniDexGrasp++: Improving Dexterous Grasping Policy Learning via Geometry-Aware Curriculum and Iterative Generalist-Specialist Learning

ICCV 2023
0
citations

Social World Model-Augmented Mechanism Design Policy Learning

NeurIPS 2025
0
citations

Empirical Study on Robustness and Resilience in Cooperative Multi-Agent Reinforcement Learning

NeurIPS 2025
0
citations

Towards Efficient Collaboration via Graph Modeling in Reinforcement Learning

AAAI 2025
0
citations

ProAgent: Building Proactive Cooperative Agents with Large Language Models

AAAI 2024
0
citations

STAS: Spatial-Temporal Return Decomposition for Multi-Agent Reinforcement Learning

AAAI 2024arXiv
0
citations

AnySkill: Learning Open-Vocabulary Physical Skill for Interactive Agents

CVPR 2024
0
citations

Safe Reinforcement Learning using Finite-Horizon Gradient-based Estimation

ICML 2024
0
citations

Team-PSRO for Learning Approximate TMECor in Large Team Games via Cooperative Reinforcement Learning

NeurIPS 2023
0
citations

Hierarchical Multi-Agent Skill Discovery

NeurIPS 2023
0
citations

Policy Space Diversity for Non-Transitive Games

NeurIPS 2023arXiv
0
citations

Mean Field Multi-Agent Reinforcement Learning

ICML 2018
0
citations

Thermostat-assisted continuously-tempered Hamiltonian Monte Carlo for Bayesian learning

NeurIPS 2018
0
citations

Replica-Exchange Nos\'e-Hoover Dynamics for Bayesian Learning on Large Datasets

NeurIPS 2020
0
citations

Towards Unifying Behavioral and Response Diversity for Open-ended Learning in Zero-sum Games

NeurIPS 2021
0
citations

Neural Auto-Curricula in Two-Player Zero-Sum Games

NeurIPS 2021
0
citations

Settling the Variance of Multi-Agent Policy Gradients

NeurIPS 2021
0
citations

Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning

NeurIPS 2022
0
citations

Constrained Update Projection Approach to Safe Policy Optimization

NeurIPS 2022
0
citations

A Unified Diversity Measure for Multiagent Reinforcement Learning

NeurIPS 2022
0
citations

Multi-Agent Reinforcement Learning is a Sequence Modeling Problem

NeurIPS 2022
0
citations

Meta-Reward-Net: Implicitly Differentiable Reward Learning for Preference-based Reinforcement Learning

NeurIPS 2022
0
citations

MATE: Benchmarking Multi-Agent Reinforcement Learning in Distributed Target Coverage Control

NeurIPS 2022
0
citations

A Theoretical Understanding of Gradient Bias in Meta-Reinforcement Learning

NeurIPS 2022
0
citations

Transformer-based Working Memory for Multiagent Reinforcement Learning with Action Parsing

NeurIPS 2022
0
citations

Safety Gymnasium: A Unified Safe Reinforcement Learning Benchmark

NeurIPS 2023
0
citations

BeaverTails: Towards Improved Safety Alignment of LLM via a Human-Preference Dataset

NeurIPS 2023
0
citations

Multi-Agent First Order Constrained Optimization in Policy Space

NeurIPS 2023
0
citations