Yaodong Yang
37
Papers
67
Total Citations
Papers (37)
A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning
AAAI 2024arXiv
25
citations
Amulet: ReAlignment During Test Time for Personalized Preference Adaptation of LLMs
ICLR 2025
20
citations
RAT: Adversarial Attacks on Deep Reinforcement Agents for Targeted Behaviors
AAAI 2025
12
citations
SAE-V: Interpreting Multimodal Models for Enhanced Alignment
ICML 2025
6
citations
Differentiable Information Enhanced Model-Based Reinforcement Learning
AAAI 2025
3
citations
InterMT: Multi-Turn Interleaved Preference Alignment with Human Feedback
NeurIPS 2025
1
citations
Sample-Efficient Multiagent Reinforcement Learning with Reset Replay
ICML 2024
0
citations
End-to-End Neuro-Symbolic Reinforcement Learning with Textual Explanations
ICML 2024
0
citations
Efficient Adaptation in Mixed-Motive Environments via Hierarchical Opponent Modeling and Planning
ICML 2024
0
citations
UniDexGrasp++: Improving Dexterous Grasping Policy Learning via Geometry-Aware Curriculum and Iterative Generalist-Specialist Learning
ICCV 2023
0
citations
Social World Model-Augmented Mechanism Design Policy Learning
NeurIPS 2025
0
citations
Empirical Study on Robustness and Resilience in Cooperative Multi-Agent Reinforcement Learning
NeurIPS 2025
0
citations
Towards Efficient Collaboration via Graph Modeling in Reinforcement Learning
AAAI 2025
0
citations
ProAgent: Building Proactive Cooperative Agents with Large Language Models
AAAI 2024
0
citations
STAS: Spatial-Temporal Return Decomposition for Multi-Agent Reinforcement Learning
AAAI 2024arXiv
0
citations
AnySkill: Learning Open-Vocabulary Physical Skill for Interactive Agents
CVPR 2024
0
citations
Safe Reinforcement Learning using Finite-Horizon Gradient-based Estimation
ICML 2024
0
citations
Team-PSRO for Learning Approximate TMECor in Large Team Games via Cooperative Reinforcement Learning
NeurIPS 2023
0
citations
Hierarchical Multi-Agent Skill Discovery
NeurIPS 2023
0
citations
Policy Space Diversity for Non-Transitive Games
NeurIPS 2023arXiv
0
citations
Mean Field Multi-Agent Reinforcement Learning
ICML 2018
0
citations
Thermostat-assisted continuously-tempered Hamiltonian Monte Carlo for Bayesian learning
NeurIPS 2018
0
citations
Replica-Exchange Nos\'e-Hoover Dynamics for Bayesian Learning on Large Datasets
NeurIPS 2020
0
citations
Towards Unifying Behavioral and Response Diversity for Open-ended Learning in Zero-sum Games
NeurIPS 2021
0
citations
Neural Auto-Curricula in Two-Player Zero-Sum Games
NeurIPS 2021
0
citations
Settling the Variance of Multi-Agent Policy Gradients
NeurIPS 2021
0
citations
Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning
NeurIPS 2022
0
citations
Constrained Update Projection Approach to Safe Policy Optimization
NeurIPS 2022
0
citations
A Unified Diversity Measure for Multiagent Reinforcement Learning
NeurIPS 2022
0
citations
Multi-Agent Reinforcement Learning is a Sequence Modeling Problem
NeurIPS 2022
0
citations
Meta-Reward-Net: Implicitly Differentiable Reward Learning for Preference-based Reinforcement Learning
NeurIPS 2022
0
citations
MATE: Benchmarking Multi-Agent Reinforcement Learning in Distributed Target Coverage Control
NeurIPS 2022
0
citations
A Theoretical Understanding of Gradient Bias in Meta-Reinforcement Learning
NeurIPS 2022
0
citations
Transformer-based Working Memory for Multiagent Reinforcement Learning with Action Parsing
NeurIPS 2022
0
citations
Safety Gymnasium: A Unified Safe Reinforcement Learning Benchmark
NeurIPS 2023
0
citations
BeaverTails: Towards Improved Safety Alignment of LLM via a Human-Preference Dataset
NeurIPS 2023
0
citations
Multi-Agent First Order Constrained Optimization in Policy Space
NeurIPS 2023
0
citations