AAAI Paper "proximal policy optimization" Papers
3 papers found
Colored Noise in PPO: Improved Exploration and Performance through Correlated Action Sampling
Jakob Hollenstein, Georg Martius, Justus Piater
AAAI 2024paperarXiv:2312.11091
8
citations
Graph-Based Prediction and Planning Policy Network (GP3Net) for Scalable Self-Driving in Dynamic Environments Using Deep Reinforcement Learning
Jayabrata Chowdhury, Venkataramanan Shivaraman, Suresh Sundaram et al.
AAAI 2024paperarXiv:2312.05784
Learning Diverse Risk Preferences in Population-Based Self-Play
Yuhua Jiang, Qihan Liu, Xiaoteng Ma et al.
AAAI 2024paperarXiv:2305.11476