2024 Paper "policy gradient methods" Papers
2 papers found
Dialogue for Prompting: A Policy-Gradient-Based Discrete Prompt Generation for Few-Shot Learning
Chengzhengxu Li, Xiaoming Liu, Yichen Wang et al.
AAAI 2024paperarXiv:2308.07272
7
citations
Optimistic Policy Gradient in Multi-Player Markov Games with a Single Controller: Convergence beyond the Minty Property
I. Anagnostides, Ioannis Panageas, Gabriele Farina et al.
AAAI 2024paperarXiv:2312.12067
3
citations