Paul Weng
5
Papers
3
Total Citations
Papers (5)
Reinforcement Learning from Imperfect Corrective Actions and Proxy Rewards
ICLR 2025
3
citations
Time Reversal Symmetry for Efficient Robotic Manipulations in Deep Reinforcement Learning
NeurIPS 2025arXiv
0
citations
Enhancing Online Reinforcement Learning with Meta-Learned Objective from Offline Data
AAAI 2025
0
citations
DUO: Diverse, Uncertain, On-Policy Query Generation and Selection for Reinforcement Learning from Human Feedback
AAAI 2025
0
citations
INViT: A Generalizable Routing Problem Solver with Invariant Nested View Transformer
ICML 2024
0
citations