David Abel
Papers: 3
Total Citations: 8
Papers (3)
A Black Swan Hypothesis: The Role of Human Irrationality in AI Safety
ICLR 2025, arXiv
4 citations
Studying the Interplay Between the Actor and Critic Representations in Reinforcement Learning
ICLR 2025
4 citations
Pragmatic Feature Preferences: Learning Reward-Relevant Preferences from Human Input
ICML 2024
0 citations