Sanmi Koyejo
4
Papers
55
Total Citations
Papers (4)
Why Has Predicting Downstream Capabilities of Frontier AI Models with Scale Remained Elusive?
ICML 2025
33
citations
Failures to Find Transferable Image Jailbreaks Between Vision-Language Models
ICLR 2025
22
citations
Implicit Regularization in Feedback Alignment Learning Mechanisms for Neural Networks
ICML 2024
0
citations
Transforming and Combining Rewards for Aligning Large Language Models
ICML 2024
0
citations