α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Teun van der Weij
Teun van der Weij
2
Papers
64
Total Citations
Papers (2)
AI Sandbagging: Language Models can Strategically Underperform on Evaluations
ICLR 2025
58
citations
Noise Injection Reveals Hidden Capabilities of Sandbagging Language Models
NeurIPS 2025
arXiv
6
citations