Poster "frontier ai systems" Papers
2 papers found
Noise Injection Reveals Hidden Capabilities of Sandbagging Language Models
Cameron Tice, Philipp Kreer, Nathan Helm-Burger et al.
NeurIPS 2025 poster, arXiv:2412.01784
6 citations
Position: Require Frontier AI Labs To Release Small "Analog" Models
Shriyash Upadhyay, Philip Quirke, Narmeen Oozeer et al.
NeurIPS 2025 poster