Nathan Helm-Burger
Papers: 3
Total Citations: 18
Papers (3)
CoT Red-Handed: Stress Testing Chain-of-Thought Monitoring
NeurIPS 2025
12 citations
Noise Injection Reveals Hidden Capabilities of Sandbagging Language Models
NeurIPS 2025
arXiv
6 citations
The WMDP Benchmark: Measuring and Reducing Malicious Use with Unlearning
ICML 2024
0 citations