Shauli Ravfogel
4
Papers
11
Total Citations
Papers (4)
Gumbel Counterfactual Generation From Language Models
ICLR 2025
8
citations
Emergence of Linear Truth Encodings in Language Models
NeurIPS 2025arXiv
3
citations
Preserving Task-Relevant Information Under Linear Concept Removal
NeurIPS 2025arXiv
0
citations
Representation Surgery: Theory and Practice of Affine Steering
ICML 2024
0
citations