David Bau
16
Papers
510
Total Citations
Papers (16)
Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language Models
ICLR 2025arXiv
252
citations
Linearity of Relation Decoding in Transformer Language Models
ICLR 2024arXiv
140
citations
Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity Tracking
ICLR 2024arXiv
97
citations
MIB: A Mechanistic Interpretability Benchmark
ICML 2025arXiv
9
citations
SliderSpace: Decomposing the Visual Capabilities of Diffusion Models
ICCV 2025arXiv
7
citations
When Are Concepts Erased From Diffusion Models?
NeurIPS 2025arXiv
5
citations
Diverse Image Generation via Self-Conditioned GANs
CVPR 2020arXiv
0
citations
Disentangling Visual and Written Concepts in CLIP
CVPR 2022
0
citations
Sketch Your Own GAN
ICCV 2021arXiv
0
citations
Toward a Visual Concept Vocabulary for GAN Latent Space
ICCV 2021arXiv
0
citations
Erasing Concepts from Diffusion Models
ICCV 2023arXiv
0
citations
Rewriting a Deep Generative Model
ECCV 2020
0
citations
What makes fake images detectable? Understanding properties that generalize
ECCV 2020
0
citations
Editing a classifier by rewriting its prediction rules
NeurIPS 2021arXiv
0
citations
Locating and Editing Factual Associations in GPT
NeurIPS 2022arXiv
0
citations
FIND: A Function Description Benchmark for Evaluating Interpretability Methods
NeurIPS 2023arXiv
0
citations