David Bau
19
Papers
503
Total Citations
Papers (19)
Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language Models
ICLR 2025
252
citations
Linearity of Relation Decoding in Transformer Language Models
ICLR 2024
140
citations
Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity Tracking
ICLR 2024
97
citations
MIB: A Mechanistic Interpretability Benchmark
ICML 2025
9
citations
When Are Concepts Erased From Diffusion Models?
NeurIPS 2025
5
citations
Seeing What a GAN Cannot Generate
ICCV 2019
0
citations
Sketch Your Own GAN
ICCV 2021arXiv
0
citations
Toward a Visual Concept Vocabulary for GAN Latent Space
ICCV 2021arXiv
0
citations
Erasing Concepts from Diffusion Models
ICCV 2023arXiv
0
citations
Rewriting a Deep Generative Model
ECCV 2020
0
citations
What makes fake images detectable? Understanding properties that generalize
ECCV 2020
0
citations
SliderSpace: Decomposing the Visual Capabilities of Diffusion Models
ICCV 2025
0
citations
Network Dissection: Quantifying Interpretability of Deep Visual Representations
CVPR 2017arXiv
0
citations
Learning Words by Drawing Images
CVPR 2019
0
citations
Diverse Image Generation via Self-Conditioned GANs
CVPR 2020arXiv
0
citations
Disentangling Visual and Written Concepts in CLIP
CVPR 2022
0
citations
Editing a classifier by rewriting its prediction rules
NeurIPS 2021
0
citations
Locating and Editing Factual Associations in GPT
NeurIPS 2022
0
citations
FIND: A Function Description Benchmark for Evaluating Interpretability Methods
NeurIPS 2023
0
citations