Christopher Potts
8
Papers
114
Total Citations
Papers (8)
AxBench: Steering LLMs? Even Simple Baselines Outperform Sparse Autoencoders
ICML 2025
100
citations
MrT5: Dynamic Token Merging for Efficient Byte-level Language Models
ICLR 2025
14
citations
Dynaboard: An Evaluation-As-A-Service Platform for Holistic Next-Generation Benchmarking
NeurIPS 2021
0
citations
Decrypting Cryptic Crosswords: Semantically Complex Wordplay Puzzles as a Target for NLP
NeurIPS 2021
0
citations
Baleen: Robust Multi-Hop Reasoning at Scale via Condensed Retrieval
NeurIPS 2021
0
citations
CEBaB: Estimating the Causal Effects of Real-World Concepts on NLP Model Behavior
NeurIPS 2022
0
citations
Interpretability at Scale: Identifying Causal Mechanisms in Alpaca
NeurIPS 2023
0
citations
Causal Abstractions of Neural Networks
NeurIPS 2021
0
citations