by Chris Potts Papers
3 papers found
Blackbox Model Provenance via Palimpsestic Membership Inference
Rohith Kuditipudi, Jing Huang, Sally Zhu et al.
NeurIPS 2025spotlightarXiv:2510.19796
1
citations
Do Language Models Use Their Depth Efficiently?
Róbert Csordás, Christopher D Manning, Chris Potts
NeurIPS 2025poster
Improved Representation Steering for Language Models
Zhengxuan Wu, Qinan Yu, Aryaman Arora et al.
NeurIPS 2025spotlight