Poster "activation steering" Papers
3 papers found
Conference
Controlling Language and Diffusion Models by Transporting Activations
Pau Rodriguez, Arno Blaas, Michal Klein et al.
ICLR 2025arXiv:2410.23054
22
citations
Steering Protein Language Models
Long-Kai Huang, Rongyi Zhu, Bing He et al.
ICML 2025arXiv:2509.07983
3
citations
Steering When Necessary: Flexible Steering Large Language Models with Backtracking
Zifeng Cheng, Jinwei Gan, Zhiwei Jiang et al.
NEURIPS 2025arXiv:2508.17621
1
citations