by Hinrich Schuetze Papers
3 papers found
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models
Clemencia Siro, Guy Gur-Ari, Gaurav Mishra et al.
ICLR 2025oralarXiv:2206.04615
2192
citations
NoLiMa: Long-Context Evaluation Beyond Literal Matching
Ali Modarressi, Hanieh Deilamsalehy, Franck Dernoncourt et al.
ICML 2025posterarXiv:2502.05167
51
citations
Refusal Direction is Universal Across Safety-Aligned Languages
Xinpeng Wang, Mingyang Wang, Yihong Liu et al.
NeurIPS 2025posterarXiv:2505.17306
4
citations