"model comparison" Papers
3 papers found
A Curious Case of the Missing Measure: Better Scores and Worse Generation
Joseph Turian, Jordie Shier
ICLR 2025poster
Re-evaluating Open-ended Evaluation of Large Language Models
Si-Qi Liu, Ian Gemp, Luke Marris et al.
ICLR 2025posterarXiv:2502.20170
5
citations
Representational Difference Explanations
Neehar Kondapaneni, Oisin Mac Aodha, Pietro Perona
NeurIPS 2025posterarXiv:2505.23917