2025 Poster "model comparison" Papers
5 papers found
A Curious Case of the Missing Measure: Better Scores and Worse Generation
Joseph Turian, Jordie Shier
ICLR 2025poster
Limits to scalable evaluation at the frontier: LLM as judge won’t beat twice the data
Florian Eddie Dorner, Vivian Nastl, Moritz Hardt
ICLR 2025poster
23
citations
Re-evaluating Open-ended Evaluation of Large Language Models
Si-Qi Liu, Ian Gemp, Luke Marris et al.
ICLR 2025posterarXiv:2502.20170
5
citations
Representational Difference Explanations
Neehar Kondapaneni, Oisin Mac Aodha, Pietro Perona
NeurIPS 2025posterarXiv:2505.23917
Representational Similarity via Interpretable Visual Concepts
Neehar Kondapaneni, Oisin Mac Aodha, Pietro Perona
ICLR 2025posterarXiv:2503.15699
3
citations