Paper "llm-as-a-judge" Papers
2 papers found
Conference
M-Prometheus: A Suite of Open Multilingual LLM Judges
José Pombal, Dongkeun Yoon, Patrick Fernandes et al.
COLM 2025paperarXiv:2504.04953
23
citations
Zero-shot Benchmarking: A Framework for Flexible and Scalable Automatic Evaluation of Language Models
José Pombal, Nuno M Guerreiro, Ricardo Rei et al.
COLM 2025paperarXiv:2504.01001
8
citations