ICLR "cost-effective evaluation" Papers
2 papers found
Polyrating: A Cost-Effective and Bias-Aware Rating System for LLM Evaluation
Jasper Dekoninck, Maximilian Baader, Martin Vechev
ICLR 2025, poster, arXiv:2409.00696
3 citations
Trust or Escalate: LLM Judges with Provable Guarantees for Human Agreement
Jaehun Jung, Faeze Brahman, Yejin Choi
ICLR 2025, poster, arXiv:2407.18370
42 citations