ICLR "language model evaluation" Papers
2 papers found
ImpScore: A Learnable Metric For Quantifying The Implicitness Level of Sentences
Yuxin Wang, Xiaomeng Zhu, Weimin Lyu et al.
ICLR 2025posterarXiv:2411.05172
2
citations
Justice or Prejudice? Quantifying Biases in LLM-as-a-Judge
Jiayi Ye, Yanbo Wang, Yue Huang et al.
ICLR 2025posterarXiv:2410.02736
207
citations