Poster "generative ai evaluation" Papers
2 papers found
Validating LLM-as-a-Judge Systems under Rating Indeterminacy
Luke Guerdan, Solon Barocas, Kenneth Holstein et al.
NEURIPS 2025posterarXiv:2503.05965
6
citations
Evaluating Text-to-Visual Generation with Image-to-Text Generation
Zhiqiu Lin, Deepak Pathak, Baiqi Li et al.
ECCV 2024posterarXiv:2404.01291
347
citations