ICLR 2025 "multi-domain evaluation" Papers
2 papers found
DataEnvGym: Data Generation Agents in Teacher Environments with Student Feedback
Zaid Khan, Elias Stengel-Eskin, Jaemin Cho et al.
ICLR 2025 poster, arXiv:2410.06215
8 citations
MetaMetrics: Calibrating Metrics for Generation Tasks Using Human Preferences
Genta Winata, David Anugraha, Lucky Susanto et al.
ICLR 2025 poster, arXiv:2410.02381
17 citations