AAAI 2024 "language model evaluation" Papers
2 papers found
LatestEval: Addressing Data Contamination in Language Model Evaluation through Dynamic and Time
Sensitive Test Construction - Yucheng Li, Frank Guerin, Chenghua Lin
AAAI 2024paperarXiv:2312.12343
53
citations
Task Contamination: Language Models May Not Be Few-Shot Anymore
Changmao Li, Jeffrey Flanigan
AAAI 2024paperarXiv:2312.16337
130
citations