AAAI "black-box evaluation" Papers
2 papers found
A Black-Box Evaluation Framework for Semantic Robustness in Bird’s Eye View Detection
Fu Wang, Yanghao Zhang, Xiangyu Yin et al.
AAAI 2025paperarXiv:2412.13913
Underspecification in Language Modeling Tasks: A Causality-Informed Study of Gendered Pronoun Resolution
Emily McMilin
AAAI 2024paperarXiv:2210.00131