Poster by Qingcheng Zeng Papers
2 papers found
CARES: Comprehensive Evaluation of Safety and Adversarial Robustness in Medical LLMs
Sijia Chen, Xiaomin Li, mengxue zhang et al.
NeurIPS 2025poster
ThinkBench: Dynamic Out-of-Distribution Evaluation for Robust LLM Reasoning
Shulin Huang, Linyi Yang, Yan Song et al.
NeurIPS 2025posterarXiv:2502.16268
14
citations