"benchmark datasets" Papers
4 papers found
Is Large-scale Pretraining the Secret to Good Domain Generalization?
Piotr Teterwak, Kuniaki Saito, Theodoros Tsiligkaridis et al.
ICLR 2025posterarXiv:2412.02856
5
citations
Directly Denoising Diffusion Models
Dan Zhang, Jingjing Wang, Feng Luo
ICML 2024poster
MM-SafetyBench: A Benchmark for Safety Evaluation of Multimodal Large Language Models
Xin Liu, Yichen Zhu, Jindong Gu et al.
ECCV 2024posterarXiv:2311.17600
183
citations
RetroOOD: Understanding Out-of-Distribution Generalization in Retrosynthesis Prediction
Yemin Yu, Luotian Yuan, Ying WEI et al.
AAAI 2024paperarXiv:2312.10900