"vision-language benchmark" Papers

2 papers found