"vision-language benchmarks" Papers

2 papers found