Serena Yeung

Papers: 5

Total Citations: 72

Papers (5)

Describing Differences in Image Sets with Natural Language

Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation

MicroVQA: A Multimodal Reasoning Benchmark for Microscopy-Based Scientific Research

BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature

Apollo: An Exploration of Video Understanding in Large Multimodal Models