Yu Su
16
Papers
513
Total Citations
2
Affiliations
Affiliations
MicrosoftThe Ohio State University
Papers (16)
Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts
ICLR 2024
252
citations
BioCLIP: A Vision Foundation Model for the Tree of Life
CVPR 2024
165
citations
VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents
ICLR 2025
67
citations
BioCLIP 2: Emergent Properties from Scaling Hierarchical Contrastive Learning
NeurIPS 2025
15
citations
Finer-CAM: Spotting the Difference Reveals Finer Details for Visual Explanation
CVPR 2025
8
citations
Prompt-CAM: Making Vision Transformers Interpretable for Fine-Grained Analysis
CVPR 2025
6
citations
MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI
CVPR 2024
0
citations
MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions
ICML 2024
0
citations
TravelPlanner: A Benchmark for Real-World Planning with Language Agents
ICML 2024
0
citations
RoboSpatial: Teaching Spatial Understanding to 2D and 3D Vision-Language Models for Robotics
CVPR 2025
0
citations
GPT-4V(ision) is a Generalist Web Agent, if Grounded
ICML 2024
0
citations
Distribution-Driven Dense Retrieval: Modeling Many-to-One Query-Document Relationship
AAAI 2025
0
citations
VERSE: Verification-based Self-Play for Code Instructions
AAAI 2025
0
citations
ScholarGEC: Enhancing Controllability of Large Language Model for Chinese Academic Grammatical Error Correction
AAAI 2025
0
citations
CONSIDER: Commonalities and Specialties Driven Multilingual Code Retrieval Framework
AAAI 2024
0
citations
Dual-View Visual Contextualization for Web Navigation
CVPR 2024
0
citations