Yu Su

21
Papers
513
Total Citations
2
Affiliations

Affiliations

Microsoft
The Ohio State University

Papers (21)

Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts

ICLR 2024
252
citations

BioCLIP: A Vision Foundation Model for the Tree of Life

CVPR 2024
165
citations

VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents

ICLR 2025
67
citations

BioCLIP 2: Emergent Properties from Scaling Hierarchical Contrastive Learning

NeurIPS 2025
15
citations

Finer-CAM: Spotting the Difference Reveals Finer Details for Visual Explanation

CVPR 2025
8
citations

Prompt-CAM: Making Vision Transformers Interpretable for Fine-Grained Analysis

CVPR 2025
6
citations

MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI

CVPR 2024
0
citations

MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions

ICML 2024
0
citations

TravelPlanner: A Benchmark for Real-World Planning with Language Agents

ICML 2024
0
citations

GPT-4V(ision) is a Generalist Web Agent, if Grounded

ICML 2024
0
citations

One Step at a Time: Long-Horizon Vision-and-Language Navigation With Milestones

CVPR 2022 (arXiv)
0
citations

LLM-Planner: Few-Shot Grounded Planning for Embodied Agents with Large Language Models

ICCV 2023
0
citations

RoboSpatial: Teaching Spatial Understanding to 2D and 3D Vision-Language Models for Robotics

CVPR 2025
0
citations

Distribution-Driven Dense Retrieval: Modeling Many-to-One Query-Document Relationship

AAAI 2025
0
citations

VERSE: Verification-based Self-Play for Code Instructions

AAAI 2025
0
citations

ScholarGEC: Enhancing Controllability of Large Language Model for Chinese Academic Grammatical Error Correction

AAAI 2025
0
citations

CONSIDER: Commonalities and Specialties Driven Multilingual Code Retrieval Framework

AAAI 2024
0
citations

Dual-View Visual Contextualization for Web Navigation

CVPR 2024
0
citations

Mind2Web: Towards a Generalist Agent for the Web

NeurIPS 2023
0
citations

Holistic Transfer: Towards Non-Disruptive Fine-Tuning with Partial Target Data

NeurIPS 2023
0
citations

MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing

NeurIPS 2023
0
citations