Yue Fan
8 Papers
97 Total Citations
Papers (8)
Multi-modal Agent Tuning: Building a VLM-Driven Agent for Efficient Tool Usage
ICLR 2025
37 citations
MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos
ICLR 2025, arXiv
34 citations
Multi-Layer Visual Feature Fusion in Multimodal LLMs: Methods, Analysis, and Best Practices
CVPR 2025
15 citations
Embodied VideoAgent: Persistent Memory from Egocentric Videos and Embodied Sensors Enables Dynamic Scene Understanding
ICCV 2025
11 citations
Factored-NeuS: Reconstructing Surfaces, Illumination, and Materials of Possibly Glossy Objects
CVPR 2025
0 citations
CoSSL: Co-Learning of Representation and Classifier for Imbalanced Semi-Supervised Learning
CVPR 2022, arXiv
0 citations
SSB: Simple but Strong Baseline for Boosting Performance of Open-Set Semi-Supervised Learning
ICCV 2023
0 citations
USB: A Unified Semi-supervised Learning Benchmark for Classification
NeurIPS 2022
0 citations