Yunhang Shen

20
Papers
2,136
Total Citations
10
h-index

Papers (20)

MME: A Comprehensive Evaluation Benchmark for Multimodal Large Language Models

NeurIPS 2025
1,227
citations

Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis

CVPR 2025
858
citations

Weakly Supervised Open-Vocabulary Object Detection

AAAI 2024arXiv
16
citations

SPD-DDPM: Denoising Diffusion Probabilistic Models in the Symmetric Positive Definite Space

AAAI 2024arXiv
13
citations

Feature Denoising Diffusion Model for Blind Image Quality Assessment

AAAI 2025
8
citations

FlashSloth : Lightning Multimodal Large Language Models via Embedded Visual Compression

CVPR 2025
4
citations

Dynamic Contrastive Knowledge Distillation for Efficient Image Restoration

AAAI 2025
4
citations

From Objects to Events: Unlocking Complex Visual Understanding in Object Detectors via LLM-guided Symbolic Reasoning

ICCV 2025
2
citations

Few-Shot Image Quality Assessment via Adaptation of Vision-Language Models

ICCV 2025arXiv
2
citations

BUFF: Bayesian Uncertainty Guided Diffusion Probabilistic Model for Single Image Super-Resolution

AAAI 2025
2
citations

Adaptive Feature Selection for No-Reference Image Quality Assessment by Mitigating Semantic Noise Sensitivity

ICML 2024
0
citations

Weakly Supervised Semantic Segmentation via Progressive Confidence Region Expansion

CVPR 2025
0
citations

Learning Interleaved Image-Text Comprehension in Vision-Language Large Models

ICLR 2025
0
citations

Probability-Density-aware Semi-supervised Learning

AAAI 2025
0
citations

Semi-supervised Blind Image Quality Assessment through Knowledge Distillation and Incremental Learning

AAAI 2024
0
citations

Solving the Catastrophic Forgetting Problem in Generalized Category Discovery

CVPR 2024
0
citations

A General and Efficient Training for Transformer via Token Expansion

CVPR 2024
0
citations

Aligning and Prompting Everything All at Once for Universal Visual Perception

CVPR 2024
0
citations

DS-VLM: Diffusion Supervision Vision Language Model

ICML 2025
0
citations

Integrating Global Context Contrast and Local Sensitivity for Blind Image Quality Assessment

ICML 2024
0
citations