Yunhang Shen

36
Papers
2,185
Total Citations
10
h-index

Papers (36)

MME: A Comprehensive Evaluation Benchmark for Multimodal Large Language Models

NeurIPS 2025
1,227
citations

Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis

CVPR 2025
858
citations

Enabling Deep Residual Networks for Weakly Supervised Object Detection

ECCV 2020
49
citations

Weakly Supervised Open-Vocabulary Object Detection

AAAI 2024arXiv
16
citations

SPD-DDPM: Denoising Diffusion Probabilistic Models in the Symmetric Positive Definite Space

AAAI 2024arXiv
13
citations

Feature Denoising Diffusion Model for Blind Image Quality Assessment

AAAI 2025
8
citations

FlashSloth : Lightning Multimodal Large Language Models via Embedded Visual Compression

CVPR 2025
4
citations

Dynamic Contrastive Knowledge Distillation for Efficient Image Restoration

AAAI 2025
4
citations

Few-Shot Image Quality Assessment via Adaptation of Vision-Language Models

ICCV 2025arXiv
2
citations

From Objects to Events: Unlocking Complex Visual Understanding in Object Detectors via LLM-guided Symbolic Reasoning

ICCV 2025
2
citations

BUFF: Bayesian Uncertainty Guided Diffusion Probabilistic Model for Single Image Super-Resolution

AAAI 2025
2
citations

Noise-Aware Fully Webly Supervised Object Detection

CVPR 2020
0
citations

Toward Joint Thing-and-Stuff Mining for Weakly Supervised Panoptic Segmentation

CVPR 2021
0
citations

Active Teacher for Semi-Supervised Object Detection

CVPR 2022
0
citations

HybridCR: Weakly-Supervised 3D Point Cloud Semantic Segmentation via Hybrid Contrastive Regularization

CVPR 2022
0
citations

Parallel Detection-and-Segmentation Learning for Weakly Supervised Instance Segmentation

ICCV 2021
0
citations

Category-aware Allocation Transformer for Weakly Supervised Object Localization

ICCV 2023
0
citations

Efficient Decoder-Free Object Detection with Transformers

ECCV 2022
0
citations

ECO-TR: Efficient Correspondences Finding via Coarse-to-Fine Refinement

ECCV 2022
0
citations

Fine-Grained Data Distribution Alignment for Post-Training Quantization

ECCV 2022
0
citations

PixelFolder: An Efficient Progressive Pixel Synthesis Network for Image Generation

ECCV 2022arXiv
0
citations

Dynamic Dual Trainable Bounds for Ultra-Low Precision Super-Resolution Networks

ECCV 2022
0
citations

SeqTR: A Simple Yet Universal Network for Visual Grounding

ECCV 2022
0
citations

Integrating Global Context Contrast and Local Sensitivity for Blind Image Quality Assessment

ICML 2024
0
citations

Weakly Supervised Semantic Segmentation via Progressive Confidence Region Expansion

CVPR 2025
0
citations

Learning Interleaved Image-Text Comprehension in Vision-Language Large Models

ICLR 2025
0
citations

Probability-Density-aware Semi-supervised Learning

AAAI 2025
0
citations

Semi-supervised Blind Image Quality Assessment through Knowledge Distillation and Incremental Learning

AAAI 2024
0
citations

Solving the Catastrophic Forgetting Problem in Generalized Category Discovery

CVPR 2024
0
citations

A General and Efficient Training for Transformer via Token Expansion

CVPR 2024
0
citations

Aligning and Prompting Everything All at Once for Universal Visual Perception

CVPR 2024
0
citations

DS-VLM: Diffusion Supervision Vision Language Model

ICML 2025
0
citations

Adaptive Feature Selection for No-Reference Image Quality Assessment by Mitigating Semantic Noise Sensitivity

ICML 2024
0
citations

Cyclic Guidance for Weakly Supervised Joint Detection and Segmentation

CVPR 2019
0
citations

UWSOD: Toward Fully-Supervised-Level Capacity Weakly Supervised Object Detection

NeurIPS 2020
0
citations

CAPro: Webly Supervised Learning with Cross-modality Aligned Prototypes

NeurIPS 2023
0
citations