Yunhang Shen
36
Papers
2,185
Total Citations
10
h-index
Papers (36)
MME: A Comprehensive Evaluation Benchmark for Multimodal Large Language Models
NeurIPS 2025
1,227
citations
Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis
CVPR 2025
858
citations
Enabling Deep Residual Networks for Weakly Supervised Object Detection
ECCV 2020
49
citations
Weakly Supervised Open-Vocabulary Object Detection
AAAI 2024arXiv
16
citations
SPD-DDPM: Denoising Diffusion Probabilistic Models in the Symmetric Positive Definite Space
AAAI 2024arXiv
13
citations
Feature Denoising Diffusion Model for Blind Image Quality Assessment
AAAI 2025
8
citations
FlashSloth : Lightning Multimodal Large Language Models via Embedded Visual Compression
CVPR 2025
4
citations
Dynamic Contrastive Knowledge Distillation for Efficient Image Restoration
AAAI 2025
4
citations
Few-Shot Image Quality Assessment via Adaptation of Vision-Language Models
ICCV 2025arXiv
2
citations
From Objects to Events: Unlocking Complex Visual Understanding in Object Detectors via LLM-guided Symbolic Reasoning
ICCV 2025
2
citations
BUFF: Bayesian Uncertainty Guided Diffusion Probabilistic Model for Single Image Super-Resolution
AAAI 2025
2
citations
Noise-Aware Fully Webly Supervised Object Detection
CVPR 2020
0
citations
Toward Joint Thing-and-Stuff Mining for Weakly Supervised Panoptic Segmentation
CVPR 2021
0
citations
Active Teacher for Semi-Supervised Object Detection
CVPR 2022
0
citations
HybridCR: Weakly-Supervised 3D Point Cloud Semantic Segmentation via Hybrid Contrastive Regularization
CVPR 2022
0
citations
Parallel Detection-and-Segmentation Learning for Weakly Supervised Instance Segmentation
ICCV 2021
0
citations
Category-aware Allocation Transformer for Weakly Supervised Object Localization
ICCV 2023
0
citations
Efficient Decoder-Free Object Detection with Transformers
ECCV 2022
0
citations
ECO-TR: Efficient Correspondences Finding via Coarse-to-Fine Refinement
ECCV 2022
0
citations
Fine-Grained Data Distribution Alignment for Post-Training Quantization
ECCV 2022
0
citations
PixelFolder: An Efficient Progressive Pixel Synthesis Network for Image Generation
ECCV 2022arXiv
0
citations
Dynamic Dual Trainable Bounds for Ultra-Low Precision Super-Resolution Networks
ECCV 2022
0
citations
SeqTR: A Simple Yet Universal Network for Visual Grounding
ECCV 2022
0
citations
Integrating Global Context Contrast and Local Sensitivity for Blind Image Quality Assessment
ICML 2024
0
citations
Weakly Supervised Semantic Segmentation via Progressive Confidence Region Expansion
CVPR 2025
0
citations
Learning Interleaved Image-Text Comprehension in Vision-Language Large Models
ICLR 2025
0
citations
Probability-Density-aware Semi-supervised Learning
AAAI 2025
0
citations
Semi-supervised Blind Image Quality Assessment through Knowledge Distillation and Incremental Learning
AAAI 2024
0
citations
Solving the Catastrophic Forgetting Problem in Generalized Category Discovery
CVPR 2024
0
citations
A General and Efficient Training for Transformer via Token Expansion
CVPR 2024
0
citations
Aligning and Prompting Everything All at Once for Universal Visual Perception
CVPR 2024
0
citations
DS-VLM: Diffusion Supervision Vision Language Model
ICML 2025
0
citations
Adaptive Feature Selection for No-Reference Image Quality Assessment by Mitigating Semantic Noise Sensitivity
ICML 2024
0
citations
Cyclic Guidance for Weakly Supervised Joint Detection and Segmentation
CVPR 2019
0
citations
UWSOD: Toward Fully-Supervised-Level Capacity Weakly Supervised Object Detection
NeurIPS 2020
0
citations
CAPro: Webly Supervised Learning with Cross-modality Aligned Prototypes
NeurIPS 2023
0
citations