Shen Yan

17
Papers
96
Total Citations

Papers (17)

MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency

ICML 2025
88
citations

CompCap: Improving Multimodal Large Language Models with Composite Captions

ICCV 2025
6
citations

NTR-Gaussian: Nighttime Dynamic Thermal Reconstruction with 4D Gaussian Splatting Based on Thermodynamics

CVPR 2025arXiv
2
citations

Pixel-Aligned Language Model

CVPR 2024
0
citations

VideoPrism: A Foundational Visual Encoder for Video Understanding

ICML 2024
0
citations

Multiview Transformers for Video Recognition

CVPR 2022arXiv
0
citations

Long-Term Visual Localization With Mobile Sensors

CVPR 2023arXiv
0
citations

Soft Augmentation for Image Classification

CVPR 2023
0
citations

Towards Memory- and Time-Efficient Backpropagation for Training Spiking Neural Networks

ICCV 2023
0
citations

UnLoc: A Unified Framework for Video Localization Tasks

ICCV 2023arXiv
0
citations

Deep Active Contours for Real-time 6-DoF Object Tracking

ICCV 2023
0
citations

MutualNet: Adaptive ConvNet via Mutual Learning from Network Width and Resolution

ECCV 2020
0
citations

Training High-Performance Low-Latency Spiking Neural Networks by Differentiation on Spike Representation

CVPR 2022arXiv
0
citations

LoD-Loc v2: Aerial Visual Localization over Low Level-of-Detail City Models using Explicit Silhouette Alignment

ICCV 2025
0
citations

Streaming Dense Video Captioning

CVPR 2024
0
citations

Does Unsupervised Architecture Representation Learning Help Neural Architecture Search?

NeurIPS 2020
0
citations

NAS-Bench-x11 and the Power of Learning Curves

NeurIPS 2021
0
citations