Shen Yan
17
Papers
96
Total Citations
Papers (17)
MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency
ICML 2025
88
citations
CompCap: Improving Multimodal Large Language Models with Composite Captions
ICCV 2025
6
citations
NTR-Gaussian: Nighttime Dynamic Thermal Reconstruction with 4D Gaussian Splatting Based on Thermodynamics
CVPR 2025 (arXiv)
2
citations
Pixel-Aligned Language Model
CVPR 2024
0
citations
VideoPrism: A Foundational Visual Encoder for Video Understanding
ICML 2024
0
citations
Multiview Transformers for Video Recognition
CVPR 2022 (arXiv)
0
citations
Long-Term Visual Localization With Mobile Sensors
CVPR 2023 (arXiv)
0
citations
Soft Augmentation for Image Classification
CVPR 2023
0
citations
Towards Memory- and Time-Efficient Backpropagation for Training Spiking Neural Networks
ICCV 2023
0
citations
UnLoc: A Unified Framework for Video Localization Tasks
ICCV 2023 (arXiv)
0
citations
Deep Active Contours for Real-time 6-DoF Object Tracking
ICCV 2023
0
citations
MutualNet: Adaptive ConvNet via Mutual Learning from Network Width and Resolution
ECCV 2020
0
citations
Training High-Performance Low-Latency Spiking Neural Networks by Differentiation on Spike Representation
CVPR 2022 (arXiv)
0
citations
LoD-Loc v2: Aerial Visual Localization over Low Level-of-Detail City Models using Explicit Silhouette Alignment
ICCV 2025
0
citations
Streaming Dense Video Captioning
CVPR 2024
0
citations
Does Unsupervised Architecture Representation Learning Help Neural Architecture Search?
NeurIPS 2020
0
citations
NAS-Bench-x11 and the Power of Learning Curves
NeurIPS 2021
0
citations