Lin Song
Papers: 4
Total Citations: 6
Papers (4)
HaploVL: A Single-Transformer Baseline for Multi-Modal Understanding
ICML 2025
6
citations
YOLO-World: Real-Time Open-Vocabulary Object Detection
CVPR 2024
0
citations
Low-Rank Approximation for Sparse Attention in Multi-Modal LLMs
CVPR 2024
0
citations
UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition
CVPR 2024
0
citations