Lin Song
14 Papers
6 Total Citations
Papers (14)
HaploVL: A Single-Transformer Baseline for Multi-Modal Understanding
ICML 2025
6
citations
Low-Rank Approximation for Sparse Attention in Multi-Modal LLMs
CVPR 2024
0
citations
UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio Video Point Cloud Time-Series and Image Recognition
CVPR 2024
0
citations
TACNet: Transition-Aware Context Network for Spatio-Temporal Action Detection
CVPR 2019
0
citations
Learning Dynamic Routing for Semantic Segmentation
CVPR 2020 (arXiv)
0
citations
End-to-End Object Detection With Fully Convolutional Network
CVPR 2021 (arXiv)
0
citations
BoxSnake: Polygonal Instance Segmentation with Box Supervision
ICCV 2023 (arXiv)
0
citations
YOLO-World: Real-Time Open-Vocabulary Object Detection
CVPR 2024
0
citations
Learnable Tree Filter for Structure-preserving Feature Transform
NeurIPS 2019
0
citations
Rethinking Learnable Tree Filter for Generic Feature Transform
NeurIPS 2020
0
citations
Fine-Grained Dynamic Head for Object Detection
NeurIPS 2020
0
citations
Dynamic Grained Encoder for Vision Transformers
NeurIPS 2021
0
citations
Meta-Adapter: An Online Few-shot Learner for Vision-Language Model
NeurIPS 2023
0
citations
GPT4Tools: Teaching Large Language Model to Use Tools via Self-instruction
NeurIPS 2023
0
citations