Xiaohan Ding
4
Papers
11
Total Citations
Papers (4)
Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities
CVPR 2024arXiv
11
citations
UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio Video Point Cloud Time-Series and Image Recognition
CVPR 2024
0
citations
Quantized Prompt for Efficient Generalization of Vision-Language Models
ECCV 2024arXiv
0
citations
Low-Rank Approximation for Sparse Attention in Multi-Modal LLMs
CVPR 2024
0
citations