Yuxuan Cai
4
Papers
28
Total Citations
Papers (4)
LLaVA-KD: A Framework of Distilling Multimodal Large Language Models
ICCV 2025arXiv
23
citations
Long Video Diffusion Generation with Segmented Cross-Attention and Content-Rich Video Data Curation
CVPR 2025
5
citations
MobileMamba: Lightweight Multi-Receptive Visual Mamba Network
CVPR 2025
0
citations
High-Performance Temporal Reversible Spiking Neural Networks with $\mathcal{O}(L)$ Training Memory and $\mathcal{O}(1)$ Inference Cost
ICML 2024
0
citations