Yuxuan Cai

4

Papers

28

Total Citations

Papers (4)

LLaVA-KD: A Framework of Distilling Multimodal Large Language Models

Long Video Diffusion Generation with Segmented Cross-Attention and Content-Rich Video Data Curation

MobileMamba: Lightweight Multi-Receptive Visual Mamba Network

High-Performance Temporal Reversible Spiking Neural Networks with $\mathcal{O}(L)$ Training Memory and $\mathcal{O}(1)$ Inference Cost