Qingkai Fang
4
papers
235
total citations
papers (4)
LLaMA-Omni: Seamless Speech Interaction with Large Language Models
ICLR 2025arXiv
127
citations
LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token
ICLR 2025arXiv
106
citations
FastLongSpeech: Enhancing Large Speech-Language Models for Efficient Long-Speech Processing
NeurIPS 2025arXiv
2
citations
DASpeech: Directed Acyclic Transformer for Fast and High-quality Speech-to-Speech Translation
NeurIPS 2023arXiv
0
citations