Qingkai Fang

4

papers

235

total citations

papers (4)

LLaMA-Omni: Seamless Speech Interaction with Large Language Models

LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token

FastLongSpeech: Enhancing Large Speech-Language Models for Efficient Long-Speech Processing

NeurIPS 2025arXiv

DASpeech: Directed Acyclic Transformer for Fast and High-quality Speech-to-Speech Translation

NeurIPS 2023arXiv