"speech-language models" Papers
2 papers found
EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions
Kai Chen, Yunhao Gou, Runhui Huang et al.
CVPR 2025 poster, arXiv:2409.18042
44 citations
LLaMA-Omni: Seamless Speech Interaction with Large Language Models
Qingkai Fang, Shoutao Guo, Yan Zhou et al.
ICLR 2025 poster, arXiv:2409.06666
127 citations