"speech synthesis" Papers
6 papers found
ELF: Encoding Speaker-Specific Latent Speech Feature for Speech Synthesis
Jungil Kong, Junmo Lee, Jeongmin Kim et al.
ICML 2024poster
Scaling Speech Technology to 1,000+ Languages
Vineel Pratap Konduru, Andros Tjandra, Bowen Shi et al.
ICML 2024poster
SECap: Speech Emotion Captioning with Large Language Model
Yaoxun Xu, Hangting Chen, Jianwei Yu et al.
AAAI 2024paperarXiv:2312.10381
56
citations
SelfVC: Voice Conversion With Iterative Refinement using Self Transformations
Paarth Neekhara, Shehzeen Hussain, Rafael Valle et al.
ICML 2024poster
UniAudio: Towards Universal Audio Generation with Large Language Models
Dongchao Yang, Jinchuan Tian, Xu Tan et al.
ICML 2024poster
What to Remember: Self-Adaptive Continual Learning for Audio Deepfake Detection
XiaoHui Zhang, Jiangyan Yi, Chenglong Wang et al.
AAAI 2024paperarXiv:2312.09651