Poster "text-to-speech synthesis" Papers
3 papers found
DiTTo-TTS: Diffusion Transformers for Scalable Text-to-Speech without Domain-Specific Factors
Keon Lee, Dong Won Kim, Jaehyeon Kim et al.
ICLR 2025, poster, arXiv:2406.11427
28 citations
HALL-E: Hierarchical Neural Codec Language Model for Minute-Long Zero-Shot Text-to-Speech Synthesis
Yuto Nishimura, Takumi Hirose, Masanari Ohi et al.
ICLR 2025, poster, arXiv:2410.04380
5 citations
MaskGCT: Zero-Shot Text-to-Speech with Masked Generative Codec Transformer
Yuancheng Wang, Haoyue Zhan, Liwei Liu et al.
ICLR 2025, poster, arXiv:2409.00750
156 citations