by Koichi Saito Papers
2 papers found
SoundCTM: Unifying Score-based and Consistency Models for Full-band Text-to-Sound Generation
Koichi Saito, Dongjun Kim, Takashi Shibuya et al.
ICLR 2025posterarXiv:2405.18503
9
citations
TalkCuts: A Large-Scale Dataset for Multi-Shot Human Speech Video Generation
Jiaben Chen, Zixin Wang, AILING ZENG et al.
NeurIPS 2025posterarXiv:2510.07249
3
citations