Xiaoda Yang
3
Papers
136
Total Citations
Papers (3)
WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling
ICLR 2025arXiv
125
citations
Diff-Prompt: Diffusion-driven Prompt Generator with Mask Supervision
ICLR 2025
6
citations
Storynizor: Consistent Story Generation via Inter-Frame Synchronized and Shuffled ID Injection
AAAI 2025
5
citations