Xu Tan
8
Papers
231
Total Citations
1
Affiliations
Affiliations
Microsoft Research Asia
Papers (8)
PromptTTS 2: Describing and Generating Voices with Text Prompt
ICLR 2024
70
citations
Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model
AAAI 2025
65
citations
GAIA: Zero-shot Talking Avatar Generation
ICLR 2024
46
citations
VidMuse: A Simple Video-to-Music Generation Framework with Long-Short-Term Modeling
CVPR 2025
31
citations
InstructAvatar: Text-Guided Emotion and Motion Control for Avatar Generation
AAAI 2025
19
citations
The Best of Both Worlds: Integrating Language Models and Diffusion Models for Video Generation
ICCV 2025
0
citations
UniAudio: Towards Universal Audio Generation with Large Language Models
ICML 2024
0
citations
NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models
ICML 2024
0
citations