Shengpeng Ji
6
Papers
209
Total Citations
Papers (6)
WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling
ICLR 2025arXiv
125
citations
Mega-TTS 2: Boosting Prompting Mechanisms for Zero-Shot Speech Synthesis
ICLR 2024
74
citations
OmniSep: Unified Omni-Modality Sound Separation with Query-Mixup
ICLR 2025
10
citations
SpatialCLIP: Learning 3D-aware Image Representations from Spatially Discriminative Language
CVPR 2025
0
citations
Open-set Cross Modal Generalization via Multimodal Unified Representation
ICCV 2025
0
citations
Speech Watermarking with Discrete Intermediate Representations
AAAI 2025
0
citations