Minghui Fang
5
Papers
135
Total Citations
Papers (5)
WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling
ICLR 2025, arXiv
125
citations
OmniSep: Unified Omni-Modality Sound Separation with Query-Mixup
ICLR 2025
10
citations
Open-set Cross Modal Generalization via Multimodal Unified Representation
ICCV 2025
0
citations
Zero-resource Hallucination Detection for Text Generation via Graph-based Contextual Knowledge Triples Modeling
AAAI 2025
0
citations
Speech Watermarking with Discrete Intermediate Representations
AAAI 2025
0
citations