Xuxin Cheng

16
Papers
113
Total Citations

Papers (16)

PolyVoice: Language Models for Speech to Speech Translation

ICLR 2024arXiv
29
citations

DisPose: Disentangling Pose Guidance for Controllable Human Image Animation

ICLR 2025arXiv
23
citations

Exploiting Auxiliary Caption for Video Grounding

AAAI 2024arXiv
14
citations

Retrieval is Accurate Generation

ICLR 2024arXiv
11
citations

Uncertainty-aware sign language video retrieval with probability distribution modeling

ECCV 2024arXiv
10
citations

KDProR: A Knowledge-Decoupling Probabilistic Framework for Video-Text Retrieval

ECCV 2024
10
citations

Embracing Language Inclusivity and Diversity in CLIP through Continual Language Learning

AAAI 2024arXiv
9
citations

UniCoTT: A Unified Framework for Structural Chain-of-Thought Distillation

ICLR 2025
7
citations

CountLLM: Towards Generalizable Repetitive Action Counting via Large Language Model

CVPR 2025arXiv
0
citations

EXCGEC: A Benchmark for Edit-Wise Explainable Chinese Grammatical Error Correction

AAAI 2025arXiv
0
citations

Towards Explainable Joint Models via Information Theory for Multiple Intent Detection and Slot Filling

AAAI 2024
0
citations

Aligner$^2$: Enhancing Joint Multiple Intent Detection and Slot Filling via Adjustive and Forced Cross-Task Alignment

AAAI 2024
0
citations

Towards Multi-Intent Spoken Language Understanding via Hierarchical Attention and Optimal Transport

AAAI 2024
0
citations

Unify, Align and Refine: Multi-Level Semantic Alignment for Radiology Report Generation

ICCV 2023arXiv
0
citations

G2L: Semantically Aligned and Uniform Video Grounding via Geodesic and Game Theory

ICCV 2023arXiv
0
citations

Discover and Align Taxonomic Context Priors for Open-world Semi-Supervised Learning

NeurIPS 2023
0
citations