Chenliang Xu
14
Papers
138
Total Citations
Papers (14)
V2Xum-LLM: Cross-Modal Video Summarization with Temporal Prompt Instruction Tuning
AAAI 2025
47
citations
Empowering LLMs with Pseudo-Untrimmed Videos for Audio-Visual Temporal Understanding
AAAI 2025
24
citations
VidComposition: Can MLLMs Analyze Compositions in Compiled Videos?
CVPR 2025
16
citations
One Forward is Enough for Neural Network Training via Likelihood Ratio Method
ICLR 2024
14
citations
CaRDiff: Video Salient Object Ranking Chain of Thought Reasoning for Saliency Prediction with Diffusion
AAAI 2025
13
citations
Discover and Mitigate Multiple Biased Subgroups in Image Classifiers
CVPR 2024
12
citations
MMPerspective: Do MLLMs Understand Perspective? A Comprehensive Benchmark for Perspective Perception, Reasoning, and Robustness
NeurIPS 2025
4
citations
Learning to Highlight Audio by Watching Movies
CVPR 2025arXiv
4
citations
ZeroSep: Separate Anything in Audio with Zero Training
NeurIPS 2025
3
citations
Targeted Forgetting of Image Subgroups in CLIP Models
CVPR 2025
1
citations
Unveiling Visual Perception in Language Models: An Attention Head Analysis Approach
CVPR 2025
0
citations
GestureLSM: Latent Shortcut based Co-Speech Gesture Generation with Spatial-Temporal Modeling
ICCV 2025
0
citations
Learning to Transform Dynamically for Better Adversarial Transferability
CVPR 2024
0
citations
π-AVAS: Can Physics-Integrated Audio-Visual Modeling Boost Neural Acoustic Synthesis?
ICCV 2025
0
citations