Kecheng Zheng
12
Papers
324
Total Citations
Papers (12)
Paying More Attention to Images: A Training-Free Method for Alleviating Hallucination in LVLMs
ECCV 2024
121
citations
Language-Image Pre-training with Long Captions
ECCV 2024
63
citations
Animate-X: Universal Character Image Animation with Enhanced Motion Representation
ICLR 2025
59
citations
Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning
CVPR 2025
18
citations
Mimir: Improving Video Diffusion Models for Precise Text Understanding
CVPR 2025
16
citations
Exploring Sparse MoE in GANs for Text-conditioned Image Synthesis
CVPR 2025
15
citations
MotionStone: Decoupled Motion Intensity Modulation with Diffusion Transformer for Image-to-Video Generation
CVPR 2025
13
citations
Aligned Better, Listen Better for Audio-Visual Large Language Models
ICLR 2025
8
citations
Contextual AD Narration with Interleaved Multimodal Sequence
CVPR 2025
7
citations
Learning Visual Generative Priors without Text
CVPR 2025
4
citations
CoDeF: Content Deformation Fields for Temporally Consistent Video Processing
CVPR 2024
0
citations
CrossMAE: Cross-Modality Masked Autoencoders for Region-Aware Audio-Visual Pre-Training
CVPR 2024
0
citations