Wenhao Chai
14
Papers
649
Total Citations
Papers (14)
MovieChat: From Dense Token to Sparse Memory for Long Video Understanding
CVPR 2024
457
citations
AuroraCap: Efficient, Performant Video Detailed Captioning and a New Benchmark
ICLR 2025arXiv
102
citations
Learning Diffusion Texture Priors for Image Restoration
CVPR 2024
39
citations
PAD: Personalized Alignment of LLMs at Decoding-time
ICLR 2025
35
citations
RT-Pose: A 4D Radar-Tensor based 3D Human Pose Estimation and Localization Benchmark
ECCV 2024
11
citations
Zero-shot 3D Question Answering via Voxel-based Dynamic Token Compression
CVPR 2025
5
citations
Global Adaptation Meets Local Generalization: Unsupervised Domain Adaptation for 3D Human Pose Estimation
ICCV 2023arXiv
0
citations
MonoTAKD: Teaching Assistant Knowledge Distillation for Monocular 3D Object Detection
CVPR 2025
0
citations
StableVideo: Text-driven Consistency-aware Diffusion Video Editing
ICCV 2023arXiv
0
citations
Science-T2I: Addressing Scientific Illusions in Image Synthesis
CVPR 2025
0
citations
Bringing RNNs Back to Efficient Open-Ended Video Understanding
ICCV 2025
0
citations
AGLLDiff: Guiding Diffusion Models Towards Unsupervised Training-free Real-world Low-light Image Enhancement
AAAI 2025
0
citations
PromptHaze: Prompting Real-world Dehazing via Depth Anything Model
AAAI 2025
0
citations
UniAP: Towards Universal Animal Perception in Vision via Few-Shot Learning
AAAI 2024
0
citations