Wei Zhai
12
Papers
58
Total Citations
Papers (12)
Improved Video VAE for Latent Video Diffusion Model
CVPR 2025arXiv
19
citations
Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning
CVPR 2025
18
citations
MMAR: Towards Lossless Multi-Modal Auto-Regressive Probabilistic Modeling
CVPR 2025
13
citations
Bidirectional Progressive Transformer for Interaction Intention Anticipation
ECCV 2024arXiv
8
citations
MATE: Motion-Augmented Temporal Consistency for Event-based Point Tracking
ICCV 2025arXiv
0
citations
EMoTive: Event-guided Trajectory Modeling for 3D Motion Estimation
ICCV 2025
0
citations
Hypercorrelation Evolution for Video Class-Incremental Learning
AAAI 2024
0
citations
Efficient Test-time Adaptive Object Detection via Sensitivity-Guided Pruning
CVPR 2025
0
citations
LEMON: Learning 3D Human-Object Interaction Relation from 2D Images
CVPR 2024
0
citations
GREAT: Geometry-Intention Collaborative Inference for Open-Vocabulary 3D Object Affordance Grounding
CVPR 2025
0
citations
SIGMAN: Scaling 3D Human Gaussian Generation with Millions of Assets
ICCV 2025
0
citations
HERO: Human Reaction Generation from Videos
ICCV 2025
0
citations