Wei Zhai
23
Papers
58
Total Citations
Papers (23)
Improved Video VAE for Latent Video Diffusion Model
CVPR 2025arXiv
19
citations
Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning
CVPR 2025
18
citations
MMAR: Towards Lossless Multi-Modal Auto-Regressive Probabilistic Modeling
CVPR 2025
13
citations
Bidirectional Progressive Transformer for Interaction Intention Anticipation
ECCV 2024
8
citations
MATE: Motion-Augmented Temporal Consistency for Event-based Point Tracking
ICCV 2025
0
citations
EMoTive: Event-guided Trajectory Modeling for 3D Motion Estimation
ICCV 2025
0
citations
Hypercorrelation Evolution for Video Class-Incremental Learning
AAAI 2024
0
citations
LEMON: Learning 3D Human-Object Interaction Relation from 2D Images
CVPR 2024
0
citations
Deep Structure-Revealed Network for Texture Recognition
CVPR 2020
0
citations
Self-Promoted Prototype Refinement for Few-Shot Class-Incremental Learning
CVPR 2021
0
citations
Background Activation Suppression for Weakly Supervised Object Localization
CVPR 2022arXiv
0
citations
Self-Sustaining Representation Expansion for Non-Exemplar Class-Incremental Learning
CVPR 2022arXiv
0
citations
Learning Affordance Grounding From Exocentric Images
CVPR 2022arXiv
0
citations
Leverage Interactive Affinity for Affordance Learning
CVPR 2023
0
citations
Uncertainty-Aware Optimal Transport for Semantically Coherent Out-of-Distribution Detection
CVPR 2023arXiv
0
citations
Deep Multiple-Attribute-Perceived Network for Real-World Texture Recognition
ICCV 2019
0
citations
Spatial-Aware Token for Weakly Supervised Object Localization
ICCV 2023arXiv
0
citations
Grounding 3D Object Affordance from 2D Interactions in Images
ICCV 2023arXiv
0
citations
Efficient Test-time Adaptive Object Detection via Sensitivity-Guided Pruning
CVPR 2025
0
citations
GREAT: Geometry-Intention Collaborative Inference for Open-Vocabulary 3D Object Affordance Grounding
CVPR 2025
0
citations
SIGMAN: Scaling 3D Human Gaussian Generation with Millions of Assets
ICCV 2025
0
citations
HERO: Human Reaction Generation from Videos
ICCV 2025
0
citations
Exploring Figure-Ground Assignment Mechanism in Perceptual Organization
NeurIPS 2022
0
citations