Wei Zhai

23
Papers
58
Total Citations

Papers (23)

Improved Video VAE for Latent Video Diffusion Model

CVPR 2025, arXiv
19
citations

Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning

CVPR 2025
18
citations

MMAR: Towards Lossless Multi-Modal Auto-Regressive Probabilistic Modeling

CVPR 2025
13
citations

Bidirectional Progressive Transformer for Interaction Intention Anticipation

ECCV 2024
8
citations

MATE: Motion-Augmented Temporal Consistency for Event-based Point Tracking

ICCV 2025
0
citations

EMoTive: Event-guided Trajectory Modeling for 3D Motion Estimation

ICCV 2025
0
citations

Hypercorrelation Evolution for Video Class-Incremental Learning

AAAI 2024
0
citations

LEMON: Learning 3D Human-Object Interaction Relation from 2D Images

CVPR 2024
0
citations

Deep Structure-Revealed Network for Texture Recognition

CVPR 2020
0
citations

Self-Promoted Prototype Refinement for Few-Shot Class-Incremental Learning

CVPR 2021
0
citations

Background Activation Suppression for Weakly Supervised Object Localization

CVPR 2022, arXiv
0
citations

Self-Sustaining Representation Expansion for Non-Exemplar Class-Incremental Learning

CVPR 2022, arXiv
0
citations

Learning Affordance Grounding From Exocentric Images

CVPR 2022, arXiv
0
citations

Leverage Interactive Affinity for Affordance Learning

CVPR 2023
0
citations

Uncertainty-Aware Optimal Transport for Semantically Coherent Out-of-Distribution Detection

CVPR 2023, arXiv
0
citations

Deep Multiple-Attribute-Perceived Network for Real-World Texture Recognition

ICCV 2019
0
citations

Spatial-Aware Token for Weakly Supervised Object Localization

ICCV 2023, arXiv
0
citations

Grounding 3D Object Affordance from 2D Interactions in Images

ICCV 2023, arXiv
0
citations

Efficient Test-time Adaptive Object Detection via Sensitivity-Guided Pruning

CVPR 2025
0
citations

GREAT: Geometry-Intention Collaborative Inference for Open-Vocabulary 3D Object Affordance Grounding

CVPR 2025
0
citations

SIGMAN: Scaling 3D Human Gaussian Generation with Millions of Assets

ICCV 2025
0
citations

HERO: Human Reaction Generation from Videos

ICCV 2025
0
citations

Exploring Figure-Ground Assignment Mechanism in Perceptual Organization

NeurIPS 2022
0
citations