Wenzhao Zheng

29
Papers
243
Total Citations

Papers (29)

SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference

ICML 2025
190
citations

DeSiRe-GS: 4D Street Gaussians for Static-Dynamic Decomposition and Surface Reconstruction for Urban Driving Scenes

CVPR 2025arXiv
29
citations

EmbodiedOcc: Embodied 3D Occupancy Prediction for Vision-based Online Scene Understanding

ICCV 2025
16
citations

UniDrive: Towards Universal Driving Perception Across Camera Configurations

ICLR 2025
4
citations

Path Choice Matters for Clear Attributions in Path Methods

ICLR 2024
4
citations

PlaneRAS: Learning Planar Primitives for 3D Plane Recovery

ICCV 2025
0
citations

D3QE: Learning Discrete Distribution Discrepancy-aware Quantization Error for Autoregressive-Generated Image Detection

ICCV 2025
0
citations

Authentic 4D Driving Simulation with a Video Generation Model

ICCV 2025
0
citations

SpectralAR: Spectral Autoregressive Visual Generation

ICCV 2025
0
citations

LowRankOcc: Tensor Decomposition and Low-Rank Recovery for Vision-based 3D Semantic Occupancy Prediction

CVPR 2024
0
citations

SelfOcc: Self-Supervised Vision-Based 3D Occupancy Prediction

CVPR 2024
0
citations

Segment Any Motion in Videos

CVPR 2025
0
citations

Deep Adversarial Metric Learning

CVPR 2018arXiv
0
citations

Hardness-Aware Deep Metric Learning

CVPR 2019
0
citations

Deep Metric Learning via Adaptive Learnable Assessment

CVPR 2020
0
citations

Deep Compositional Metric Learning

CVPR 2021
0
citations

Dimension Embeddings for Monocular 3D Object Detection

CVPR 2022
0
citations

Attributable Visual Similarity Learning

CVPR 2022arXiv
0
citations

Tri-Perspective View for Vision-Based 3D Semantic Occupancy Prediction

CVPR 2023arXiv
0
citations

Deep Factorized Metric Learning

CVPR 2023
0
citations

Deep Relational Metric Learning

ICCV 2021arXiv
0
citations

OPERA: Omni-Supervised Representation Learning with Hierarchical Supervisions

ICCV 2023arXiv
0
citations

Token-Label Alignment for Vision Transformers

ICCV 2023arXiv
0
citations

SurroundOcc: Multi-camera 3D Occupancy Prediction for Autonomous Driving

ICCV 2023arXiv
0
citations

Structural Deep Metric Learning for Room Layout Estimation

ECCV 2020
0
citations

GaussianWorld: Gaussian World Model for Streaming 3D Occupancy Prediction

CVPR 2025
0
citations

Dynamic Metric Learning with Cross-Level Concept Distillation

ECCV 2022
0
citations

GaussianFormer-2: Probabilistic Gaussian Superposition for Efficient 3D Occupancy Prediction

CVPR 2025
0
citations

Learning Counterfactually Decoupled Attention for Open-World Model Attribution

ICCV 2025
0
citations