Zhaoxiang Zhang

26
Papers
218
Total Citations

Papers (26)

OmniBench: Towards The Future of Universal Omni-Language Models

NeurIPS 2025
51
citations

DrivingGPT: Unifying Driving World Modeling and Planning with Multi-modal Autoregressive Transformers

ICCV 2025
44
citations

FreeVS: Generative View Synthesis on Free Driving Trajectory

ICLR 2025
34
citations

Ross3D: Reconstructive Visual Instruction Tuning with 3D-Awareness

ICCV 2025
28
citations

DexVLG: Dexterous Vision-Language-Grasp Model at Scale

ICCV 2025
16
citations

Robust Depth Enhancement via Polarization Prompt Fusion Tuning

CVPR 2024
11
citations

MemoNav: Working Memory Model for Visual Navigation

CVPR 2024
10
citations

RCL: Reliable Continual Learning for Unified Failure Detection

CVPR 2024
6
citations

DriveDPO: Policy Learning via Safety DPO For End-to-End Autonomous Driving

NeurIPS 2025
6
citations

KORGym: A Dynamic Game Platform for LLM Reasoning Evaluation

NeurIPS 2025
4
citations

FIRM: Flexible Interactive Reflection ReMoval

AAAI 2025
3
citations

Point-supervised Panoptic Segmentation via Estimating Pseudo Labels from Learnable Distance

ECCV 2024
2
citations

FlexDrive: Toward Trajectory Flexibility in Driving Scene Gaussian Splatting Reconstruction and Rendering

CVPR 2025
2
citations

MCOP: Multi-UAV Collaborative Occupancy Prediction

ICCV 2025arXiv
1
citations

PanoOcc: Unified Occupancy Representation for Camera-based 3D Panoptic Segmentation

CVPR 2024
0
citations

End-to-End Driving with Online Trajectory Evaluation via BEV World Model

ICCV 2025
0
citations

UIPro: Unleashing Superior Interaction Capability For GUI Agents

ICCV 2025
0
citations

Images as Noisy Labels: Unleashing the Potential of the Diffusion Model for Open-Vocabulary Semantic Segmentation

ICCV 2025
0
citations

LayerAnimate: Layer-level Control for Animation

ICCV 2025
0
citations

SceneX: Procedural Controllable Large-Scale Scene Generation

AAAI 2025
0
citations

Fully Data-Driven Pseudo Label Estimation for Pointly-Supervised Panoptic Segmentation

AAAI 2024
0
citations

HardMo: A Large-Scale Hardcase Dataset for Motion Capture

CVPR 2024
0
citations

Continual Forgetting for Pre-trained Vision Models

CVPR 2024
0
citations

Driving into the Future: Multiview Visual Forecasting and Planning with World Model for Autonomous Driving

CVPR 2024
0
citations

Enhancing Visual Continual Learning with Language-Guided Supervision

CVPR 2024
0
citations

FreeSim: Toward Free-viewpoint Camera Simulation in Driving Scenes

CVPR 2025
0
citations