Zhaoxiang Zhang
26
Papers
218
Total Citations
Papers (26)
OmniBench: Towards The Future of Universal Omni-Language Models
NeurIPS 2025
51
citations
DrivingGPT: Unifying Driving World Modeling and Planning with Multi-modal Autoregressive Transformers
ICCV 2025
44
citations
FreeVS: Generative View Synthesis on Free Driving Trajectory
ICLR 2025
34
citations
Ross3D: Reconstructive Visual Instruction Tuning with 3D-Awareness
ICCV 2025
28
citations
DexVLG: Dexterous Vision-Language-Grasp Model at Scale
ICCV 2025
16
citations
Robust Depth Enhancement via Polarization Prompt Fusion Tuning
CVPR 2024
11
citations
MemoNav: Working Memory Model for Visual Navigation
CVPR 2024
10
citations
RCL: Reliable Continual Learning for Unified Failure Detection
CVPR 2024
6
citations
DriveDPO: Policy Learning via Safety DPO For End-to-End Autonomous Driving
NeurIPS 2025
6
citations
KORGym: A Dynamic Game Platform for LLM Reasoning Evaluation
NeurIPS 2025
4
citations
FIRM: Flexible Interactive Reflection ReMoval
AAAI 2025
3
citations
Point-supervised Panoptic Segmentation via Estimating Pseudo Labels from Learnable Distance
ECCV 2024
2
citations
FlexDrive: Toward Trajectory Flexibility in Driving Scene Gaussian Splatting Reconstruction and Rendering
CVPR 2025
2
citations
MCOP: Multi-UAV Collaborative Occupancy Prediction
ICCV 2025 (arXiv)
1
citations
PanoOcc: Unified Occupancy Representation for Camera-based 3D Panoptic Segmentation
CVPR 2024
0
citations
End-to-End Driving with Online Trajectory Evaluation via BEV World Model
ICCV 2025
0
citations
UIPro: Unleashing Superior Interaction Capability For GUI Agents
ICCV 2025
0
citations
Images as Noisy Labels: Unleashing the Potential of the Diffusion Model for Open-Vocabulary Semantic Segmentation
ICCV 2025
0
citations
LayerAnimate: Layer-level Control for Animation
ICCV 2025
0
citations
SceneX: Procedural Controllable Large-Scale Scene Generation
AAAI 2025
0
citations
Fully Data-Driven Pseudo Label Estimation for Pointly-Supervised Panoptic Segmentation
AAAI 2024
0
citations
HardMo: A Large-Scale Hardcase Dataset for Motion Capture
CVPR 2024
0
citations
Continual Forgetting for Pre-trained Vision Models
CVPR 2024
0
citations
Driving into the Future: Multiview Visual Forecasting and Planning with World Model for Autonomous Driving
CVPR 2024
0
citations
Enhancing Visual Continual Learning with Language-Guided Supervision
CVPR 2024
0
citations
FreeSim: Toward Free-viewpoint Camera Simulation in Driving Scenes
CVPR 2025
0
citations