Xiaoqin Zhang
16 papers, 28 total citations
Papers (16)
VSFormer: Visual-Spatial Fusion Transformer for Correspondence Pruning
AAAI 2024 (arXiv); 15 citations
Weakly Supervised Monocular 3D Detection with a Single-View Image
CVPR 2024 (arXiv); 12 citations
PCR-GS: COLMAP-Free 3D Gaussian Splatting via Pose Co-Regularizations
ICCV 2025 (arXiv); 1 citation
Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention
CVPR 2025 (arXiv); 0 citations
Spatial Preference Rewarding for MLLMs Spatial Understanding
ICCV 2025 (arXiv); 0 citations
Face Retouching with Diffusion Data Generation and Spectral Restorement
ICCV 2025; 0 citations
SMSTracker: Tri-path Score Mask Sigma Fusion for Multi-Modal Tracking
ICCV 2025; 0 citations
PacGDC: Label-Efficient Generalizable Depth Completion with Projection Ambiguity and Consistency
ICCV 2025 (arXiv); 0 citations
SGFormer: Semantic-Geometry Fusion Transformer for Multi-modal 3D Panoptic Segmentation
AAAI 2025; 0 citations
Masked AutoDecoder is Effective Multi-Task Vision Generalist
CVPR 2024 (arXiv); 0 citations
FAC: 3D Representation Learning via Foreground Aware Feature Contrast
CVPR 2023 (arXiv); 0 citations
DA-DETR: Domain Adaptive Detection Transformer With Information Fusion
CVPR 2023; 0 citations
Towards Efficient Use of Multi-Scale Features in Transformer-Based Object Detectors
CVPR 2023 (arXiv); 0 citations
UniDAformer: Unified Domain Adaptive Panoptic Segmentation Transformer via Hierarchical Mask Calibration
CVPR 2023 (arXiv); 0 citations
Pose-Free Neural Radiance Fields via Implicit Pose Regularization
ICCV 2023 (arXiv); 0 citations
WaveNeRF: Wavelet-based Generalizable Neural Radiance Fields
ICCV 2023 (arXiv); 0 citations