Jingyi Zhang
16
Papers
231
Total Citations
Papers (16)
R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization
ICCV 2025
206
citations
RoomTour3D: Geometry-Aware Video-Instruction Tuning for Embodied Navigation
CVPR 2025
12
citations
MMReason: An Open-Ended Multi-Modal Multi-Step Reasoning Benchmark for MLLMs Toward AGI
ICCV 2025
7
citations
Neighborhood-Enhanced 3D Human Pose Estimation with Monocular LiDAR in Long-Range Outdoor Scenes
AAAI 2024
6
citations
LiDARCap: Long-Range Marker-Less 3D Human Motion Capture With LiDAR Point Clouds
CVPR 2022 (arXiv)
0
citations
Indescribable Multi-Modal Spatial Evaluator
CVPR 2023 (arXiv)
0
citations
Towards Efficient Use of Multi-Scale Features in Transformer-Based Object Detectors
CVPR 2023 (arXiv)
0
citations
UniDAformer: Unified Domain Adaptive Panoptic Segmentation Transformer via Hierarchical Mask Calibration
CVPR 2023 (arXiv)
0
citations
Black-Box Unsupervised Domain Adaptation with Bi-Directional Atkinson-Shiffrin Memory
ICCV 2023 (arXiv)
0
citations
Learning Series-Parallel Lookup Tables for Efficient Image Super-Resolution
ECCV 2022
0
citations
DA-DETR: Domain Adaptive Detection Transformer With Information Fusion
CVPR 2023
0
citations
Building Detail-Sensitive Semantic Segmentation Networks With Polynomial Pooling
CVPR 2019
0
citations
Probabilistic Modeling of Semantic Ambiguity for Scene Graph Generation
CVPR 2021 (arXiv)
0
citations
Spectral Unsupervised Domain Adaptation for Visual Recognition
CVPR 2022 (arXiv)
0
citations
Large-scale optimal transport map estimation using projection pursuit
NeurIPS 2019
0
citations
Sufficient dimension reduction for classification using principal optimal transport direction
NeurIPS 2020
0
citations