Jingyi Zhang

16
Papers
231
Total Citations

Papers (16)

R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization

ICCV 2025
206
citations

RoomTour3D: Geometry-Aware Video-Instruction Tuning for Embodied Navigation

CVPR 2025
12
citations

MMReason: An Open-Ended Multi-Modal Multi-Step Reasoning Benchmark for MLLMs Toward AGI

ICCV 2025
7
citations

Neighborhood-Enhanced 3D Human Pose Estimation with Monocular LiDAR in Long-Range Outdoor Scenes

AAAI 2024
6
citations

LiDARCap: Long-Range Marker-Less 3D Human Motion Capture With LiDAR Point Clouds

CVPR 2022arXiv
0
citations

Indescribable Multi-Modal Spatial Evaluator

CVPR 2023arXiv
0
citations

Towards Efficient Use of Multi-Scale Features in Transformer-Based Object Detectors

CVPR 2023arXiv
0
citations

UniDAformer: Unified Domain Adaptive Panoptic Segmentation Transformer via Hierarchical Mask Calibration

CVPR 2023arXiv
0
citations

Black-Box Unsupervised Domain Adaptation with Bi-Directional Atkinson-Shiffrin Memory

ICCV 2023arXiv
0
citations

Learning Series-Parallel Lookup Tables for Efficient Image Super-Resolution

ECCV 2022
0
citations

DA-DETR: Domain Adaptive Detection Transformer With Information Fusion

CVPR 2023
0
citations

Building Detail-Sensitive Semantic Segmentation Networks With Polynomial Pooling

CVPR 2019
0
citations

Probabilistic Modeling of Semantic Ambiguity for Scene Graph Generation

CVPR 2021arXiv
0
citations

Spectral Unsupervised Domain Adaptation for Visual Recognition

CVPR 2022arXiv
0
citations

Large-scale optimal transport map estimation using projection pursuit

NeurIPS 2019
0
citations

Sufficient dimension reduction for classification using principal optimal transport direction

NeurIPS 2020
0
citations