Jianbing Shen

71
Papers
109
Total Citations

Papers (71)

IS-Fusion: Instance-Scene Collaborative Fusion for Multimodal 3D Object Detection

CVPR 2024
81
citations

OLiDM: Object-aware LiDAR Diffusion Models for Autonomous Driving

AAAI 2025
12
citations

RepVF: A Unified Vector Fields Representation for Multi-task 3D Perception

ECCV 2024
10
citations

Semantic Causality-Aware Vision-Based 3D Occupancy Prediction

ICCV 2025arXiv
3
citations

RAGNet: Large-scale Reasoning-based Affordance Segmentation Benchmark towards General Grasping

ICCV 2025
3
citations

ALOcc: Adaptive Lifting-based 3D Semantic Occupancy and Cost Volume-based Flow Predictions

ICCV 2025
0
citations

DME-Driver: Integrating Human Decision Logic and 3D Scene Perception in Autonomous Driving

AAAI 2025
0
citations

Language Prompt for Autonomous Driving

AAAI 2025
0
citations

DI-V2X: Learning Domain-Invariant Representation for Vehicle-Infrastructure Collaborative 3D Object Detection

AAAI 2024
0
citations

Fine-Grained Distillation for Long Document Retrieval

AAAI 2024
0
citations

Leveraging Frame Affinity for sRGB-to-RAW Video De-rendering

CVPR 2024
0
citations

Saliency-Aware Geodesic Video Object Segmentation

CVPR 2015
0
citations

Hyperparameter Optimization for Tracking With Continuous Deep Q-Learning

CVPR 2018
0
citations

Salient Object Detection Driven by Fixation Prediction

CVPR 2018
0
citations

Attentive Fashion Grammar Network for Fashion Landmark Detection and Clothing Category Classification

CVPR 2018
0
citations

Revisiting Video Saliency: A Large-Scale Benchmark and a New Model

CVPR 2018arXiv
0
citations

Striking the Right Balance With Uncertainty

CVPR 2019
0
citations

Salient Object Detection With Pyramid Attention and Salient Edges

CVPR 2019
0
citations

Learning Unsupervised Video Object Segmentation Through Visual Attention

CVPR 2019
0
citations

See More, Know More: Unsupervised Video Object Segmentation With Co-Attention Siamese Networks

CVPR 2019
0
citations

An Iterative and Cooperative Top-Down and Bottom-Up Inference Network for Salient Object Detection

CVPR 2019
0
citations

Shifting More Attention to Video Salient Object Detection

CVPR 2019
0
citations

Camouflaged Object Detection

CVPR 2020
0
citations

Cascaded Human-Object Interaction Recognition

CVPR 2020arXiv
0
citations

Self-Learning With Rectification Strategy for Human Parsing

CVPR 2020arXiv
0
citations

Probabilistic Structural Latent Representation for Unsupervised Embedding

CVPR 2020
0
citations

Hierarchical Human Parsing With Typed Part-Relation Reasoning

CVPR 2020arXiv
0
citations

A Unified Object Motion and Affinity Model for Online Multi-Object Tracking

CVPR 2020arXiv
0
citations

Learning Video Object Segmentation From Unlabeled Videos

CVPR 2020arXiv
0
citations

NETNet: Neighbor Erasing and Transferring Network for Better Single Shot Object Detection

CVPR 2020arXiv
0
citations

Multi-Mutual Consistency Induced Transfer Subspace Learning for Human Motion Segmentation

CVPR 2020
0
citations

LiDAR-Based Online 3D Video Object Detection With Graph-Based Message Passing and Spatiotemporal Transformer Attention

CVPR 2020arXiv
0
citations

Structured Scene Memory for Vision-Language Navigation

CVPR 2021arXiv
0
citations

Video Object Segmentation Using Global and Instance Embedding Learning

CVPR 2021
0
citations

Face Forensics in the Wild

CVPR 2021arXiv
0
citations

Learning To Fuse Asymmetric Feature Maps in Siamese Trackers

CVPR 2021arXiv
0
citations

Tree Energy Loss: Towards Sparsely Annotated Semantic Segmentation

CVPR 2022arXiv
0
citations

Counterfactual Cycle-Consistent Learning for Instruction Following and Generation in Vision-Language Navigation

CVPR 2022arXiv
0
citations

Multi-Level Representation Learning With Semantic Alignment for Referring Video Object Segmentation

CVPR 2022
0
citations

A Graph Matching Perspective With Transformers on Video Instance Segmentation

CVPR 2022
0
citations

Weakly Supervised Monocular 3D Object Detection Using Multi-View Projection and Direction Consistency

CVPR 2023arXiv
0
citations

Referring Multi-Object Tracking

CVPR 2023arXiv
0
citations

Linearization to Nonlinear Learning for Visual Tracking

ICCV 2015
0
citations

Super-Trajectory for Video Segmentation

ICCV 2017arXiv
0
citations

Deep Cropping via Attention Box Prediction and Aesthetics Assessment

ICCV 2017arXiv
0
citations

Adversarial Defense by Restricting the Hidden Space of Deep Neural Networks

ICCV 2019
0
citations

Towards Bridging Semantic Gap to Improve Semantic Segmentation

ICCV 2019
0
citations

Human-Aware Motion Deblurring

ICCV 2019
0
citations

Learning Compositional Neural Information Fusion for Human Parsing

ICCV 2019
0
citations

Gaussian Affinity for Max-Margin Class Imbalanced Learning

ICCV 2019
0
citations

Zero-Shot Video Object Segmentation via Attentive Graph Neural Networks

ICCV 2019
0
citations

Cross-Modality Person Re-Identification via Modality Confusion and Center Aggregation

ICCV 2021
0
citations

Full-Duplex Strategy for Video Object Segmentation

ICCV 2021arXiv
0
citations

Self-Supervised Monocular Depth Estimation by Direction-aware Cumulative Convolution Network

ICCV 2023arXiv
0
citations

OnlineRefer: A Simple Online Baseline for Referring Video Object Segmentation

ICCV 2023arXiv
0
citations

Video Object Segmentation with Episodic Graph Memory Networks

ECCV 2020
0
citations

Weakly Supervised 3D Object Detection from Lidar Point Cloud

ECCV 2020
0
citations

Dynamic Dual-Attentive Aggregation Learning for Visible-Infrared Person Re-Identification

ECCV 2020
0
citations

CLNet: A Compact Latent Network for Fast Adjusting Siamese Trackers

ECCV 2020
0
citations

Active Visual Information Gathering for Vision-Language Navigation

ECCV 2020
0
citations

Modality Synergy Complement Learning with Cascaded Aggregation for Visible-Infrared Person Re-identification

ECCV 2022
0
citations

Rethinking Clustering-Based Pseudo-Labeling for Unsupervised Meta-Learning

ECCV 2022
0
citations

Learning Disentanglement with Decoupled Labels for Vision-Language Navigation

ECCV 2022
0
citations

BRNet: Exploring Comprehensive Features for Monocular Depth Estimation

ECCV 2022
0
citations

Semi-Supervised 3D Object Detection with Proficient Teachers

ECCV 2022
0
citations

LOGICZSL: Exploring Logic-induced Representation for Compositional Zero-shot Learning

CVPR 2025
0
citations

ProposalContrast: Unsupervised Pre-training for LiDAR-Based 3D Object Detection

ECCV 2022
0
citations

DrivingSphere: Building a High-fidelity 4D World for Closed-loop Simulation

CVPR 2025
0
citations

Decoupling Fine Detail and Global Geometry for Compressed Depth Map Super-Resolution

CVPR 2025
0
citations

Rethinking Temporal Fusion with a Unified Gradient Descent View for 3D Semantic Occupancy Prediction

CVPR 2025
0
citations

DC-ControlNet: Decoupling Inter- and Intra-Element Conditions in Image Generation with Diffusion Models

ICCV 2025
0
citations