Jing Zhang

109
Papers
1,784
Total Citations
2
Affiliations

Affiliations

Hefei University of TechnologyGent University-imec

Papers (109)

T2I-Adapter: Learning Adapters to Dig Out More Controllable Ability for Text-to-Image Diffusion

AAAI 2024arXiv
1,423
citations

SurgicalSAM: Efficient Class Promptable Surgical Instrument Segmentation

AAAI 2024arXiv
110
citations

A Semi-supervised Nighttime Dehazing Baseline with Spatial-Frequency Aware and Realistic Brightness Constraint

CVPR 2024
52
citations

CityWalker: Learning Embodied Urban Navigation from Web-Scale Videos

CVPR 2025
25
citations

XLRS-Bench: Could Your Multimodal LLMs Understand Extremely Large Ultra-High-Resolution Remote Sensing Imagery?

CVPR 2025
24
citations

SimDistill: Simulated Multi-Modal Distillation for BEV 3D Object Detection

AAAI 2024arXiv
24
citations

Object-Aware Adaptive-Positivity Learning for Audio-Visual Question Answering

AAAI 2024arXiv
24
citations

Question Calibration and Multi-Hop Modeling for Temporal Question Answering

AAAI 2024arXiv
21
citations

IMPUS: Image Morphing with Perceptually-Uniform Sampling Using Diffusion Models

ICLR 2024
21
citations

Decomposing Semantic Shifts for Composed Image Retrieval

AAAI 2024arXiv
17
citations

LUWA Dataset: Learning Lithic Use-Wear Analysis on Microscopic Images

CVPR 2024
10
citations

RoMA: Scaling up Mamba-based Foundation Models for Remote Sensing

NeurIPS 2025
10
citations

CAD-Coder: Text-to-CAD Generation with Chain-of-Thought and Geometric Reward

NeurIPS 2025
9
citations

Probability Density Geodesics in Image Diffusion Latent Space

CVPR 2025
7
citations

MOL-Mamba: Enhancing Molecular Representation with Structural & Electronic Insights

AAAI 2025
6
citations

Adversarial Exploitation of Data Diversity Improves Visual Localization

ICCV 2025
1
citations

MOCID: Motion Context and Displacement Information Learning for Moving Infrared Small Target Detection

AAAI 2025
0
citations

Highly Imperceptible Black-Box Graph Injection Attacks with Reinforcement Learning

AAAI 2025
0
citations

Multi-Modality Affinity Inference for Weakly Supervised 3D Semantic Segmentation

AAAI 2024
0
citations

Data-Free Generalized Zero-Shot Learning

AAAI 2024arXiv
0
citations

Adversarial Purification with the Manifold Hypothesis

AAAI 2024
0
citations

Quantum-Inspired Neural Network with Runge-Kutta Method

AAAI 2024
0
citations

LaViP: Language-Grounded Visual Prompting

AAAI 2024
0
citations

ArGue: Attribute-Guided Prompt Tuning for Vision-Language Models

CVPR 2024
0
citations

SVGDreamer: Text Guided SVG Generation with Diffusion Model

CVPR 2024
0
citations

UniMix: Towards Domain Adaptive and Generalizable LiDAR Semantic Segmentation in Adverse Weather

CVPR 2024
0
citations

OxyGenerator: Reconstructing Global Ocean Deoxygenation Over a Century with Deep Learning

ICML 2024
0
citations

Latent Optimal Paths by Gumbel Propagation for Variational Bayesian Dynamic Programming

ICML 2024
0
citations

Joint Geometrical and Statistical Alignment for Visual Domain Adaptation

CVPR 2017arXiv
0
citations

Fast Haze Removal for Nighttime Image Using Maximum Reflectance Prior

CVPR 2017
0
citations

Importance Weighted Adversarial Nets for Partial Domain Adaptation

CVPR 2018arXiv
0
citations

Deep Unsupervised Saliency Detection: A Multiple Noisy Labeling Perspective

CVPR 2018arXiv
0
citations

MirrorGAN: Learning Text-To-Image Generation by Redescription

CVPR 2019
0
citations

Few-Shot Learning via Saliency-Guided Hallucination of Samples

CVPR 2019
0
citations

ShieldNets: Defending Against Adversarial Attacks Using Probabilistic Adversarial Robustness

CVPR 2019
0
citations

UC-Net: Uncertainty Inspired RGB-D Saliency Detection via Conditional Variational Autoencoders

CVPR 2020
0
citations

Deep Degradation Prior for Low-Quality Image Classification

CVPR 2020
0
citations

Weakly-Supervised Salient Object Detection via Scribble Annotations

CVPR 2020arXiv
0
citations

Simultaneously Localize, Segment and Rank the Camouflaged Objects

CVPR 2021arXiv
0
citations

Weakly Supervised Video Salient Object Detection

CVPR 2021arXiv
0
citations

Uncertainty-Aware Joint Salient Object and Camouflaged Object Detection

CVPR 2021arXiv
0
citations

3DJCG: A Unified Framework for Joint Dense Captioning and Visual Grounding on 3D Point Clouds

CVPR 2022
0
citations

DearKD: Data-Efficient Early Knowledge Distillation for Vision Transformers

CVPR 2022arXiv
0
citations

GMFlow: Learning Optical Flow via Global Matching

CVPR 2022arXiv
0
citations

Recurrent Glimpse-Based Decoder for Detection With Transformer

CVPR 2022arXiv
0
citations

Learning Affordance Grounding From Exocentric Images

CVPR 2022arXiv
0
citations

ISNet: Shape Matters for Infrared Small Target Detection

CVPR 2022
0
citations

RU-Net: Regularized Unrolling Network for Scene Graph Generation

CVPR 2022
0
citations

FIBA: Frequency-Injection Based Backdoor Attack in Medical Image Analysis

CVPR 2022arXiv
0
citations

Dynamic Focus-Aware Positional Queries for Semantic Segmentation

CVPR 2023arXiv
0
citations

Explicit Boundary Guided Semi-Push-Pull Contrastive Learning for Supervised Anomaly Detection

CVPR 2023arXiv
0
citations

Leverage Interactive Affinity for Affordance Learning

CVPR 2023
0
citations

Modeling the Distributional Uncertainty for Salient Object Detection Models

CVPR 2023
0
citations

CLAMP: Prompt-Based Contrastive Learning for Connecting Language and Animal Pose

CVPR 2023arXiv
0
citations

DeepSolo: Let Transformer Decoder With Explicit Points Solo for Text Spotting

CVPR 2023arXiv
0
citations

Decoupling Learning and Remembering: A Bilevel Memory Framework With Knowledge Projection for Task-Incremental Learning

CVPR 2023
0
citations

Referring Image Matting

CVPR 2023arXiv
0
citations

Deep Multiple-Attribute-Perceived Network for Real-World Texture Recognition

ICCV 2019
0
citations

Out-of-Boundary View Synthesis Towards Full-Frame Video Stabilization

ICCV 2021arXiv
0
citations

RGB-D Saliency Detection via Cascaded Mutual Information Minimization

ICCV 2021
0
citations

LPFF: A Portrait Dataset for Face Generators Across Large Poses

ICCV 2023arXiv
0
citations

Domain Specified Optimization for Deployment Authorization

ICCV 2023
0
citations

Sensitivity-Aware Visual Parameter-Efficient Fine-Tuning

ICCV 2023arXiv
0
citations

RPEFlow: Multimodal Fusion of RGB-PointCloud-Event for Joint Optical Flow and Scene Flow Estimation

ICCV 2023
0
citations

Multimodal Variational Auto-encoder based Audio-Visual Segmentation

ICCV 2023
0
citations

ESSAformer: Efficient Transformer for Hyperspectral Image Super-resolution

ICCV 2023arXiv
0
citations

Model Calibration in Dense Classification with Adaptive Label Perturbation

ICCV 2023arXiv
0
citations

Learning Noise-Aware Encoder-Decoder from Noisy Labels by Alternating Back-Propagation for Saliency Detection

ECCV 2020
0
citations

MeshMAE: Masked Autoencoders for 3D Mesh Data Analysis

ECCV 2022
0
citations

Towards Data-Efficient Detection Transformers

ECCV 2022
0
citations

ReAct: Temporal Action Detection with Relational Queries

ECCV 2022
0
citations

FakeCLR: Exploring Contrastive Learning for Solving Latent Discontinuity in Data-Efficient GANs

ECCV 2022
0
citations

VSA: Learning Varied-Size Window Attention in Vision Transformers

ECCV 2022
0
citations

PolyphonicFormer: Unified Query Learning for Depth-Aware Video Panoptic Segmentation

ECCV 2022
0
citations

Improving RGB-D Point Cloud Registration by Learning Multi-Scale Local Linear Transformation

ECCV 2022
0
citations

RegionCL: Exploring Contrastive Region Pairs for Self-Supervised Representation Learning

ECCV 2022
0
citations

BMD: A General Class-Balanced Multicentric Dynamic Prototype Strategy for Source-Free Domain Adaptation

ECCV 2022
0
citations

Audio—Visual Segmentation

ECCV 2022
0
citations

"Towards Scale-Aware, Robust, and Generalizable Unsupervised Monocular Depth Estimation by Integrating IMU Motion Dynamics"

ECCV 2022
0
citations

"JPerceiver: Joint Perception Network for Depth, Pose and Layout Estimation in Driving Scenes"

ECCV 2022
0
citations

P2C: Self-Supervised Point Cloud Completion from Single Partial Clouds

ICCV 2023arXiv
0
citations

CARE Transformer: Mobile-Friendly Linear Visual Transformer via Decoupled Dual Interaction

CVPR 2025
0
citations

SAIST: Segment Any Infrared Small Target Model Guided by Contrastive Language-Image Pretraining

CVPR 2025
0
citations

Empowering LLMs to Understand and Generate Complex Vector Graphics

CVPR 2025
0
citations

Brain-Inspired Spiking Neural Networks for Energy-Efficient Object Detection

CVPR 2025
0
citations

Identifying and Mitigating Position Bias of Multi-image Vision-Language Models

CVPR 2025
0
citations

GARF: Learning Generalizable 3D Reassembly for Real-World Fractures

ICCV 2025
0
citations

Harnessing Massive Satellite Imagery with Efficient Masked Image Modeling

ICCV 2025
0
citations

Synergistic Prompting for Robust Visual Recognition with Missing Modalities

ICCV 2025
0
citations

What Makes for Text to 360-degree Panorama Generation with Stable Diffusion?

ICCV 2025
0
citations

Rethink Sparse Signals for Pose-guided Text-to-image Generation

ICCV 2025
0
citations

ATCTrack: Aligning Target-Context Cues with Dynamic Target States for Robust Vision-Language Tracking

ICCV 2025
0
citations

Patch-level Sounding Object Tracking for Audio-Visual Question Answering

AAAI 2025
0
citations

Multi-axis Prompt and Multi-dimension Fusion Network for All-in-one Weather-degraded Image Restoration

AAAI 2025
0
citations

UAWTrack: Universal 3D Single Object Tracking in Adverse Weather

AAAI 2025
0
citations

Semi-supervised Infrared Small Target Detection with Thermodynamic-Inspired Uneven Perturbation and Confidence Adaptation

AAAI 2025
0
citations

Category Anchor-Guided Unsupervised Domain Adaptation for Semantic Segmentation

NeurIPS 2019
0
citations

Learn, Imagine and Create: Text-to-Image Generation from Prior Knowledge

NeurIPS 2019
0
citations

Auto Learning Attention

NeurIPS 2020
0
citations

Learning Generative Vision Transformer with Energy-Based Latent Space for Saliency Prediction

NeurIPS 2021
0
citations

ViTAE: Vision Transformer Advanced by Exploring Intrinsic Inductive Bias

NeurIPS 2021
0
citations

Watermarking for Out-of-distribution Detection

NeurIPS 2022
0
citations

Exploring Figure-Ground Assignment Mechanism in Perceptual Organization

NeurIPS 2022
0
citations

APT-36K: A Large-scale Benchmark for Animal Pose Estimation and Tracking

NeurIPS 2022
0
citations

SCL-WC: Cross-Slide Contrastive Learning for Weakly-Supervised Whole-Slide Image Classification

NeurIPS 2022
0
citations

ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation

NeurIPS 2022
0
citations

Constrained Policy Optimization with Explicit Behavior Density For Offline Reinforcement Learning

NeurIPS 2023
0
citations

SAMRS: Scaling-up Remote Sensing Segmentation Dataset with Segment Anything Model

NeurIPS 2023
0
citations

DiffSketcher: Text Guided Vector Sketch Synthesis through Latent Diffusion Models

NeurIPS 2023
0
citations