Jing Zhang

43
Papers
1,784
Total Citations
2
Affiliations

Affiliations

Hefei University of TechnologyGent University-imec

Papers (43)

T2I-Adapter: Learning Adapters to Dig Out More Controllable Ability for Text-to-Image Diffusion

AAAI 2024arXiv
1,423
citations

SurgicalSAM: Efficient Class Promptable Surgical Instrument Segmentation

AAAI 2024arXiv
110
citations

A Semi-supervised Nighttime Dehazing Baseline with Spatial-Frequency Aware and Realistic Brightness Constraint

CVPR 2024
52
citations

CityWalker: Learning Embodied Urban Navigation from Web-Scale Videos

CVPR 2025
25
citations

SimDistill: Simulated Multi-Modal Distillation for BEV 3D Object Detection

AAAI 2024arXiv
24
citations

XLRS-Bench: Could Your Multimodal LLMs Understand Extremely Large Ultra-High-Resolution Remote Sensing Imagery?

CVPR 2025
24
citations

Object-Aware Adaptive-Positivity Learning for Audio-Visual Question Answering

AAAI 2024arXiv
24
citations

Question Calibration and Multi-Hop Modeling for Temporal Question Answering

AAAI 2024arXiv
21
citations

IMPUS: Image Morphing with Perceptually-Uniform Sampling Using Diffusion Models

ICLR 2024
21
citations

Decomposing Semantic Shifts for Composed Image Retrieval

AAAI 2024arXiv
17
citations

RoMA: Scaling up Mamba-based Foundation Models for Remote Sensing

NeurIPS 2025
10
citations

LUWA Dataset: Learning Lithic Use-Wear Analysis on Microscopic Images

CVPR 2024
10
citations

CAD-Coder: Text-to-CAD Generation with Chain-of-Thought and Geometric Reward

NeurIPS 2025
9
citations

Probability Density Geodesics in Image Diffusion Latent Space

CVPR 2025
7
citations

MOL-Mamba: Enhancing Molecular Representation with Structural & Electronic Insights

AAAI 2025
6
citations

Adversarial Exploitation of Data Diversity Improves Visual Localization

ICCV 2025
1
citations

MOCID: Motion Context and Displacement Information Learning for Moving Infrared Small Target Detection

AAAI 2025
0
citations

Highly Imperceptible Black-Box Graph Injection Attacks with Reinforcement Learning

AAAI 2025
0
citations

Multi-Modality Affinity Inference for Weakly Supervised 3D Semantic Segmentation

AAAI 2024
0
citations

Data-Free Generalized Zero-Shot Learning

AAAI 2024arXiv
0
citations

Adversarial Purification with the Manifold Hypothesis

AAAI 2024
0
citations

Quantum-Inspired Neural Network with Runge-Kutta Method

AAAI 2024
0
citations

LaViP: Language-Grounded Visual Prompting

AAAI 2024
0
citations

ArGue: Attribute-Guided Prompt Tuning for Vision-Language Models

CVPR 2024
0
citations

SVGDreamer: Text Guided SVG Generation with Diffusion Model

CVPR 2024
0
citations

UniMix: Towards Domain Adaptive and Generalizable LiDAR Semantic Segmentation in Adverse Weather

CVPR 2024
0
citations

OxyGenerator: Reconstructing Global Ocean Deoxygenation Over a Century with Deep Learning

ICML 2024
0
citations

Latent Optimal Paths by Gumbel Propagation for Variational Bayesian Dynamic Programming

ICML 2024
0
citations

CARE Transformer: Mobile-Friendly Linear Visual Transformer via Decoupled Dual Interaction

CVPR 2025
0
citations

SAIST: Segment Any Infrared Small Target Model Guided by Contrastive Language-Image Pretraining

CVPR 2025
0
citations

Empowering LLMs to Understand and Generate Complex Vector Graphics

CVPR 2025
0
citations

Brain-Inspired Spiking Neural Networks for Energy-Efficient Object Detection

CVPR 2025
0
citations

Identifying and Mitigating Position Bias of Multi-image Vision-Language Models

CVPR 2025
0
citations

GARF: Learning Generalizable 3D Reassembly for Real-World Fractures

ICCV 2025
0
citations

Harnessing Massive Satellite Imagery with Efficient Masked Image Modeling

ICCV 2025
0
citations

Synergistic Prompting for Robust Visual Recognition with Missing Modalities

ICCV 2025
0
citations

What Makes for Text to 360-degree Panorama Generation with Stable Diffusion?

ICCV 2025
0
citations

Rethink Sparse Signals for Pose-guided Text-to-image Generation

ICCV 2025
0
citations

ATCTrack: Aligning Target-Context Cues with Dynamic Target States for Robust Vision-Language Tracking

ICCV 2025
0
citations

Patch-level Sounding Object Tracking for Audio-Visual Question Answering

AAAI 2025
0
citations

Multi-axis Prompt and Multi-dimension Fusion Network for All-in-one Weather-degraded Image Restoration

AAAI 2025
0
citations

UAWTrack: Universal 3D Single Object Tracking in Adverse Weather

AAAI 2025
0
citations

Semi-supervised Infrared Small Target Detection with Thermodynamic-Inspired Uneven Perturbation and Confidence Adaptation

AAAI 2025
0
citations