Jing Zhang

43
Papers
1,796
Total Citations
2
Affiliations

Affiliations

Hefei University of TechnologyGent University-imec

Papers (43)

T2I-Adapter: Learning Adapters to Dig Out More Controllable Ability for Text-to-Image Diffusion

AAAI 2024arXiv
1,423
citations

SurgicalSAM: Efficient Class Promptable Surgical Instrument Segmentation

AAAI 2024arXiv
110
citations

A Semi-supervised Nighttime Dehazing Baseline with Spatial-Frequency Aware and Realistic Brightness Constraint

CVPR 2024
52
citations

CityWalker: Learning Embodied Urban Navigation from Web-Scale Videos

CVPR 2025
25
citations

Object-Aware Adaptive-Positivity Learning for Audio-Visual Question Answering

AAAI 2024arXiv
24
citations

XLRS-Bench: Could Your Multimodal LLMs Understand Extremely Large Ultra-High-Resolution Remote Sensing Imagery?

CVPR 2025
24
citations

SimDistill: Simulated Multi-Modal Distillation for BEV 3D Object Detection

AAAI 2024arXiv
24
citations

Question Calibration and Multi-Hop Modeling for Temporal Question Answering

AAAI 2024arXiv
21
citations

IMPUS: Image Morphing with Perceptually-Uniform Sampling Using Diffusion Models

ICLR 2024
21
citations

Decomposing Semantic Shifts for Composed Image Retrieval

AAAI 2024arXiv
17
citations

Harnessing Massive Satellite Imagery with Efficient Masked Image Modeling

ICCV 2025arXiv
10
citations

LUWA Dataset: Learning Lithic Use-Wear Analysis on Microscopic Images

CVPR 2024
10
citations

RoMA: Scaling up Mamba-based Foundation Models for Remote Sensing

NeurIPS 2025
10
citations

CAD-Coder: Text-to-CAD Generation with Chain-of-Thought and Geometric Reward

NeurIPS 2025
9
citations

Probability Density Geodesics in Image Diffusion Latent Space

CVPR 2025arXiv
9
citations

MOL-Mamba: Enhancing Molecular Representation with Structural & Electronic Insights

AAAI 2025
6
citations

Adversarial Exploitation of Data Diversity Improves Visual Localization

ICCV 2025
1
citations

Patch-level Sounding Object Tracking for Audio-Visual Question Answering

AAAI 2025
0
citations

Multi-axis Prompt and Multi-dimension Fusion Network for All-in-one Weather-degraded Image Restoration

AAAI 2025
0
citations

UAWTrack: Universal 3D Single Object Tracking in Adverse Weather

AAAI 2025
0
citations

Semi-supervised Infrared Small Target Detection with Thermodynamic-Inspired Uneven Perturbation and Confidence Adaptation

AAAI 2025
0
citations

MOCID: Motion Context and Displacement Information Learning for Moving Infrared Small Target Detection

AAAI 2025
0
citations

Highly Imperceptible Black-Box Graph Injection Attacks with Reinforcement Learning

AAAI 2025
0
citations

What Makes for Text to 360-degree Panorama Generation with Stable Diffusion?

ICCV 2025
0
citations

Synergistic Prompting for Robust Visual Recognition with Missing Modalities

ICCV 2025
0
citations

GARF: Learning Generalizable 3D Reassembly for Real-World Fractures

ICCV 2025
0
citations

Multi-Modality Affinity Inference for Weakly Supervised 3D Semantic Segmentation

AAAI 2024
0
citations

Identifying and Mitigating Position Bias of Multi-image Vision-Language Models

CVPR 2025
0
citations

Data-Free Generalized Zero-Shot Learning

AAAI 2024arXiv
0
citations

Adversarial Purification with the Manifold Hypothesis

AAAI 2024
0
citations

Quantum-Inspired Neural Network with Runge-Kutta Method

AAAI 2024
0
citations

LaViP: Language-Grounded Visual Prompting

AAAI 2024
0
citations

Brain-Inspired Spiking Neural Networks for Energy-Efficient Object Detection

CVPR 2025
0
citations

Empowering LLMs to Understand and Generate Complex Vector Graphics

CVPR 2025
0
citations

ArGue: Attribute-Guided Prompt Tuning for Vision-Language Models

CVPR 2024
0
citations

SAIST: Segment Any Infrared Small Target Model Guided by Contrastive Language-Image Pretraining

CVPR 2025
0
citations

SVGDreamer: Text Guided SVG Generation with Diffusion Model

CVPR 2024
0
citations

UniMix: Towards Domain Adaptive and Generalizable LiDAR Semantic Segmentation in Adverse Weather

CVPR 2024
0
citations

CARE Transformer: Mobile-Friendly Linear Visual Transformer via Decoupled Dual Interaction

CVPR 2025
0
citations

OxyGenerator: Reconstructing Global Ocean Deoxygenation Over a Century with Deep Learning

ICML 2024
0
citations

ATCTrack: Aligning Target-Context Cues with Dynamic Target States for Robust Vision-Language Tracking

ICCV 2025
0
citations

Latent Optimal Paths by Gumbel Propagation for Variational Bayesian Dynamic Programming

ICML 2024
0
citations

Rethink Sparse Signals for Pose-guided Text-to-image Generation

ICCV 2025
0
citations