Jing Zhang
43
Papers
1,784
Total Citations
2
Affiliations
Hefei University of Technology
Gent University-imec
Papers (43)
T2I-Adapter: Learning Adapters to Dig Out More Controllable Ability for Text-to-Image Diffusion
AAAI 2024 (arXiv)
1,423
citations
SurgicalSAM: Efficient Class Promptable Surgical Instrument Segmentation
AAAI 2024 (arXiv)
110
citations
A Semi-supervised Nighttime Dehazing Baseline with Spatial-Frequency Aware and Realistic Brightness Constraint
CVPR 2024
52
citations
CityWalker: Learning Embodied Urban Navigation from Web-Scale Videos
CVPR 2025
25
citations
SimDistill: Simulated Multi-Modal Distillation for BEV 3D Object Detection
AAAI 2024 (arXiv)
24
citations
XLRS-Bench: Could Your Multimodal LLMs Understand Extremely Large Ultra-High-Resolution Remote Sensing Imagery?
CVPR 2025
24
citations
Object-Aware Adaptive-Positivity Learning for Audio-Visual Question Answering
AAAI 2024 (arXiv)
24
citations
Question Calibration and Multi-Hop Modeling for Temporal Question Answering
AAAI 2024 (arXiv)
21
citations
IMPUS: Image Morphing with Perceptually-Uniform Sampling Using Diffusion Models
ICLR 2024
21
citations
Decomposing Semantic Shifts for Composed Image Retrieval
AAAI 2024 (arXiv)
17
citations
RoMA: Scaling up Mamba-based Foundation Models for Remote Sensing
NeurIPS 2025
10
citations
LUWA Dataset: Learning Lithic Use-Wear Analysis on Microscopic Images
CVPR 2024
10
citations
CAD-Coder: Text-to-CAD Generation with Chain-of-Thought and Geometric Reward
NeurIPS 2025
9
citations
Probability Density Geodesics in Image Diffusion Latent Space
CVPR 2025
7
citations
MOL-Mamba: Enhancing Molecular Representation with Structural & Electronic Insights
AAAI 2025
6
citations
Adversarial Exploitation of Data Diversity Improves Visual Localization
ICCV 2025
1
citation
MOCID: Motion Context and Displacement Information Learning for Moving Infrared Small Target Detection
AAAI 2025
0
citations
Highly Imperceptible Black-Box Graph Injection Attacks with Reinforcement Learning
AAAI 2025
0
citations
Multi-Modality Affinity Inference for Weakly Supervised 3D Semantic Segmentation
AAAI 2024
0
citations
Data-Free Generalized Zero-Shot Learning
AAAI 2024 (arXiv)
0
citations
Adversarial Purification with the Manifold Hypothesis
AAAI 2024
0
citations
Quantum-Inspired Neural Network with Runge-Kutta Method
AAAI 2024
0
citations
LaViP: Language-Grounded Visual Prompting
AAAI 2024
0
citations
ArGue: Attribute-Guided Prompt Tuning for Vision-Language Models
CVPR 2024
0
citations
SVGDreamer: Text Guided SVG Generation with Diffusion Model
CVPR 2024
0
citations
UniMix: Towards Domain Adaptive and Generalizable LiDAR Semantic Segmentation in Adverse Weather
CVPR 2024
0
citations
OxyGenerator: Reconstructing Global Ocean Deoxygenation Over a Century with Deep Learning
ICML 2024
0
citations
Latent Optimal Paths by Gumbel Propagation for Variational Bayesian Dynamic Programming
ICML 2024
0
citations
CARE Transformer: Mobile-Friendly Linear Visual Transformer via Decoupled Dual Interaction
CVPR 2025
0
citations
SAIST: Segment Any Infrared Small Target Model Guided by Contrastive Language-Image Pretraining
CVPR 2025
0
citations
Empowering LLMs to Understand and Generate Complex Vector Graphics
CVPR 2025
0
citations
Brain-Inspired Spiking Neural Networks for Energy-Efficient Object Detection
CVPR 2025
0
citations
Identifying and Mitigating Position Bias of Multi-image Vision-Language Models
CVPR 2025
0
citations
GARF: Learning Generalizable 3D Reassembly for Real-World Fractures
ICCV 2025
0
citations
Harnessing Massive Satellite Imagery with Efficient Masked Image Modeling
ICCV 2025
0
citations
Synergistic Prompting for Robust Visual Recognition with Missing Modalities
ICCV 2025
0
citations
What Makes for Text to 360-degree Panorama Generation with Stable Diffusion?
ICCV 2025
0
citations
Rethink Sparse Signals for Pose-guided Text-to-image Generation
ICCV 2025
0
citations
ATCTrack: Aligning Target-Context Cues with Dynamic Target States for Robust Vision-Language Tracking
ICCV 2025
0
citations
Patch-level Sounding Object Tracking for Audio-Visual Question Answering
AAAI 2025
0
citations
Multi-axis Prompt and Multi-dimension Fusion Network for All-in-one Weather-degraded Image Restoration
AAAI 2025
0
citations
UAWTrack: Universal 3D Single Object Tracking in Adverse Weather
AAAI 2025
0
citations
Semi-supervised Infrared Small Target Detection with Thermodynamic-Inspired Uneven Perturbation and Confidence Adaptation
AAAI 2025
0
citations