Jing Zhang
43
Papers
1,796
Total Citations
2
Affiliations
Affiliations
Hefei University of TechnologyGent University-imec
Papers (43)
T2I-Adapter: Learning Adapters to Dig Out More Controllable Ability for Text-to-Image Diffusion
AAAI 2024arXiv
1,423
citations
SurgicalSAM: Efficient Class Promptable Surgical Instrument Segmentation
AAAI 2024arXiv
110
citations
A Semi-supervised Nighttime Dehazing Baseline with Spatial-Frequency Aware and Realistic Brightness Constraint
CVPR 2024
52
citations
CityWalker: Learning Embodied Urban Navigation from Web-Scale Videos
CVPR 2025
25
citations
Object-Aware Adaptive-Positivity Learning for Audio-Visual Question Answering
AAAI 2024arXiv
24
citations
XLRS-Bench: Could Your Multimodal LLMs Understand Extremely Large Ultra-High-Resolution Remote Sensing Imagery?
CVPR 2025
24
citations
SimDistill: Simulated Multi-Modal Distillation for BEV 3D Object Detection
AAAI 2024arXiv
24
citations
Question Calibration and Multi-Hop Modeling for Temporal Question Answering
AAAI 2024arXiv
21
citations
IMPUS: Image Morphing with Perceptually-Uniform Sampling Using Diffusion Models
ICLR 2024
21
citations
Decomposing Semantic Shifts for Composed Image Retrieval
AAAI 2024arXiv
17
citations
Harnessing Massive Satellite Imagery with Efficient Masked Image Modeling
ICCV 2025arXiv
10
citations
LUWA Dataset: Learning Lithic Use-Wear Analysis on Microscopic Images
CVPR 2024
10
citations
RoMA: Scaling up Mamba-based Foundation Models for Remote Sensing
NeurIPS 2025
10
citations
CAD-Coder: Text-to-CAD Generation with Chain-of-Thought and Geometric Reward
NeurIPS 2025
9
citations
Probability Density Geodesics in Image Diffusion Latent Space
CVPR 2025arXiv
9
citations
MOL-Mamba: Enhancing Molecular Representation with Structural & Electronic Insights
AAAI 2025
6
citations
Adversarial Exploitation of Data Diversity Improves Visual Localization
ICCV 2025
1
citations
Patch-level Sounding Object Tracking for Audio-Visual Question Answering
AAAI 2025
0
citations
Multi-axis Prompt and Multi-dimension Fusion Network for All-in-one Weather-degraded Image Restoration
AAAI 2025
0
citations
UAWTrack: Universal 3D Single Object Tracking in Adverse Weather
AAAI 2025
0
citations
Semi-supervised Infrared Small Target Detection with Thermodynamic-Inspired Uneven Perturbation and Confidence Adaptation
AAAI 2025
0
citations
MOCID: Motion Context and Displacement Information Learning for Moving Infrared Small Target Detection
AAAI 2025
0
citations
Highly Imperceptible Black-Box Graph Injection Attacks with Reinforcement Learning
AAAI 2025
0
citations
What Makes for Text to 360-degree Panorama Generation with Stable Diffusion?
ICCV 2025
0
citations
Synergistic Prompting for Robust Visual Recognition with Missing Modalities
ICCV 2025
0
citations
GARF: Learning Generalizable 3D Reassembly for Real-World Fractures
ICCV 2025
0
citations
Multi-Modality Affinity Inference for Weakly Supervised 3D Semantic Segmentation
AAAI 2024
0
citations
Identifying and Mitigating Position Bias of Multi-image Vision-Language Models
CVPR 2025
0
citations
Data-Free Generalized Zero-Shot Learning
AAAI 2024arXiv
0
citations
Adversarial Purification with the Manifold Hypothesis
AAAI 2024
0
citations
Quantum-Inspired Neural Network with Runge-Kutta Method
AAAI 2024
0
citations
LaViP: Language-Grounded Visual Prompting
AAAI 2024
0
citations
Brain-Inspired Spiking Neural Networks for Energy-Efficient Object Detection
CVPR 2025
0
citations
Empowering LLMs to Understand and Generate Complex Vector Graphics
CVPR 2025
0
citations
ArGue: Attribute-Guided Prompt Tuning for Vision-Language Models
CVPR 2024
0
citations
SAIST: Segment Any Infrared Small Target Model Guided by Contrastive Language-Image Pretraining
CVPR 2025
0
citations
SVGDreamer: Text Guided SVG Generation with Diffusion Model
CVPR 2024
0
citations
UniMix: Towards Domain Adaptive and Generalizable LiDAR Semantic Segmentation in Adverse Weather
CVPR 2024
0
citations
CARE Transformer: Mobile-Friendly Linear Visual Transformer via Decoupled Dual Interaction
CVPR 2025
0
citations
OxyGenerator: Reconstructing Global Ocean Deoxygenation Over a Century with Deep Learning
ICML 2024
0
citations
ATCTrack: Aligning Target-Context Cues with Dynamic Target States for Robust Vision-Language Tracking
ICCV 2025
0
citations
Latent Optimal Paths by Gumbel Propagation for Variational Bayesian Dynamic Programming
ICML 2024
0
citations
Rethink Sparse Signals for Pose-guided Text-to-image Generation
ICCV 2025
0
citations