Jing Zhang
109
Papers
1,784
Total Citations
2
Affiliations
Hefei University of Technology
Gent University-imec
Papers (109)
T2I-Adapter: Learning Adapters to Dig Out More Controllable Ability for Text-to-Image Diffusion
AAAI 2024 (arXiv)
1,423
citations
SurgicalSAM: Efficient Class Promptable Surgical Instrument Segmentation
AAAI 2024 (arXiv)
110
citations
A Semi-supervised Nighttime Dehazing Baseline with Spatial-Frequency Aware and Realistic Brightness Constraint
CVPR 2024
52
citations
CityWalker: Learning Embodied Urban Navigation from Web-Scale Videos
CVPR 2025
25
citations
XLRS-Bench: Could Your Multimodal LLMs Understand Extremely Large Ultra-High-Resolution Remote Sensing Imagery?
CVPR 2025
24
citations
SimDistill: Simulated Multi-Modal Distillation for BEV 3D Object Detection
AAAI 2024 (arXiv)
24
citations
Object-Aware Adaptive-Positivity Learning for Audio-Visual Question Answering
AAAI 2024 (arXiv)
24
citations
Question Calibration and Multi-Hop Modeling for Temporal Question Answering
AAAI 2024 (arXiv)
21
citations
IMPUS: Image Morphing with Perceptually-Uniform Sampling Using Diffusion Models
ICLR 2024
21
citations
Decomposing Semantic Shifts for Composed Image Retrieval
AAAI 2024 (arXiv)
17
citations
LUWA Dataset: Learning Lithic Use-Wear Analysis on Microscopic Images
CVPR 2024
10
citations
RoMA: Scaling up Mamba-based Foundation Models for Remote Sensing
NeurIPS 2025
10
citations
CAD-Coder: Text-to-CAD Generation with Chain-of-Thought and Geometric Reward
NeurIPS 2025
9
citations
Probability Density Geodesics in Image Diffusion Latent Space
CVPR 2025
7
citations
MOL-Mamba: Enhancing Molecular Representation with Structural & Electronic Insights
AAAI 2025
6
citations
Adversarial Exploitation of Data Diversity Improves Visual Localization
ICCV 2025
1
citations
MOCID: Motion Context and Displacement Information Learning for Moving Infrared Small Target Detection
AAAI 2025
0
citations
Highly Imperceptible Black-Box Graph Injection Attacks with Reinforcement Learning
AAAI 2025
0
citations
Multi-Modality Affinity Inference for Weakly Supervised 3D Semantic Segmentation
AAAI 2024
0
citations
Data-Free Generalized Zero-Shot Learning
AAAI 2024 (arXiv)
0
citations
Adversarial Purification with the Manifold Hypothesis
AAAI 2024
0
citations
Quantum-Inspired Neural Network with Runge-Kutta Method
AAAI 2024
0
citations
LaViP: Language-Grounded Visual Prompting
AAAI 2024
0
citations
ArGue: Attribute-Guided Prompt Tuning for Vision-Language Models
CVPR 2024
0
citations
SVGDreamer: Text Guided SVG Generation with Diffusion Model
CVPR 2024
0
citations
UniMix: Towards Domain Adaptive and Generalizable LiDAR Semantic Segmentation in Adverse Weather
CVPR 2024
0
citations
OxyGenerator: Reconstructing Global Ocean Deoxygenation Over a Century with Deep Learning
ICML 2024
0
citations
Latent Optimal Paths by Gumbel Propagation for Variational Bayesian Dynamic Programming
ICML 2024
0
citations
Joint Geometrical and Statistical Alignment for Visual Domain Adaptation
CVPR 2017 (arXiv)
0
citations
Fast Haze Removal for Nighttime Image Using Maximum Reflectance Prior
CVPR 2017
0
citations
Importance Weighted Adversarial Nets for Partial Domain Adaptation
CVPR 2018 (arXiv)
0
citations
Deep Unsupervised Saliency Detection: A Multiple Noisy Labeling Perspective
CVPR 2018 (arXiv)
0
citations
MirrorGAN: Learning Text-To-Image Generation by Redescription
CVPR 2019
0
citations
Few-Shot Learning via Saliency-Guided Hallucination of Samples
CVPR 2019
0
citations
ShieldNets: Defending Against Adversarial Attacks Using Probabilistic Adversarial Robustness
CVPR 2019
0
citations
UC-Net: Uncertainty Inspired RGB-D Saliency Detection via Conditional Variational Autoencoders
CVPR 2020
0
citations
Deep Degradation Prior for Low-Quality Image Classification
CVPR 2020
0
citations
Weakly-Supervised Salient Object Detection via Scribble Annotations
CVPR 2020 (arXiv)
0
citations
Simultaneously Localize, Segment and Rank the Camouflaged Objects
CVPR 2021 (arXiv)
0
citations
Weakly Supervised Video Salient Object Detection
CVPR 2021 (arXiv)
0
citations
Uncertainty-Aware Joint Salient Object and Camouflaged Object Detection
CVPR 2021 (arXiv)
0
citations
3DJCG: A Unified Framework for Joint Dense Captioning and Visual Grounding on 3D Point Clouds
CVPR 2022
0
citations
DearKD: Data-Efficient Early Knowledge Distillation for Vision Transformers
CVPR 2022 (arXiv)
0
citations
GMFlow: Learning Optical Flow via Global Matching
CVPR 2022 (arXiv)
0
citations
Recurrent Glimpse-Based Decoder for Detection With Transformer
CVPR 2022 (arXiv)
0
citations
Learning Affordance Grounding From Exocentric Images
CVPR 2022 (arXiv)
0
citations
ISNet: Shape Matters for Infrared Small Target Detection
CVPR 2022
0
citations
RU-Net: Regularized Unrolling Network for Scene Graph Generation
CVPR 2022
0
citations
FIBA: Frequency-Injection Based Backdoor Attack in Medical Image Analysis
CVPR 2022 (arXiv)
0
citations
Dynamic Focus-Aware Positional Queries for Semantic Segmentation
CVPR 2023 (arXiv)
0
citations
Explicit Boundary Guided Semi-Push-Pull Contrastive Learning for Supervised Anomaly Detection
CVPR 2023 (arXiv)
0
citations
Leverage Interactive Affinity for Affordance Learning
CVPR 2023
0
citations
Modeling the Distributional Uncertainty for Salient Object Detection Models
CVPR 2023
0
citations
CLAMP: Prompt-Based Contrastive Learning for Connecting Language and Animal Pose
CVPR 2023 (arXiv)
0
citations
DeepSolo: Let Transformer Decoder With Explicit Points Solo for Text Spotting
CVPR 2023 (arXiv)
0
citations
Decoupling Learning and Remembering: A Bilevel Memory Framework With Knowledge Projection for Task-Incremental Learning
CVPR 2023
0
citations
Referring Image Matting
CVPR 2023 (arXiv)
0
citations
Deep Multiple-Attribute-Perceived Network for Real-World Texture Recognition
ICCV 2019
0
citations
Out-of-Boundary View Synthesis Towards Full-Frame Video Stabilization
ICCV 2021 (arXiv)
0
citations
RGB-D Saliency Detection via Cascaded Mutual Information Minimization
ICCV 2021
0
citations
LPFF: A Portrait Dataset for Face Generators Across Large Poses
ICCV 2023 (arXiv)
0
citations
Domain Specified Optimization for Deployment Authorization
ICCV 2023
0
citations
Sensitivity-Aware Visual Parameter-Efficient Fine-Tuning
ICCV 2023 (arXiv)
0
citations
RPEFlow: Multimodal Fusion of RGB-PointCloud-Event for Joint Optical Flow and Scene Flow Estimation
ICCV 2023
0
citations
Multimodal Variational Auto-encoder based Audio-Visual Segmentation
ICCV 2023
0
citations
ESSAformer: Efficient Transformer for Hyperspectral Image Super-resolution
ICCV 2023 (arXiv)
0
citations
Model Calibration in Dense Classification with Adaptive Label Perturbation
ICCV 2023 (arXiv)
0
citations
Learning Noise-Aware Encoder-Decoder from Noisy Labels by Alternating Back-Propagation for Saliency Detection
ECCV 2020
0
citations
MeshMAE: Masked Autoencoders for 3D Mesh Data Analysis
ECCV 2022
0
citations
Towards Data-Efficient Detection Transformers
ECCV 2022
0
citations
ReAct: Temporal Action Detection with Relational Queries
ECCV 2022
0
citations
FakeCLR: Exploring Contrastive Learning for Solving Latent Discontinuity in Data-Efficient GANs
ECCV 2022
0
citations
VSA: Learning Varied-Size Window Attention in Vision Transformers
ECCV 2022
0
citations
PolyphonicFormer: Unified Query Learning for Depth-Aware Video Panoptic Segmentation
ECCV 2022
0
citations
Improving RGB-D Point Cloud Registration by Learning Multi-Scale Local Linear Transformation
ECCV 2022
0
citations
RegionCL: Exploring Contrastive Region Pairs for Self-Supervised Representation Learning
ECCV 2022
0
citations
BMD: A General Class-Balanced Multicentric Dynamic Prototype Strategy for Source-Free Domain Adaptation
ECCV 2022
0
citations
Audio-Visual Segmentation
ECCV 2022
0
citations
Towards Scale-Aware, Robust, and Generalizable Unsupervised Monocular Depth Estimation by Integrating IMU Motion Dynamics
ECCV 2022
0
citations
JPerceiver: Joint Perception Network for Depth, Pose and Layout Estimation in Driving Scenes
ECCV 2022
0
citations
P2C: Self-Supervised Point Cloud Completion from Single Partial Clouds
ICCV 2023 (arXiv)
0
citations
CARE Transformer: Mobile-Friendly Linear Visual Transformer via Decoupled Dual Interaction
CVPR 2025
0
citations
SAIST: Segment Any Infrared Small Target Model Guided by Contrastive Language-Image Pretraining
CVPR 2025
0
citations
Empowering LLMs to Understand and Generate Complex Vector Graphics
CVPR 2025
0
citations
Brain-Inspired Spiking Neural Networks for Energy-Efficient Object Detection
CVPR 2025
0
citations
Identifying and Mitigating Position Bias of Multi-image Vision-Language Models
CVPR 2025
0
citations
GARF: Learning Generalizable 3D Reassembly for Real-World Fractures
ICCV 2025
0
citations
Harnessing Massive Satellite Imagery with Efficient Masked Image Modeling
ICCV 2025
0
citations
Synergistic Prompting for Robust Visual Recognition with Missing Modalities
ICCV 2025
0
citations
What Makes for Text to 360-degree Panorama Generation with Stable Diffusion?
ICCV 2025
0
citations
Rethink Sparse Signals for Pose-guided Text-to-image Generation
ICCV 2025
0
citations
ATCTrack: Aligning Target-Context Cues with Dynamic Target States for Robust Vision-Language Tracking
ICCV 2025
0
citations
Patch-level Sounding Object Tracking for Audio-Visual Question Answering
AAAI 2025
0
citations
Multi-axis Prompt and Multi-dimension Fusion Network for All-in-one Weather-degraded Image Restoration
AAAI 2025
0
citations
UAWTrack: Universal 3D Single Object Tracking in Adverse Weather
AAAI 2025
0
citations
Semi-supervised Infrared Small Target Detection with Thermodynamic-Inspired Uneven Perturbation and Confidence Adaptation
AAAI 2025
0
citations
Category Anchor-Guided Unsupervised Domain Adaptation for Semantic Segmentation
NeurIPS 2019
0
citations
Learn, Imagine and Create: Text-to-Image Generation from Prior Knowledge
NeurIPS 2019
0
citations
Auto Learning Attention
NeurIPS 2020
0
citations
Learning Generative Vision Transformer with Energy-Based Latent Space for Saliency Prediction
NeurIPS 2021
0
citations
ViTAE: Vision Transformer Advanced by Exploring Intrinsic Inductive Bias
NeurIPS 2021
0
citations
Watermarking for Out-of-distribution Detection
NeurIPS 2022
0
citations
Exploring Figure-Ground Assignment Mechanism in Perceptual Organization
NeurIPS 2022
0
citations
APT-36K: A Large-scale Benchmark for Animal Pose Estimation and Tracking
NeurIPS 2022
0
citations
SCL-WC: Cross-Slide Contrastive Learning for Weakly-Supervised Whole-Slide Image Classification
NeurIPS 2022
0
citations
ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation
NeurIPS 2022
0
citations
Constrained Policy Optimization with Explicit Behavior Density For Offline Reinforcement Learning
NeurIPS 2023
0
citations
SAMRS: Scaling-up Remote Sensing Segmentation Dataset with Segment Anything Model
NeurIPS 2023
0
citations
DiffSketcher: Text Guided Vector Sketch Synthesis through Latent Diffusion Models
NeurIPS 2023
0
citations