Zheng-Jun Zha

101
Papers
115
Total Citations

Papers (101)

Revisiting Single Image Reflection Removal In the Wild

CVPR 2024
37
citations

QMambaBSR: Burst Image Super-Resolution with Query State Space Model

CVPR 2025
19
citations

Improved Video VAE for Latent Video Diffusion Model

CVPR 2025, arXiv
19
citations

Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning

CVPR 2025
18
citations

MMAR: Towards Lossless Multi-Modal Auto-Regressive Probabilistic Modeling

CVPR 2025
13
citations

PMQ-VE: Progressive Multi-Frame Quantization for Video Enhancement

NeurIPS 2025
9
citations

EVDM: Event-based Real-world Video Deblurring with Mamba

ICCV 2025
0
citations

Decouple to Reconstruct: High Quality UHD Restoration via Active Feature Disentanglement and Reversible Fusion

ICCV 2025
0
citations

HERO: Human Reaction Generation from Videos

ICCV 2025
0
citations

MATE: Motion-Augmented Temporal Consistency for Event-based Point Tracking

ICCV 2025
0
citations

EMoTive: Event-guided Trajectory Modeling for 3D Motion Estimation

ICCV 2025
0
citations

Enhanced Pansharpening via Quaternion Spatial-Spectral Interactions

ICCV 2025
0
citations

EventMamba: Enhancing Spatio-Temporal Locality with State Space Models for Event-Based Video Reconstruction

AAAI 2025
0
citations

SCott: Accelerating Diffusion Models with Stochastic Consistency Distillation

AAAI 2025
0
citations

Boosting Image De-Raining via Central-Surrounding Synergistic Convolution

AAAI 2025
0
citations

DCTMamba: Advancing JPEG Image Restoration Through Long-Sequence Modeling and Adaptive Frequency Strategy

AAAI 2025
0
citations

HOIMamba: Efficient Mamba-based Disentangled Progressive Learning for HOI Detection

AAAI 2025
0
citations

A Lottery Ticket Hypothesis Approach with Sparse Fine-tuning and MAE for Image Forgery Detection and Localization

AAAI 2025
0
citations

Fusion-Vital: Video-RF Fusion Transformer for Advanced Remote Physiological Measurement

AAAI 2024
0
citations

Learning Discriminative Noise Guidance for Image Forgery Detection and Localization

AAAI 2024
0
citations

HomoFormer: Homogenized Transformer for Image Shadow Removal

CVPR 2024
0
citations

LEMON: Learning 3D Human-Object Interaction Relation from 2D Images

CVPR 2024
0
citations

Prompt-Enhanced Multiple Instance Learning for Weakly Supervised Video Anomaly Detection

CVPR 2024
0
citations

CCM: Real-Time Controllable Visual Content Creation Using Text-to-Image Consistency Models

ICML 2024
0
citations

Comparative Deep Learning of Hybrid Representations for Image Recommendations

CVPR 2016
0
citations

MiCT: Mixed 3D/2D Convolutional Tube for Human Action Recognition

CVPR 2018
0
citations

Camera Lens Super-Resolution

CVPR 2019
0
citations

Context-Reinforced Semantic Segmentation

CVPR 2019
0
citations

Looking for the Devil in the Details: Learning Trilinear Attention Sampling Network for Fine-Grained Image Recognition

CVPR 2019
0
citations

Adaptive Transfer Network for Cross-Domain Person Re-Identification

CVPR 2019
0
citations

State-Relabeling Adversarial Active Learning

CVPR 2020, arXiv
0
citations

Parsing-Based View-Aware Embedding Network for Vehicle Re-Identification

CVPR 2020, arXiv
0
citations

Deep Structure-Revealed Network for Texture Recognition

CVPR 2020
0
citations

ContourNet: Taking a Further Step Toward Accurate Arbitrary-Shaped Scene Text Detection

CVPR 2020, arXiv
0
citations

Deep Degradation Prior for Low-Quality Image Classification

CVPR 2020
0
citations

Real-World Person Re-Identification via Degradation Invariance Learning

CVPR 2020, arXiv
0
citations

Domain-Aware Visual Bias Eliminating for Generalized Zero-Shot Learning

CVPR 2020, arXiv
0
citations

Iterative Context-Aware Graph Inference for Visual Dialog

CVPR 2020, arXiv
0
citations

Spatiotemporal Fusion in 3D CNNs: A Probabilistic View

CVPR 2020, arXiv
0
citations

Self-Supervised Domain-Aware Generative Network for Generalized Zero-Shot Learning

CVPR 2020
0
citations

Object Relational Graph With Teacher-Recommended Learning for Video Captioning

CVPR 2020, arXiv
0
citations

Image De-Raining via Continual Learning

CVPR 2021
0
citations

Structured Multi-Level Interaction Network for Video Moment Localization via Language Query

CVPR 2021
0
citations

Self-Promoted Prototype Refinement for Few-Shot Class-Incremental Learning

CVPR 2021
0
citations

Light Field Super-Resolution With Zero-Shot Learning

CVPR 2021
0
citations

Group-aware Label Transfer for Domain Adaptive Person Re-identification

CVPR 2021, arXiv
0
citations

Rethinking Graph Neural Architecture Search From Message-Passing

CVPR 2021, arXiv
0
citations

Spatial-Temporal Correlation and Topology Learning for Person Re-Identification in Videos

CVPR 2021, arXiv
0
citations

Weakly Supervised High-Fidelity Clothing Model Generation

CVPR 2022, arXiv
0
citations

Few Shot Generative Model Adaption via Relaxed Spatial Structural Alignment

CVPR 2022, arXiv
0
citations

Lifelong Unsupervised Domain Adaptive Person Re-Identification With Coordinated Anti-Forgetting and Adaptation

CVPR 2022, arXiv
0
citations

Self-Sustaining Representation Expansion for Non-Exemplar Class-Incremental Learning

CVPR 2022, arXiv
0
citations

Automatic Relation-Aware Graph Network Proliferation

CVPR 2022, arXiv
0
citations

Multi-Grained Spatio-Temporal Features Perceived Network for Event-Based Lip-Reading

CVPR 2022
0
citations

Bijective Mapping Network for Shadow Removal

CVPR 2022
0
citations

Degradation-Agnostic Correspondence From Resolution-Asymmetric Stereo

CVPR 2022, arXiv
0
citations

EMScore: Evaluating Video Captioning via Coarse-Grained and Fine-Grained Embedding Matching

CVPR 2022, arXiv
0
citations

Temporal Complementarity-Guided Reinforcement Learning for Image-to-Video Person Re-Identification

CVPR 2022
0
citations

Decoupling-and-Aggregating for Image Exposure Correction

CVPR 2023
0
citations

Edge-Aware Regional Message Passing Controller for Image Forgery Localization

CVPR 2023
0
citations

Neural Dependencies Emerging From Learning Massive Categories

CVPR 2023, arXiv
0
citations

Learning To Dub Movies via Hierarchical Prosody Models

CVPR 2023, arXiv
0
citations

Generalized UAV Object Detection via Frequency Domain Disentanglement

CVPR 2023
0
citations

Event-Guided Person Re-Identification via Sparse-Dense Complementary Learning

CVPR 2023
0
citations

Streaming Video Model

CVPR 2023, arXiv
0
citations

JPEG Artifacts Reduction via Deep Convolutional Sparse Coding

ICCV 2019
0
citations

Making History Matter: History-Advantage Sequence Training for Visual Dialog

ICCV 2019
0
citations

Adaptive Reconstruction Network for Weakly Supervised Referring Expression Grounding

ICCV 2019
0
citations

Deep Multiple-Attribute-Perceived Network for Real-World Texture Recognition

ICCV 2019
0
citations

Learning to Assemble Neural Module Tree Networks for Visual Grounding

ICCV 2019
0
citations

Efficient Test-time Adaptive Object Detection via Sensitivity-Guided Pruning

CVPR 2025
0
citations

Learning Dual Priors for JPEG Compression Artifacts Removal

ICCV 2021
0
citations

Learning Conditional Knowledge Distillation for Degraded-Reference Image Quality Assessment

ICCV 2021, arXiv
0
citations

Attack-Guided Perceptual Data Generation for Real-World Re-Identification

ICCV 2021
0
citations

Self-Supervised Visual Representations Learning by Contrastive Mask Prediction

ICCV 2021, arXiv
0
citations

Cross-Patch Graph Convolutional Network for Image Denoising

ICCV 2021
0
citations

Self-supervised Cross-view Representation Reconstruction for Change Captioning

ICCV 2023
0
citations

Regularized Mask Tuning: Uncovering Hidden Knowledge in Pre-Trained Vision-Language Models

ICCV 2023, arXiv
0
citations

Self-Organizing Pathway Expansion for Non-Exemplar Class-Incremental Learning

ICCV 2023
0
citations

Adaptive Frequency Filters As Efficient Global Token Mixers

ICCV 2023, arXiv
0
citations

Text-Driven Generative Domain Adaptation with Spectral Consistency Regularization

ICCV 2023
0
citations

Spatial-Aware Token for Weakly Supervised Object Localization

ICCV 2023, arXiv
0
citations

Grounding 3D Object Affordance from 2D Interactions in Images

ICCV 2023, arXiv
0
citations

Learning Cross-Representation Affinity Consistency for Sparsely Supervised Biomedical Instance Segmentation

ICCV 2023
0
citations

S2N: Suppression-Strengthen Network for Event-Based Recognition under Variant Illuminations

ECCV 2022
0
citations

JPEG Artifacts Removal via Contrastive Representation Learning

ECCV 2022
0
citations

Improving De-Raining Generalization via Neural Reorganization

ICCV 2021
0
citations

UHD-processer: Unified UHD Image Restoration with Progressive Frequency Learning and Degradation-aware Prompts

CVPR 2025
0
citations

Hierarchical Knowledge Prompt Tuning for Multi-task Test-Time Adaptation

CVPR 2025
0
citations

WeGen: A Unified Model for Interactive Multimodal Generation as We Chat

CVPR 2025
0
citations

GREAT: Geometry-Intention Collaborative Inference for Open-Vocabulary 3D Object Affordance Grounding

CVPR 2025
0
citations

SIGMAN: Scaling 3D Human Gaussian Generation with Millions of Assets

ICCV 2025
0
citations

Learning Deep Bilinear Transformation for Fine-grained Image Representation

NeurIPS 2019
0
citations

Abstract Reasoning with Distracting Features

NeurIPS 2019
0
citations

Hierarchical Granularity Transfer Learning

NeurIPS 2020
0
citations

Learning Semantic-aware Normalization for Generative Adversarial Networks

NeurIPS 2020
0
citations

Low-Rank Subspaces in GANs

NeurIPS 2021
0
citations

Stochastic Window Transformer for Image Restoration

NeurIPS 2022
0
citations

Exploring Figure-Ground Assignment Mechanism in Perceptual Organization

NeurIPS 2022
0
citations

Rank Diminishing in Deep Neural Networks

NeurIPS 2022
0
citations

DreamWaltz: Make a Scene with Complex 3D Animatable Avatars

NeurIPS 2023
0
citations