Zheng-Jun Zha

30
Papers
115
Total Citations

Papers (30)

Revisiting Single Image Reflection Removal In the Wild

CVPR 2024
37
citations

Improved Video VAE for Latent Video Diffusion Model

CVPR 2025, arXiv
19
citations

QMambaBSR: Burst Image Super-Resolution with Query State Space Model

CVPR 2025
19
citations

Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning

CVPR 2025
18
citations

MMAR: Towards Lossless Multi-Modal Auto-Regressive Probabilistic Modeling

CVPR 2025
13
citations

PMQ-VE: Progressive Multi-Frame Quantization for Video Enhancement

NeurIPS 2025
9
citations

EVDM: Event-based Real-world Video Deblurring with Mamba

ICCV 2025
0
citations

Decouple to Reconstruct: High Quality UHD Restoration via Active Feature Disentanglement and Reversible Fusion

ICCV 2025
0
citations

HERO: Human Reaction Generation from Videos

ICCV 2025
0
citations

MATE: Motion-Augmented Temporal Consistency for Event-based Point Tracking

ICCV 2025
0
citations

EMoTive: Event-guided Trajectory Modeling for 3D Motion Estimation

ICCV 2025
0
citations

Enhanced Pansharpening via Quaternion Spatial-Spectral Interactions

ICCV 2025
0
citations

EventMamba: Enhancing Spatio-Temporal Locality with State Space Models for Event-Based Video Reconstruction

AAAI 2025
0
citations

SCott: Accelerating Diffusion Models with Stochastic Consistency Distillation

AAAI 2025
0
citations

Boosting Image De-Raining via Central-Surrounding Synergistic Convolution

AAAI 2025
0
citations

DCTMamba: Advancing JPEG Image Restoration Through Long-Sequence Modeling and Adaptive Frequency Strategy

AAAI 2025
0
citations

HOIMamba: Efficient Mamba-based Disentangled Progressive Learning for HOI Detection

AAAI 2025
0
citations

A Lottery Ticket Hypothesis Approach with Sparse Fine-tuning and MAE for Image Forgery Detection and Localization

AAAI 2025
0
citations

Fusion-Vital: Video-RF Fusion Transformer for Advanced Remote Physiological Measurement

AAAI 2024
0
citations

Learning Discriminative Noise Guidance for Image Forgery Detection and Localization

AAAI 2024
0
citations

HomoFormer: Homogenized Transformer for Image Shadow Removal

CVPR 2024
0
citations

LEMON: Learning 3D Human-Object Interaction Relation from 2D Images

CVPR 2024
0
citations

Prompt-Enhanced Multiple Instance Learning for Weakly Supervised Video Anomaly Detection

CVPR 2024
0
citations

Efficient Test-time Adaptive Object Detection via Sensitivity-Guided Pruning

CVPR 2025
0
citations

CCM: Real-Time Controllable Visual Content Creation Using Text-to-Image Consistency Models

ICML 2024
0
citations

UHD-processer: Unified UHD Image Restoration with Progressive Frequency Learning and Degradation-aware Prompts

CVPR 2025
0
citations

Hierarchical Knowledge Prompt Tuning for Multi-task Test-Time Adaptation

CVPR 2025
0
citations

WeGen: A Unified Model for Interactive Multimodal Generation as We Chat

CVPR 2025
0
citations

GREAT: Geometry-Intention Collaborative Inference for Open-Vocabulary 3D Object Affordance Grounding

CVPR 2025
0
citations

SIGMAN: Scaling 3D Human Gaussian Generation with Millions of Assets

ICCV 2025
0
citations