Zheng-Jun Zha
30
Papers
115
Total Citations
Papers (30)
Revisiting Single Image Reflection Removal In the Wild
CVPR 2024
37
citations
Improved Video VAE for Latent Video Diffusion Model
CVPR 2025arXiv
19
citations
QMambaBSR: Burst Image Super-Resolution with Query State Space Model
CVPR 2025
19
citations
Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning
CVPR 2025
18
citations
MMAR: Towards Lossless Multi-Modal Auto-Regressive Probabilistic Modeling
CVPR 2025
13
citations
PMQ-VE: Progressive Multi-Frame Quantization for Video Enhancement
NeurIPS 2025
9
citations
EVDM: Event-based Real-world Video Deblurring with Mamba
ICCV 2025
0
citations
Decouple to Reconstruct: High Quality UHD Restoration via Active Feature Disentanglement and Reversible Fusion
ICCV 2025
0
citations
HERO: Human Reaction Generation from Videos
ICCV 2025
0
citations
MATE: Motion-Augmented Temporal Consistency for Event-based Point Tracking
ICCV 2025
0
citations
EMoTive: Event-guided Trajectory Modeling for 3D Motion Estimation
ICCV 2025
0
citations
Enhanced Pansharpening via Quaternion Spatial-Spectral Interactions
ICCV 2025
0
citations
EventMamba: Enhancing Spatio-Temporal Locality with State Space Models for Event-Based Video Reconstruction
AAAI 2025
0
citations
SCott: Accelerating Diffusion Models with Stochastic Consistency Distillation
AAAI 2025
0
citations
Boosting Image De-Raining via Central-Surrounding Synergistic Convolution
AAAI 2025
0
citations
DCTMamba: Advancing JPEG Image Restoration Through Long-Sequence Modeling and Adaptive Frequency Strategy
AAAI 2025
0
citations
HOIMamba: Efficient Mamba-based Disentangled Progressive Learning for HOI Detection
AAAI 2025
0
citations
A Lottery Ticket Hypothesis Approach with Sparse Fine-tuning and MAE for Image Forgery Detection and Localization
AAAI 2025
0
citations
Fusion-Vital: Video-RF Fusion Transformer for Advanced Remote Physiological Measurement
AAAI 2024
0
citations
780 Learning Discriminative Noise Guidance for Image Forgery Detection and Localization
AAAI 2024
0
citations
HomoFormer: Homogenized Transformer for Image Shadow Removal
CVPR 2024
0
citations
LEMON: Learning 3D Human-Object Interaction Relation from 2D Images
CVPR 2024
0
citations
Prompt-Enhanced Multiple Instance Learning for Weakly Supervised Video Anomaly Detection
CVPR 2024
0
citations
Efficient Test-time Adaptive Object Detection via Sensitivity-Guided Pruning
CVPR 2025
0
citations
CCM: Real-Time Controllable Visual Content Creation Using Text-to-Image Consistency Models
ICML 2024
0
citations
UHD-processer: Unified UHD Image Restoration with Progressive Frequency Learning and Degradation-aware Prompts
CVPR 2025
0
citations
Hierarchical Knowledge Prompt Tuning for Multi-task Test-Time Adaptation
CVPR 2025
0
citations
WeGen: A Unified Model for Interactive Multimodal Generation as We Chat
CVPR 2025
0
citations
GREAT: Geometry-Intention Collaborative Inference for Open-Vocabulary 3D Object Affordance Grounding
CVPR 2025
0
citations
SIGMAN: Scaling 3D Human Gaussian Generation with Millions of Assets
ICCV 2025
0
citations