Jinguo Zhu
Papers: 9
Total Citations: 34
Papers (9)
SynerGen-VL: Towards Synergistic Image Understanding and Generation with Vision Experts and Token Folding
CVPR 2025
34
citations
V2PE: Improving Multimodal Long-Context Capability of Vision-Language Models with Variable Visual Position Encoding
ICCV 2025
0
citations
Enhancing the Outcome Reward-based RL Training of MLLMs with Self-Consistency Sampling
NeurIPS 2025
0
citations
Complementary Relation Contrastive Distillation
CVPR 2021, arXiv
0
citations
Layerwise Optimization by Gradient Decomposition for Continual Learning
CVPR 2021, arXiv
0
citations
Uni-Perceiver: Pre-Training Unified Architecture for Generic Perception for Zero-Shot and Few-Shot Tasks
CVPR 2022
0
citations
Uni-Perceiver v2: A Generalist Model for Large-Scale Vision and Vision-Language Tasks
CVPR 2023, arXiv
0
citations
Uni-Perceiver-MoE: Learning Sparse Generalist Models with Conditional MoEs
NeurIPS 2022
0
citations
VLATTACK: Multimodal Adversarial Attacks on Vision-Language Tasks via Pre-trained Models
NeurIPS 2023
0
citations