Shikun Zhang
6
Papers
133
Total Citations
Papers (6)
Hallucination Augmented Contrastive Learning for Multimodal Large Language Model
CVPR 2024
116
citations
SymDPO: Boosting In-Context Learning of Large Multimodal Models with Symbol Demonstration Direct Preference Optimization
CVPR 2025
7
citations
TiMix: Text-Aware Image Mixing for Effective Vision-Language Pre-training
AAAI 2024arXiv
6
citations
Labels Need Prompts Too: Mask Matching for Natural Language Understanding Tasks
AAAI 2024arXiv
4
citations
All-Optical Nonlinear Diffractive Deep Network for Ultrafast Image Denoising
CVPR 2025
0
citations
NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models
ICML 2024
0
citations