Shikun Zhang

6

Papers

133

Total Citations

Papers (6)

Hallucination Augmented Contrastive Learning for Multimodal Large Language Model

SymDPO: Boosting In-Context Learning of Large Multimodal Models with Symbol Demonstration Direct Preference Optimization

TiMix: Text-Aware Image Mixing for Effective Vision-Language Pre-training

Labels Need Prompts Too: Mask Matching for Natural Language Understanding Tasks

All-Optical Nonlinear Diffractive Deep Network for Ultrafast Image Denoising

NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models