Ming Yan
10
Papers
784
Total Citations
Papers (10)
mPLUG-Owl2: Revolutionizing Multi-modal Large Language Model with Modality Collaboration
CVPR 2024
601
citations
Hallucination Augmented Contrastive Learning for Multimodal Large Language Model
CVPR 2024
116
citations
WritingBench: A Comprehensive Benchmark for Generative Writing
NeurIPS 2025arXiv
41
citations
RELI11D: A Comprehensive Multimodal Human Motion Dataset and Method
CVPR 2024
9
citations
SymDPO: Boosting In-Context Learning of Large Multimodal Models with Symbol Demonstration Direct Preference Optimization
CVPR 2025
7
citations
TiMix: Text-Aware Image Mixing for Effective Vision-Language Pre-training
AAAI 2024arXiv
6
citations
AdaMMS: Model Merging for Heterogeneous Multimodal Large Language Models with Unsupervised Coefficient Optimization
CVPR 2025
4
citations
DiDA: Disambiguated Domain Alignment for Cross-Domain Retrieval with Partial Labels
AAAI 2024
0
citations
RoDA: Robust Domain Alignment for Cross-Domain Retrieval Against Label Noise
AAAI 2025
0
citations
ClimbingCap: Multi-Modal Dataset and Method for Rock Climbing in World Coordinate
CVPR 2025arXiv
0
citations