Zhiliang Peng
6
Papers
1,032
Total Citations
Papers (6)
Grounding Multimodal Large Language Models to the World
ICLR 2024
1,032
citations
Image as a Foreign Language: BEiT Pretraining for Vision and Vision-Language Tasks
CVPR 2023
0
citations
Generic-to-Specific Distillation of Masked Autoencoders
CVPR 2023 (arXiv)
0
citations
Conformer: Local Features Coupling Global Representations for Visual Recognition
ICCV 2021 (arXiv)
0
citations
TS-CAM: Token Semantic Coupled Attention Map for Weakly Supervised Object Localization
ICCV 2021
0
citations
Integrally Migrating Pre-trained Transformer Encoder-decoders for Visual Object Detection
ICCV 2023 (arXiv)
0
citations