Xiaohui Shen
8
Papers
143
Total Citations
Papers (8)
MaskBit: Embedding-free Image Generation via Bit Tokens
ICLR 2025
72
citations
Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generation
ICCV 2025arXiv
49
citations
COCONut: Modernizing COCO Segmentation
CVPR 2024
22
citations
Randomized Autoregressive Visual Generation
ICCV 2025
0
citations
Democratizing Text-to-Image Masked Generative Models with Compact Text-Aware One-Dimensional Tokens
ICCV 2025
0
citations
MV-Adapter: Multimodal Video Transfer Learning for Video Text Retrieval
CVPR 2024
0
citations
ViTamin: Designing Scalable Vision Models in the Vision-Language Era
CVPR 2024
0
citations
D-Attn: Decomposed Attention for Large Vision-and-Language Model
ICCV 2025
0
citations