Shijie Geng
6
Papers
8
Total Citations
Papers (6)
Lumina-T2X: Scalable Flow-based Large Diffusion Transformer for Flexible Resolution Generation
ICLR 2025
8
citations
SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models
ICML 2024
0
citations
Revisiting Multimodal Representation in Contrastive Learning: From Patch and Token Embeddings to Finite Discrete Tokens
CVPR 2023arXiv
0
citations
Hierarchically Self-Supervised Transformer for Human Skeleton Representation Learning
ECCV 2022
0
citations
COMPOSER: Compositional Reasoning of Group Activity in Videos with Keypoint-Only Modality
ECCV 2022
0
citations
Frozen CLIP Models Are Efficient Video Learners
ECCV 2022
0
citations