Longtian Qiu
4
Papers
8
Total Citations
Papers (4)
Lumina-T2X: Scalable Flow-based Large Diffusion Transformer for Flexible Resolution Generation
ICLR 2025
8
citations
NoisyGRPO: Incentivizing Multimodal CoT Reasoning via Noise Injection and Bayesian Estimation
NeurIPS 2025arXiv
0
citations
Mining Fine-Grained Image-Text Alignment for Zero-Shot Captioning via Text-Only Training
AAAI 2024
0
citations
SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models
ICML 2024
0
citations