Shuming Ma
6
Papers
1,128
Total Citations
Papers (6)
Grounding Multimodal Large Language Models to the World
ICLR 2024
1,032
citations
Towards Thinking-Optimal Scaling of Test-Time Compute for LLM Reasoning
NeurIPS 2025 (arXiv)
96
citations
On the Representation Collapse of Sparse Mixture of Experts
NeurIPS 2022
0
citations
On the Pareto Front of Multilingual Neural Machine Translation
NeurIPS 2023
0
citations
Language Is Not All You Need: Aligning Perception with Language Models
NeurIPS 2023
0
citations
meProp: Sparsified Back Propagation for Accelerated Deep Learning with Reduced Overfitting
ICML 2017
0
citations