Tongran Liu
3
Papers
0
Total Citations
Papers (3)
MRO: Enhancing Reasoning in Diffusion Language Models via Multi-Reward Optimization
NeurIPS 2025arXiv
0
citations
RoVRM: A Robust Visual Reward Model Optimized via Auxiliary Textual Preference Data
AAAI 2025
0
citations
ESRL: Efficient Sampling-Based Reinforcement Learning for Sequence Generation
AAAI 2024
0
citations