Poster by Derek Wong Papers
4 papers found
Are Large Reasoning Models Good Translation Evaluators? Analysis and Performance Boost
Runzhe Zhan, Zhihong Huang, Xinyi Yang et al.
NeurIPS 2025poster
1
citations
DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory
Yutong Wang, Jiali Zeng, Xuebo Liu et al.
ICLR 2025poster
20
citations
Is Your Model Really A Good Math Reasoner? Evaluating Mathematical Reasoning with Checklist
Zihao Zhou, Shudong Liu, Maizhen Ning et al.
ICLR 2025poster
Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation
Yiming Wang, Pei Zhang, Baosong Yang et al.
ICLR 2025poster
32
citations