ICLR Poster "benchmark development" Papers
2 papers found
Do as We Do, Not as You Think: the Conformity of Large Language Models
Zhiyuan Weng, Guikun Chen, Wenguan Wang
ICLR 2025posterarXiv:2501.13381
18
citations
How efficient is LLM-generated code? A rigorous & high-standard benchmark
Ruizhong Qiu, Weiliang Zeng, James Ezick et al.
ICLR 2025posterarXiv:2406.06647
43
citations