Poster "llm benchmarking" Papers
2 papers found
ConvCodeWorld: Benchmarking Conversational Code Generation in Reproducible Feedback Environments
Hojae Han, seung-won hwang, Rajhans Samdani et al.
ICLR 2025posterarXiv:2502.19852
12
citations
DataGen: Unified Synthetic Dataset Generation via Large Language Models
Yue Huang, Siyuan Wu, Chujie Gao et al.
ICLR 2025posterarXiv:2406.18966
21
citations