Greg Durrett
5
Papers
385
Total Citations
1
Affiliations
Affiliations
The University of Texas at Austin
Papers (5)
To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning
ICLR 2025arXiv
239
citations
MuSR: Testing the Limits of Chain-of-thought with Multistep Soft Reasoning
ICLR 2024
131
citations
CLEVER: A Curated Benchmark for Formally Verified Code Generation
NeurIPS 2025arXiv
10
citations
Sparta Alignment: Collectively Aligning Multiple Language Models through Combat
NeurIPS 2025arXiv
3
citations
AstroVisBench: A Code Benchmark for Scientific Computing and Visualization in Astronomy
NeurIPS 2025arXiv
2
citations