Sayash Kapoor
5
Papers
19
Total Citations
Papers (5)
Establishing Best Practices in Building Rigorous Agentic Benchmarks
NeurIPS 2025
12
citations
Position: Build Agent Advocates, Not Platform Agents
ICML 2025
5
citations
Position: In-House Evaluation Is Not Enough. Towards Robust Third-Party Evaluation and Flaw Disclosure for General-Purpose AI
ICML 2025
2
citations
Position: A Safe Harbor for AI Evaluation and Red Teaming
ICML 2024
0
citations
Position: On the Societal Impact of Open Foundation Models
ICML 2024
0
citations