Faeze Brahman
3
Papers
190
Total Citations
Papers (3)
WildBench: Benchmarking LLMs with Challenging Tasks from Real Users in the Wild
ICLR 2025arXiv
142
citations
Trust or Escalate: LLM Judges with Provable Guarantees for Human Agreement
ICLR 2025arXiv
42
citations
PlaSma: Procedural Knowledge Models for Language-based Planning and Re-Planning
ICLR 2024
6
citations