Wei-Lin Chiang
7 Papers
555 Total Citations
Papers (7)
From Crowdsourced Data to High-quality Benchmarks: Arena-Hard and Benchbuilder Pipeline
ICML 2025
329 citations
OR-Bench: An Over-Refusal Benchmark for Large Language Models
ICML 2025
97 citations
How to Evaluate Reward Models for RLHF
ICLR 2025
50 citations
LLM-Assisted Code Cleaning For Training Accurate Code Generators
ICLR 2024
43 citations
RouteLLM: Learning to Route LLMs from Preference Data
ICLR 2025
24 citations
VisionArena: 230k Real World User-VLM Conversations with Preference Labels
CVPR 2025
12 citations
Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference
ICML 2024
0 citations