Shi Feng

Papers: 4

Total Citations: 78

Papers (4)

Language Models Learn to Mislead Humans via RLHF

Predicting Empirical AI Research Outcomes with Language Models

Peer Prediction for Learning Agents

Understanding Impacts of High-Order Loss Approximations and Features in Deep Learning Interpretation