Yao Shu
4
Papers
7
Total Citations
1
Affiliations
Affiliations
Hong Kong University of Science and Technology (Guangzhou)
Papers (4)
ReDit: Reward Dithering for Improved LLM Policy Optimization
NeurIPS 2025
4
citations
Ferret: Federated Full-Parameter Tuning at Scale for Large Language Models
ICML 2025
3
citations
FSL-Rectifier: Rectify Outliers in Few-Shot Learning via Test-Time Augmentation
AAAI 2025
0
citations
Use Your INSTINCT: INSTruction optimization for LLMs usIng Neural bandits Coupled with Transformers
ICML 2024
0
citations