Huaimin Wang
5
Papers
21
Total Citations
Papers (5)
Enhancing Decision-Making for LLM Agents via Step-Level Q-Value Models
AAAI 2025
21
citations
Knowledge Memorization and Rumination for Pre-trained Model-based Class-Incremental Learning
CVPR 2025
0
citations
Maintaining Fairness in Logit-based Knowledge Distillation for Class-Incremental Learning
AAAI 2025
0
citations
Optimistic Model Rollouts for Pessimistic Offline Policy Optimization
AAAI 2024arXiv
0
citations
Iterative Regularized Policy Optimization with Imperfect Demonstrations
ICML 2024
0
citations