Weinan Zhang

27
Papers
365
Total Citations

Papers (27)

Vision-Language Foundation Models as Effective Robot Imitators

ICLR 2024
310
citations

ReMA: Learning to Meta-Think for LLMs with Multi-agent Reinforcement Learning

NeurIPS 2025arXiv
36
citations

Autonomous Goal Detection and Cessation in Reinforcement Learning: A Case Study on Source Term Estimation

AAAI 2025
5
citations

Beyond Graph Convolution: Multimodal Recommendation with Topology-aware MLPs

AAAI 2025
4
citations

Score-Based Diffusion Policy Compatible with Reinforcement Learning via Optimal Transport

ICML 2025
4
citations

GenPO: Generative Diffusion Models Meet On-Policy Reinforcement Learning

NeurIPS 2025
3
citations

Information-Theoretic Reward Decomposition for Generalizable RLHF

NeurIPS 2025
3
citations

ContraDiff: Planning Towards High Return States via Contrastive Learning

ICLR 2025
0
citations

AlphaZero-Like Tree-Search can Guide Large Language Model Decoding and Training

ICML 2024
0
citations

DiffStitch: Boosting Offline Reinforcement Learning with Diffusion-based Trajectory Stitching

ICML 2024
0
citations

Bootstrapped Transformer for Offline Reinforcement Learning

NeurIPS 2022
0
citations

PerfectDou: Dominating DouDizhu with Perfect Information Distillation

NeurIPS 2022
0
citations

Lending Interaction Wings to Recommender Systems with Conversational Agents

NeurIPS 2023
0
citations

Diffusion Model is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement Learning

NeurIPS 2023
0
citations

Path-Level Network Transformation for Efficient Architecture Search

ICML 2018
0
citations

Mean Field Multi-Agent Reinforcement Learning

ICML 2018
0
citations

CoT: Cooperative Training for Generative Modeling of Discrete Data

ICML 2019
0
citations

Lipschitz Generative Adversarial Nets

ICML 2019
0
citations

Model-based Policy Optimization with Unsupervised Model Adaptation

NeurIPS 2020
0
citations

Efficient Projection-free Algorithms for Saddle Point Problems

NeurIPS 2020
0
citations

On Effective Scheduling of Model-based Reinforcement Learning

NeurIPS 2021
0
citations

Curriculum Offline Imitating Learning

NeurIPS 2021
0
citations

Reinforcement Learning with Automated Auxiliary Loss Search

NeurIPS 2022
0
citations

Honor of Kings Arena: an Environment for Generalization in Competitive Reinforcement Learning

NeurIPS 2022
0
citations

Learning Enhanced Representation for Tabular Data via Neighborhood Propagation

NeurIPS 2022
0
citations

Multi-Agent Reinforcement Learning is a Sequence Modeling Problem

NeurIPS 2022
0
citations

NeoRL: A Near Real-World Benchmark for Offline Reinforcement Learning

NeurIPS 2022
0
citations