Yang Yu
20
Papers
68
Total Citations
Papers (20)
ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning
AAAI 2024arXiv
25
citations
Generalizable Task Representation Learning for Offline Meta-Reinforcement Learning with Data Limitations
AAAI 2024arXiv
14
citations
Reward-Consistent Dynamics Models are Strongly Generalizable for Offline Reinforcement Learning
ICLR 2024
14
citations
Efficient Multi-agent Offline Coordination via Diffusion-based Trajectory Stitching
ICLR 2025
5
citations
VA-AR: Learning Velocity-Aware Action Representations with Mixture of Window Attention
AAAI 2025
4
citations
Episodic Return Decomposition by Difference of Implicitly Assigned Sub-trajectory Reward
AAAI 2024arXiv
3
citations
GRAIN: Multi-Granular and Implicit Information Aggregation Graph Neural Network for Heterophilous Graphs
AAAI 2025
2
citations
LLM Data Selection and Utilization via Dynamic Bi-level Optimization
ICML 2025
1
citations
Causality Based Front-door Defense Against Backdoor Attack on Language Models
ICML 2024
0
citations
Debiased Offline Representation Learning for Fast Online Adaptation in Non-stationary Dynamics
ICML 2024
0
citations
Deep Demonstration Tracing: Learning Generalizable Imitator Policy for Runtime Imitation from a Single Demonstration
ICML 2024
0
citations
MedUnifier: Unifying Vision-and-Language Pre-training on Medical Data with Vision Generation Task using Discrete Visual Representations
CVPR 2025
0
citations
Policy-conditioned Environment Models are More Generalizable
ICML 2024
0
citations
GuideNER: Annotation Guidelines Are Better than Examples for In-Context Named Entity Recognition
AAAI 2025
0
citations
Unmixing Before Fusion: A Generalized Paradigm for Multi-Source-based Hyperspectral Image Synthesis
CVPR 2024
0
citations
Learning to Reuse Policies in State Evolvable Environments
ICML 2025
0
citations
Limited Preference Aided Imitation Learning from Imperfect Demonstrations
ICML 2024
0
citations
Offline Transition Modeling via Contrastive Energy Learning
ICML 2024
0
citations
ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models
ICML 2024
0
citations
Energy-Guided Diffusion Sampling for Offline-to-Online Reinforcement Learning
ICML 2024
0
citations