Bo An
33
Papers
197
Total Citations
Papers (33)
Synapse: Trajectory-as-Exemplar Prompting with Memory for Computer Control
ICLR 2024
103
citations
Cradle: Empowering Foundation Agents towards General Computer Control
ICML 2025
67
citations
EarnHFT: Efficient Hierarchical Reinforcement Learning for High Frequency Trading
AAAI 2024 (arXiv)
24
citations
Representation Surgery in Model Merging with Probabilistic Modeling
ICML 2025
2
citations
OpticalNet: An Optical Imaging Dataset and Benchmark Beyond the Diffraction Limit
CVPR 2025
1
citations
Improving Unsupervised Hierarchical Representation with Reinforcement Learning
CVPR 2024
0
citations
Resisting Stochastic Risks in Diffusion Planners with the Trajectory Aggregation Tree
ICML 2024
0
citations
Safe and Robust Subgame Exploitation in Imperfect Information Games
ICML 2024
0
citations
Configurable Mirror Descent: Towards a Unification of Decision Making
ICML 2024
0
citations
Latent Logic Tree Extraction for Event Sequence Explanation from LLMs
ICML 2024
0
citations
DAG-Based Column Generation for Adversarial Team Games
ICML 2024
0
citations
Combating Noisy Labels by Agreement: A Joint Training Method with Co-Regularization
CVPR 2020 (arXiv)
0
citations
DO-GAN: A Double Oracle Framework for Generative Adversarial Networks
CVPR 2022
0
citations
Empirical Study on Robustness and Resilience in Cooperative Multi-Agent Reinforcement Learning
NeurIPS 2025
0
citations
Influence-Based Fair Selection for Sample-Discriminative Backdoor Attack
AAAI 2025
0
citations
Market-GAN: Adding Control to Financial Market Data Generation with Semantic Context
AAAI 2024
0
citations
Transition-Informed Reinforcement Learning for Large-Scale Stackelberg Mean-Field Games
AAAI 2024
0
citations
Manipulating a Learning Defender and Ways to Counteract
NeurIPS 2019
0
citations
Provably Consistent Partial-Label Learning
NeurIPS 2020
0
citations
Open-set Label Noise Can Improve Robustness Against Inherent Label Noise
NeurIPS 2021
0
citations
RMIX: Learning Risk-Sensitive Policies for Cooperative Reinforcement Learning Agents
NeurIPS 2021
0
citations
Generalizing Consistent Multi-Class Classification with Rejection to be Compatible with Arbitrary Losses
NeurIPS 2022
0
citations
Out-of-Distribution Detection with An Adaptive Likelihood Ratio on Informative Hierarchical VAE
NeurIPS 2022
0
citations
Alleviating "Posterior Collapse" in Deep Topic Models via Policy Gradient
NeurIPS 2022
0
citations
Deep Attentive Belief Propagation: Integrating Reasoning and Learning for Solving Constraint Optimization Problems
NeurIPS 2022
0
citations
Few-shot Generation via Recalling Brain-Inspired Episodic-Semantic Memory
NeurIPS 2023
0
citations
Computing Optimal Nash Equilibria in Multiplayer Games
NeurIPS 2023
0
citations
On the Importance of Feature Separability in Predicting Out-Of-Distribution Error
NeurIPS 2023
0
citations
State Regularized Policy Optimization on Data with Dynamics Shift
NeurIPS 2023
0
citations
In Defense of Softmax Parametrization for Calibrated and Consistent Learning to Defer
NeurIPS 2023
0
citations
Regression with Cost-based Rejection
NeurIPS 2023
0
citations
TradeMaster: A Holistic Quantitative Trading Platform Empowered by Reinforcement Learning
NeurIPS 2023
0
citations
Offline RL with Discrete Proxy Representations for Generalizability in POMDPs
NeurIPS 2023
0
citations