In value-based deep reinforcement learning, a pruned network is a good network

Ranked #10 of 2635 papers in ICML 2024

Authors

Johan Obando Ceron, Aaron Courville, Pablo Samuel Castro

Topics

deep reinforcement learning, value-based methods, sparse training, network pruning, parameter efficiency, gradual magnitude pruning

Abstract

Recent work has shown that deep reinforcement learning agents have difficulty in effectively using their network parameters. We leverage prior insights into the advantages of sparse training techniques and demonstrate that gradual magnitude pruning enables value-based agents to maximize parameter effectiveness. This results in networks that yield dramatic performance improvements over traditional networks, using only a small fraction of the full network parameters. Our code is publicly available; see Appendix A for details.
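
The abstract names gradual magnitude pruning without describing its mechanics. The following is a minimal sketch of the general idea, assuming the cubic polynomial sparsity schedule commonly paired with this technique: the target sparsity ramps up from zero to a final value over a window of training steps, and at each step the smallest-magnitude weights are masked to zero. The function names, the flat-list weight representation, and every hyperparameter here are illustrative assumptions, not the authors' code or settings.

```python
def target_sparsity(step, start_step, end_step, final_sparsity):
    """Cubic polynomial sparsity schedule (assumed form), clamped outside the window."""
    if step <= start_step:
        return 0.0
    if step >= end_step:
        return final_sparsity
    progress = (step - start_step) / (end_step - start_step)
    return final_sparsity * (1.0 - (1.0 - progress) ** 3)


def magnitude_mask(weights, sparsity):
    """Return a 0/1 mask that zeroes the smallest-magnitude fraction of weights."""
    num_pruned = int(round(sparsity * len(weights)))
    # Indices ordered by absolute value, smallest first.
    order = sorted(range(len(weights)), key=lambda i: abs(weights[i]))
    mask = [1] * len(weights)
    for i in order[:num_pruned]:
        mask[i] = 0
    return mask


def apply_mask(weights, mask):
    """Zero out the pruned weights; surviving weights are left unchanged."""
    return [w * m for w, m in zip(weights, mask)]


if __name__ == "__main__":
    # Hypothetical layer weights and schedule boundaries, for illustration only.
    weights = [0.5, -0.1, 0.03, -0.9, 0.2, 0.01, -0.4, 0.7, 0.05, -0.3]
    for step in (0, 50, 100):
        s = target_sparsity(step, start_step=20, end_step=80, final_sparsity=0.9)
        pruned = apply_mask(weights, magnitude_mask(weights, s))
        print(step, round(s, 4), pruned)
```

In a training loop, the mask would typically be recomputed periodically and reapplied after each gradient update so that pruned weights stay at zero; how often to do so is a design choice this sketch leaves open.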
