by Simon Du Toit Papers
3 papers found
Breaking the Performance Ceiling in Reinforcement Learning requires Inference Strategies
Felix Chalumeau, Daniel Rajaonarivonivelomanantsoa, Ruan John de Kock et al.
NeurIPS 2025oral
Oryx: a Scalable Sequence Model for Many-Agent Coordination in Offline MARL
Juan Formanek, Omayma Mahjoub, Louay Nessir et al.
NeurIPS 2025oral
Sable: a Performant, Efficient and Scalable Sequence Model for MARL
Omayma Mahjoub, Sasha Abramowitz, Ruan de Kock et al.
ICML 2025oral
4
citations