Pierre Ablin
13
Papers
62
Total Citations
Papers (13)
Theory, Analysis, and Best Practices for Sigmoid Self-Attention
ICLR 2025arXiv
34
citations
The AdEMAMix Optimizer: Better, Faster, Older
ICLR 2025arXiv
23
citations
Shielded Diffusion: Generating Novel and Diverse Images using Sparse Repellency
ICML 2025arXiv
5
citations
Soup-of-Experts: Pretraining Specialist Models via Parameters Averaging
ICML 2025arXiv
0
citations
Careful with that Scalpel: Improving Gradient Surgery with an EMA
ICML 2024arXiv
0
citations
How Smooth Is Attention?
ICML 2024arXiv
0
citations
Optimization without Retraction on the Random Generalized Stiefel Manifold
ICML 2024arXiv
0
citations
Modeling Shared responses in Neuroimaging Studies through MultiView ICA
NeurIPS 2020arXiv
0
citations
Shared Independent Component Analysis for Multi-Subject Neuroimaging
NeurIPS 2021arXiv
0
citations
Benchopt: Reproducible, efficient and collaborative optimization benchmarks
NeurIPS 2022arXiv
0
citations
A framework for bilevel optimization that enables stochastic and global variance reduction algorithms
NeurIPS 2022arXiv
0
citations
Do Residual Neural Networks discretize Neural Ordinary Differential Equations?
NeurIPS 2022arXiv
0
citations
How to Scale Your EMA
NeurIPS 2023arXiv
0
citations