NEURIPS 2025 "clustering dynamics" Papers
2 papers found
A multiscale analysis of mean-field transformers in the moderate interaction regime
Giuseppe Bruno, Federico Pasqualotto, Andrea Agazzi
NEURIPS 2025, oral, arXiv:2509.25040
7 citations
Normalization in Attention Dynamics
Nikita Karagodin, Shu Ge, Yury Polyanskiy et al.
NEURIPS 2025, poster, arXiv:2510.22026
2 citations