Ryo Karakida
Papers: 4
Total Citations: 1
Papers (4)
Infinite-Width Limit of a Single Attention Layer: Analysis via Tensor Programs
NeurIPS 2025 (arXiv)
1 citation
Local Loss Optimization in the Infinite Width: Stable Parameterization of Predictive Coding Networks and Target Propagation
ICLR 2025 (arXiv)
0 citations
Understanding MLP-Mixer as a wide and sparse MLP
ICML 2024
0 citations
Self-attention Networks Localize When QK-eigenspectrum Concentrates
ICML 2024
0 citations