Rio Yokota
4
Papers
9
Total Citations
Papers (4)
Drop-Upcycling: Training Sparse Mixture of Experts with Partial Re-initialization
ICLR 2025
8
citations
Variational Learning Finds Flatter Solutions at the Edge of Stability
NeurIPS 2025arXiv
1
citations
Local Loss Optimization in the Infinite Width: Stable Parameterization of Predictive Coding Networks and Target Propagation
ICLR 2025
0
citations
Variational Learning is Effective for Large Deep Networks
ICML 2024
0
citations