"gradient vanishing" Papers
3 papers found
Autaptic Synaptic Circuit Enhances Spatio-temporal Predictive Learning of Spiking Neural Networks
Lihao Wang, Zhaofei Yu
ICML 2024 oral
Bi-ViT: Pushing the Limit of Vision Transformer Quantization
Yanjing Li, Sheng Xu, Mingbao Lin et al.
AAAI 2024 paper arXiv:2305.12354
DiNADO: Norm-Disentangled Neurally-Decomposed Oracles for Controlling Language Models
Sidi Lu, Wenbo Zhao, Chenyang Tao et al.
ICML 2024 poster