Banggu Wu
4
Papers
46
Total Citations
Papers (4)
Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling
ICML 2025
26
citations
Hyper-Connections
ICLR 2025
20
citations
ECA-Net: Efficient Channel Attention for Deep Convolutional Neural Networks
CVPR 2020
0
citations
What Deep CNNs Benefit From Global Covariance Pooling: An Optimization Perspective
CVPR 2020arXiv
0
citations