CVPR 2025 "transformer architecture" Papers
4 papers found
ComRoPE: Scalable and Robust Rotary Position Embedding Parameterized by Trainable Commuting Angle Matrices
Hao Yu, Tangyu Jiang, Shuning Jia et al.
CVPR 2025posterarXiv:2506.03737
3
citations
Enhancing 3D Gaze Estimation in the Wild using Weak Supervision with Gaze Following Labels
Pierre Vuillecard, Jean-marc Odobez
CVPR 2025posterarXiv:2502.20249
7
citations
Stretching Each Dollar: Diffusion Training from Scratch on a Micro-Budget
Vikash Sehwag, Xianghao Kong, Jingtao Li et al.
CVPR 2025posterarXiv:2407.15811
26
citations
Transformers without Normalization
Jiachen Zhu, Xinlei Chen, Kaiming He et al.
CVPR 2025posterarXiv:2503.10622
96
citations