Poster "sub-quadratic complexity" Papers
2 papers found
A Training-Free Sub-quadratic Cost Transformer Model Serving Framework with Hierarchically Pruned Attention
Heejun Lee, Geon Park, Youngwan Lee et al.
ICLR 2025posterarXiv:2406.09827
8
citations
Fast attention mechanisms: a tale of parallelism
Jingwen Liu, Hantao Yu, Clayton Sanford et al.
NeurIPS 2025posterarXiv:2509.09001