Poster "inference latency reduction" Papers
3 papers found
BEEM: Boosting Performance of Early Exit DNNs using Multi-Exit Classifiers as Experts
Divya Jyoti Bajpai, Manjesh Kumar Hanawal
ICLR 2025posterarXiv:2502.00745
3
citations
Block-Attention for Efficient Prefilling
Dongyang Ma, Yan Wang, Tian Lan
ICLR 2025posterarXiv:2409.15355
14
citations
3D Human Pose Estimation via Non-Causal Retentive Networks
Kaili Zheng, Feixiang Lu, Yihao Lv et al.
ECCV 2024poster