2024 Poster "inference efficiency" Papers
4 papers found
APT: Adaptive Pruning and Tuning Pretrained Language Models for Efficient Training and Inference
Bowen Zhao, Hannaneh Hajishirzi, Qingqing Cao
ICML 2024poster
Deciphering RNA Secondary Structure Prediction: A Probabilistic K-Rook Matching Perspective
Cheng Tan, Zhangyang Gao, Hanqun CAO et al.
ICML 2024poster
Efficient Denoising Diffusion via Probabilistic Masking
Weizhong Zhang, Zhiwei Zhang, Renjie Pi et al.
ICML 2024poster
Tandem Transformers for Inference Efficient LLMs
Aishwarya P S, Pranav Nair, Yashas Samaga et al.
ICML 2024poster