Poster "pipeline parallelism" Papers
4 papers found
DynaPipe: Dynamic Layer Redistribution for Efficient Serving of LLMs with Pipeline Parallelism
HongXin Xu, Tianyu Guo, Xianwei Zhang
NeurIPS 2025poster
EE-LLM: Large-Scale Training and Inference of Early-Exit Large Language Models with 3D Parallelism
Yanxi Chen, Xuchen Pan, Yaliang Li et al.
ICML 2024poster
HexGen: Generative Inference of Large Language Model over Heterogeneous Environment
Youhe Jiang, Ran Yan, Xiaozhe Yao et al.
ICML 2024poster
Position: Exploring the Robustness of Pipeline-Parallelism-Based Decentralized Training
Lin Lu, Chenxi Dai, Wangcheng Tao et al.
ICML 2024poster