Poster "inference speedup" Papers
2 papers found
Ada-K Routing: Boosting the Efficiency of MoE-based LLMs
Zijia Zhao, Longteng Guo, Jie Cheng et al.
ICLR 2025posterarXiv:2410.10456
8
citations
Make Your ViT-based Multi-view 3D Detectors Faster via Token Compression
Dingyuan Zhang, Dingkang Liang, Zichang Tan et al.
ECCV 2024posterarXiv:2409.00633
4
citations