Papers by XUCHEN
4 papers found
MambaQuant: Quantizing the Mamba Family with Variance Aligned Rotation Methods
Dawei Yang, Yuxuan Yue, Xing Hu et al.
ICLR 2025 poster
MoEQuant: Enhancing Quantization for Mixture-of-Experts Large Language Models via Expert-Balanced Sampling and Affinity Guidance
Zhixuan Chen, Xing Hu, Dawei Yang et al.
ICML 2025 poster
8 citations
OSTQuant: Refining Large Language Model Quantization with Orthogonal and Scaling Transformations for Better Distribution Fitting
Xing Hu, Yuan Cheng, Dawei Yang et al.
ICLR 2025 poster
RWKVQuant: Quantizing the RWKV Family with Proxy Guided Hybrid of Scalar and Vector Quantization
XUCHEN, Yuxuan Yue, Zukang Xu et al.
ICML 2025 poster