Xuan Shen
10
Papers
30
Total Citations
Papers (10)
Numerical Pruning for Efficient Autoregressive Models
AAAI 2025
22
citations
Sparse Learning for State Space Models on Mobile
ICLR 2025
8
citations
LazyDiT: Lazy Learning for the Acceleration of Diffusion Transformers
AAAI 2025
0
citations
Agile-Quant: Activation-Guided Quantization for Faster Inference of LLMs on the Edge
AAAI 2024
0
citations
NPAS: A Compiler-Aware Framework of Unified Network Pruning and Architecture Search for Beyond Real-Time Mobile Acceleration
CVPR 2021arXiv
0
citations
DeepMAD: Mathematical Architecture Design for Deep Convolutional Neural Network
CVPR 2023arXiv
0
citations
QuartDepth: Post-Training Quantization for Real-Time Depth Estimation on the Edge
CVPR 2025
0
citations
Toward Adaptive Large Language Models Structured Pruning via Hybrid-grained Weight Importance Assessment
AAAI 2025
0
citations
SPViT: Enabling Faster Vision Transformers via Latency-Aware Soft Token Pruning
ECCV 2022
0
citations
Sanity Checks for Lottery Tickets: Does Your Winning Ticket Really Win the Jackpot?
NeurIPS 2021
0
citations