Zhihang Yuan
Papers: 10
Total Citations: 132
Papers (10)
PB-LLM: Partially Binarized Large Language Models
ICLR 2024
80
citations
A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training
CVPR 2025
24
citations
DiTFastAttnV2: Head-wise Attention Compression for Multi-Modality Diffusion Transformers
ICCV 2025
10
citations
R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token Routing
NeurIPS 2025
9
citations
MoEQuant: Enhancing Quantization for Mixture-of-Experts Large Language Models via Expert-Balanced Sampling and Affinity Guidance
ICML 2025
8
citations
DLFR-Gen: Diffusion-based Video Generation with Dynamic Latent Frame Rate
ICCV 2025
1
citation
Learning High-Frequency Functions Made Easy with Sinusoidal Positional Encoding
ICML 2024
0
citations
EA-ViT: Efficient Adaptation for Elastic Vision Transformer
ICCV 2025
0
citations
QuEST: Low-bit Diffusion Model Quantization via Efficient Selective Finetuning
ICCV 2025
0
citations
PillarHist: A Quantization-aware Pillar Feature Encoder based on Height-aware Histogram
CVPR 2025
0
citations