Zhihang Yuan
16
Papers
132
Total Citations
Papers (16)
PB-LLM: Partially Binarized Large Language Models
ICLR 2024
80
citations
A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training
CVPR 2025
24
citations
DiTFastAttnV2: Head-wise Attention Compression for Multi-Modality Diffusion Transformers
ICCV 2025
10
citations
R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token Routing
NeurIPS 2025
9
citations
MoEQuant: Enhancing Quantization for Mixture-of-Experts Large Language Models via Expert-Balanced Sampling and Affinity Guidance
ICML 2025
8
citations
DLFR-Gen: Diffusion-based Video Generation with Dynamic Latent Frame Rate
ICCV 2025
1
citations
S2DNAS: Transforming Static CNN Model for Dynamic Inference via Neural Architecture Search
ECCV 2020
0
citations
PTQ4ViT: Post-Training Quantization for Vision Transformers with Twin Uniform Quantization
ECCV 2022
0
citations
PillarHist: A Quantization-aware Pillar Feature Encoder based on Height-aware Histogram
CVPR 2025
0
citations
EA-Vit: Efficient Adaptation for Elastic Vision Transformer
ICCV 2025
0
citations
QuEST: Low-bit Diffusion Model Quantization via Efficient Selective Finetuning
ICCV 2025
0
citations
Learning High-Frequency Functions Made Easy with Sinusoidal Positional Encoding
ICML 2024
0
citations
Post-Training Quantization on Diffusion Models
CVPR 2023arXiv
0
citations
PD-Quant: Post-Training Quantization Based on Prediction Difference Metric
CVPR 2023
0
citations
Latency-aware Spatial-wise Dynamic Networks
NeurIPS 2022
0
citations
MIM4DD: Mutual Information Maximization for Dataset Distillation
NeurIPS 2023
0
citations