Zhihang Yuan

16
Papers
132
Total Citations

Papers (16)

PB-LLM: Partially Binarized Large Language Models

ICLR 2024
80
citations

A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training

CVPR 2025
24
citations

DiTFastAttnV2: Head-wise Attention Compression for Multi-Modality Diffusion Transformers

ICCV 2025
10
citations

R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token Routing

NeurIPS 2025
9
citations

MoEQuant: Enhancing Quantization for Mixture-of-Experts Large Language Models via Expert-Balanced Sampling and Affinity Guidance

ICML 2025
8
citations

DLFR-Gen: Diffusion-based Video Generation with Dynamic Latent Frame Rate

ICCV 2025
1
citations

S2DNAS: Transforming Static CNN Model for Dynamic Inference via Neural Architecture Search

ECCV 2020
0
citations

PTQ4ViT: Post-Training Quantization for Vision Transformers with Twin Uniform Quantization

ECCV 2022
0
citations

PillarHist: A Quantization-aware Pillar Feature Encoder based on Height-aware Histogram

CVPR 2025
0
citations

EA-Vit: Efficient Adaptation for Elastic Vision Transformer

ICCV 2025
0
citations

QuEST: Low-bit Diffusion Model Quantization via Efficient Selective Finetuning

ICCV 2025
0
citations

Learning High-Frequency Functions Made Easy with Sinusoidal Positional Encoding

ICML 2024
0
citations

Post-Training Quantization on Diffusion Models

CVPR 2023arXiv
0
citations

PD-Quant: Post-Training Quantization Based on Prediction Difference Metric

CVPR 2023
0
citations

Latency-aware Spatial-wise Dynamic Networks

NeurIPS 2022
0
citations

MIM4DD: Mutual Information Maximization for Dataset Distillation

NeurIPS 2023
0
citations