Song Han

38
Papers
992
Total Citations

Papers (38)

VILA: On Pre-training for Visual Language Models

CVPR 2024
685
citations

CoT-VLA: Visual Chain-of-Thought Reasoning for Vision-Language-Action Models

CVPR 2025
203
citations

DataMix: Efficient Privacy-Preserving Edge-Cloud Inference

ECCV 2020
40
citations

WorldModelBench: Judging Video Generation Models As World Models

NeurIPS 2025
31
citations

Condition-Aware Neural Network for Controlled Image Generation

CVPR 2024
17
citations

Twilight: Adaptive Attention Sparsity with Hierarchical Top-$p$ Pruning

NeurIPS 2025
11
citations

DC-AR: Efficient Masked Autoregressive Image Generation with Deep Compression Hybrid Tokenizer

ICCV 2025arXiv
4
citations

SparseVILA: Decoupling Visual Sparsity for Efficient VLM Inference

ICCV 2025
1
citations

APQ: Joint Search for Network Architecture, Pruning and Quantization Policy

CVPR 2020arXiv
0
citations

GAN Compression: Efficient Architectures for Interactive Conditional GANs

CVPR 2020arXiv
0
citations

Anycost GANs for Interactive Image Synthesis and Editing

CVPR 2021arXiv
0
citations

Lite Pose: Efficient Architecture Design for 2D Human Pose Estimation

CVPR 2022arXiv
0
citations

FlatFormer: Flattened Window Attention for Efficient Point Cloud Transformer

CVPR 2023arXiv
0
citations

TSM: Temporal Shift Module for Efficient Video Understanding

ICCV 2019
0
citations

LocTex: Learning Data-Efficient Visual Representations From Localized Textual Supervision

ICCV 2021arXiv
0
citations

EfficientViT: Lightweight Multi-Scale Attention for High-Resolution Dense Prediction

ICCV 2023
0
citations

Searching Efficient 3D Architectures with Sparse Point-Voxel Convolution

ECCV 2020
0
citations

Learning both Weights and Connections for Efficient Neural Network

NeurIPS 2015
0
citations

SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution Vision Transformer

CVPR 2023arXiv
0
citations

Scaling Vision Pre-Training to 4K Resolution

CVPR 2025
0
citations

NVILA: Efficient Frontier Visual Language Models

CVPR 2025
0
citations

DC-AE 1.5: Accelerating Diffusion Model Convergence with Structured Latent Space

ICCV 2025
0
citations

SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation

ICCV 2025
0
citations

DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models

CVPR 2024
0
citations

QUEST: Query-Aware Sparsity for Efficient Long-Context LLM Inference

ICML 2024
0
citations

HAQ: Hardware-Aware Automated Quantization With Mixed Precision

CVPR 2019
0
citations

Point-Voxel CNN for Efficient 3D Deep Learning

NeurIPS 2019
0
citations

Deep Leakage from Gradients

NeurIPS 2019
0
citations

Park: An Open Platform for Learning-Augmented Computer Systems

NeurIPS 2019
0
citations

Differentiable Augmentation for Data-Efficient GAN Training

NeurIPS 2020
0
citations

TinyTL: Reduce Memory, Not Parameters for Efficient On-Device Learning

NeurIPS 2020
0
citations

MCUNet: Tiny Deep Learning on IoT Devices

NeurIPS 2020
0
citations

Memory-efficient Patch-based Inference for Tiny Deep Learning

NeurIPS 2021
0
citations

Delayed Gradient Averaging: Tolerate the Communication Latency for Federated Learning

NeurIPS 2021
0
citations

On-Device Training Under 256KB Memory

NeurIPS 2022
0
citations

Efficient Spatially Sparse Inference for Conditional GANs and Diffusion Models

NeurIPS 2022
0
citations

Path-Level Network Transformation for Efficient Architecture Search

ICML 2018
0
citations

Improved Dynamic Graph Learning through Fault-Tolerant Sparsification

ICML 2019
0
citations