Bohan Zhuang

35
Papers
238
Total Citations

Papers (35)

LongVLM: Efficient Long Video Understanding via Large Language Models

ECCV 2024 (arXiv)
128
citations

EfficientDM: Efficient Quantization-Aware Fine-Tuning of Low-Bit Diffusion Models

ICLR 2024
69
citations

Neighboring Autoregressive Modeling for Efficient Visual Generation

ICCV 2025
16
citations

FPSAttention: Training-Aware FP8 and Sparsity Co-Design for Fast Video Diffusion

NeurIPS 2025
8
citations

Efficient Stitchable Task Adaptation

CVPR 2024
7
citations

ZPressor: Bottleneck-Aware Compression for Scalable Feed-Forward 3DGS

NeurIPS 2025
6
citations

Stitched ViTs are Flexible Vision Backbones

ECCV 2024 (arXiv)
4
citations

Attend in Groups: A Weakly-Supervised Deep Learning Framework for Learning From Web Data

CVPR 2017 (arXiv)
0
citations

Parallel Attention: A Unified Framework for Visual Object Discovery Through Dialogs and Queries

CVPR 2018 (arXiv)
0
citations

Towards Effective Low-Bitwidth Convolutional Neural Networks

CVPR 2018 (arXiv)
0
citations

Structured Binary Neural Networks for Accurate Image Classification and Semantic Segmentation

CVPR 2019
0
citations

AQD: Towards Accurate Quantized Object Detection

CVPR 2021 (arXiv)
0
citations

Automated Progressive Learning for Efficient Training of Vision Transformers

CVPR 2022 (arXiv)
0
citations

Dynamic Focus-Aware Positional Queries for Semantic Segmentation

CVPR 2023 (arXiv)
0
citations

Stitchable Neural Networks

CVPR 2023 (arXiv)
0
citations

Towards Context-Aware Interaction Recognition for Visual Relationship Detection

ICCV 2017
0
citations

FATNN: Fast and Accurate Ternary Neural Networks

ICCV 2021 (arXiv)
0
citations

Scalable Vision Transformers With Hierarchical Pooling

ICCV 2021 (arXiv)
0
citations

Sensitivity-Aware Visual Parameter-Efficient Fine-Tuning

ICCV 2023 (arXiv)
0
citations

BiViT: Extremely Compressed Binary Vision Transformers

ICCV 2023 (arXiv)
0
citations

Generative Low-bitwidth Data Free Quantization

ECCV 2020
0
citations

An Efficient Spatio-Temporal Pyramid Transformer for Action Detection

ECCV 2022
0
citations

Training Quantized Neural Networks With a Full-Precision Auxiliary Module

CVPR 2020 (arXiv)
0
citations

ZipVL: Accelerating Vision-Language Models through Dynamic Token Sparsity

ICCV 2025
0
citations

Frequency-Aware Autoregressive Modeling for Efficient High-Resolution Image Synthesis

ICCV 2025
0
citations

Channel Merging: Preserving Specialization for Merged Experts

AAAI 2025
0
citations

ModaVerse: Efficiently Transforming Modalities with LLMs

CVPR 2024
0
citations

Fast Training of Triplet-Based Deep Binary Embedding Networks

CVPR 2016
0
citations

Sequential Person Recognition in Photo Albums With a Recurrent Network

CVPR 2017 (arXiv)
0
citations

Discrimination-aware Channel Pruning for Deep Neural Networks

NeurIPS 2018
0
citations

EcoFormer: Energy-Saving Attention with Linear Complexity

NeurIPS 2022
0
citations

Fast Vision Transformers with HiLo Attention

NeurIPS 2022
0
citations

Mask Propagation for Efficient Video Semantic Segmentation

NeurIPS 2023
0
citations

PTQD: Accurate Post-Training Quantization for Diffusion Models

NeurIPS 2023
0
citations

Efficient Test-Time Adaptation for Super-Resolution with Second-Order Degradation and Reconstruction

NeurIPS 2023
0
citations