Shifeng Zhang
25
Papers
135
Total Citations
Papers (25)
VideoEspresso: A Large-Scale Chain-of-Thought Dataset for Fine-Grained Video Reasoning via Core Frame Selection
CVPR 2025
54
citations
Accelerating Diffusion Sampling with Optimized Time Steps
CVPR 2024
51
citations
FUDOKI: Discrete Flow-based Unified Understanding and Generation via Kinetic-Optimal Velocities
NeurIPS 2025arXiv
20
citations
Turbo2K: Towards Ultra-Efficient and High-Quality 2K Video Synthesis
ICCV 2025
7
citations
Rethinking Correspondence-based Category-Level Object Pose Estimation
CVPR 2025
2
citations
TurboVSR: Fantastic Video Upscalers and Where to Find Them
ICCV 2025
1
citations
Single-Shot Refinement Neural Network for Object Detection
CVPR 2018arXiv
0
citations
A Dataset and Benchmark for Large-Scale Multi-Modal Face Anti-Spoofing
CVPR 2019
0
citations
ScratchDet: Training Single-Shot Object Detectors From Scratch
CVPR 2019
0
citations
Bridging the Gap Between Anchor-Based and Anchor-Free Detection via Adaptive Training Sample Selection
CVPR 2020arXiv
0
citations
iVPF: Numerical Invertible Volume Preserving Flow for Efficient Lossless Compression
CVPR 2021arXiv
0
citations
Split Hierarchical Variational Compression
CVPR 2022arXiv
0
citations
PILC: Practical Image Lossless Compression With an End-to-End GPU Oriented Neural Framework
CVPR 2022
0
citations
S3FD: Single Shot Scale-Invariant Face Detector
ICCV 2017
0
citations
Structure-Aware Correspondence Learning for Relative Pose Estimation
CVPR 2025
0
citations
Generative Map Priors for Collaborative BEV Semantic Segmentation
CVPR 2025
0
citations
Revisiting Audio-Visual Segmentation with Vision-Centric Transformer
CVPR 2025
0
citations
LiT: Delving into a Simple Linear Diffusion Transformer for Image Generation
ICCV 2025
0
citations
Pamba: Enhancing Global Interaction in Point Clouds via State Space Model
AAAI 2025
0
citations
MULAN: A Multi Layer Annotated Dataset for Controllable Text-to-Image Generation
CVPR 2024
0
citations
Understanding and Exploring the Network with Stochastic Architectures
NeurIPS 2020
0
citations
iFlow: Numerically Invertible Flows for Efficient Lossless Compression via a Uniform Coder
NeurIPS 2021
0
citations
OSOA: One-Shot Online Adaptation of Deep Generative Models for Lossless Compression
NeurIPS 2021
0
citations
Diff-Instruct: A Universal Approach for Transferring Knowledge From Pre-trained Diffusion Models
NeurIPS 2023
0
citations
SA-Solver: Stochastic Adams Solver for Fast Sampling of Diffusion Models
NeurIPS 2023
0
citations