Shifeng Zhang

25

Papers

135

Total Citations

Papers (25)

VideoEspresso: A Large-Scale Chain-of-Thought Dataset for Fine-Grained Video Reasoning via Core Frame Selection

Accelerating Diffusion Sampling with Optimized Time Steps

FUDOKI: Discrete Flow-based Unified Understanding and Generation via Kinetic-Optimal Velocities

NeurIPS 2025arXiv

Turbo2K: Towards Ultra-Efficient and High-Quality 2K Video Synthesis

Rethinking Correspondence-based Category-Level Object Pose Estimation

TurboVSR: Fantastic Video Upscalers and Where to Find Them

Single-Shot Refinement Neural Network for Object Detection

A Dataset and Benchmark for Large-Scale Multi-Modal Face Anti-Spoofing

ScratchDet: Training Single-Shot Object Detectors From Scratch

Bridging the Gap Between Anchor-Based and Anchor-Free Detection via Adaptive Training Sample Selection

iVPF: Numerical Invertible Volume Preserving Flow for Efficient Lossless Compression

Split Hierarchical Variational Compression

PILC: Practical Image Lossless Compression With an End-to-End GPU Oriented Neural Framework

S3FD: Single Shot Scale-Invariant Face Detector

Structure-Aware Correspondence Learning for Relative Pose Estimation

Generative Map Priors for Collaborative BEV Semantic Segmentation

Revisiting Audio-Visual Segmentation with Vision-Centric Transformer

LiT: Delving into a Simple Linear Diffusion Transformer for Image Generation

Pamba: Enhancing Global Interaction in Point Clouds via State Space Model

MULAN: A Multi Layer Annotated Dataset for Controllable Text-to-Image Generation

Understanding and Exploring the Network with Stochastic Architectures

iFlow: Numerically Invertible Flows for Efficient Lossless Compression via a Uniform Coder

OSOA: One-Shot Online Adaptation of Deep Generative Models for Lossless Compression

Diff-Instruct: A Universal Approach for Transferring Knowledge From Pre-trained Diffusion Models

SA-Solver: Stochastic Adams Solver for Fast Sampling of Diffusion Models