Shifeng Zhang
12 Papers
135 Total Citations
Papers (12)
VideoEspresso: A Large-Scale Chain-of-Thought Dataset for Fine-Grained Video Reasoning via Core Frame Selection
CVPR 2025
54
citations
Accelerating Diffusion Sampling with Optimized Time Steps
CVPR 2024
51
citations
FUDOKI: Discrete Flow-based Unified Understanding and Generation via Kinetic-Optimal Velocities
NeurIPS 2025, arXiv
20
citations
Turbo2K: Towards Ultra-Efficient and High-Quality 2K Video Synthesis
ICCV 2025
7
citations
Rethinking Correspondence-based Category-Level Object Pose Estimation
CVPR 2025
2
citations
TurboVSR: Fantastic Video Upscalers and Where to Find Them
ICCV 2025
1
citation
Structure-Aware Correspondence Learning for Relative Pose Estimation
CVPR 2025
0
citations
Revisiting Audio-Visual Segmentation with Vision-Centric Transformer
CVPR 2025
0
citations
Pamba: Enhancing Global Interaction in Point Clouds via State Space Model
AAAI 2025
0
citations
MULAN: A Multi Layer Annotated Dataset for Controllable Text-to-Image Generation
CVPR 2024
0
citations
Generative Map Priors for Collaborative BEV Semantic Segmentation
CVPR 2025
0
citations
LiT: Delving into a Simple Linear Diffusion Transformer for Image Generation
ICCV 2025
0
citations