Zhenheng Yang
14
Papers
169
Total Citations
Papers (14)
Show-o2: Improved Native Unified Multimodal Models
NeurIPS 2025
90
citations
Long Context Tuning for Video Generation
ICCV 2025
56
citations
STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution
ICCV 2025
22
citations
DiCo: Revitalizing ConvNets for Scalable and Efficient Diffusion Modeling
NeurIPS 2025
1
citations
Activity Driven Weakly Supervised Object Detection
CVPR 2019
0
citations
UnOS: Unified Unsupervised Optical-Flow and Stereo-Depth Estimation by Watching Videos
CVPR 2019
0
citations
Weakly Supervised Instance Segmentation for Videos With Temporal Mask Consistency
CVPR 2021arXiv
0
citations
TURN TAP: Temporal Unit Regression Network for Temporal Action Proposals
ICCV 2017arXiv
0
citations
TALL: Temporal Activity Localization via Language Query
ICCV 2017arXiv
0
citations
Parallelized Autoregressive Visual Generation
CVPR 2025
0
citations
SPAN: Spatial Pyramid Attention Network for Image Manipulation Localization
ECCV 2020
0
citations
InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption
CVPR 2025
0
citations
LEGO: Learning Edge With Geometry All at Once by Watching Videos
CVPR 2018arXiv
0
citations
Occlusion Aware Unsupervised Learning of Optical Flow
CVPR 2018arXiv
0
citations