Shuangrui Ding
13
Papers
106
Total Citations
Papers (13)
OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding?
CVPR 2025
37
citations
Dispider: Enabling Video LLMs with Active Real-Time Interaction via Disentangled Perception, Decision, and Reaction
CVPR 2025
31
citations
SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation
ICML 2025
21
citations
Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation
ECCV 2024arXiv
11
citations
Keyframe-Guided Creative Video Inpainting
CVPR 2025
6
citations
Semantics Meets Temporal Correspondence: Self-supervised Object-centric Learning in Videos
ICCV 2023arXiv
0
citations
Static and Dynamic Concepts for Self-Supervised Video Representation Learning
ECCV 2022
0
citations
AMPA: Adaptive Mixed Precision Allocation for Low-Bit Integer Training
ICML 2024
0
citations
SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree
ICCV 2025
0
citations
Motion-Aware Contrastive Video Representation Learning via Foreground-Background Merging
CVPR 2022arXiv
0
citations
Enhancing Self-Supervised Video Representation Learning via Multi-Level Feature Optimization
ICCV 2021arXiv
0
citations
Prune Spatio-temporal Tokens by Semantic-aware Temporal Accumulation
ICCV 2023arXiv
0
citations
Towards More Practical Adversarial Attacks on Graph Neural Networks
NeurIPS 2020
0
citations