Bo He
8
Papers
29
Total Citations
Papers (8)
OmniViD: A Generative Framework for Universal Video Understanding
CVPR 2024
29
citations
ASM-Loc: Action-Aware Segment Modeling for Weakly-Supervised Temporal Action Localization
CVPR 2022
0
citations
Towards Scalable Neural Representation for Diverse Videos
CVPR 2023arXiv
0
citations
Align and Attend: Multimodal Summarization With Dual Contrastive Losses
CVPR 2023arXiv
0
citations
Chop & Learn: Recognizing and Generating Object-State Compositions
ICCV 2023arXiv
0
citations
Learning Semantic Correspondence with Sparse Annotations
ECCV 2022
0
citations
MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding
CVPR 2024
0
citations
NeRV: Neural Representations for Videos
NeurIPS 2021
0
citations