Jianfeng Wang
7
Papers
100
Total Citations
Papers (7)
MM-Narrator: Narrating Long-form Videos with Multimodal In-Context Learning
CVPR 2024
49
citations
MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos
ICLR 2025arXiv
34
citations
SlowFast-VGen: Slow-Fast Learning for Action-Driven Long Video Generation
ICLR 2025arXiv
17
citations
MM-Vet: Evaluating Large Multimodal Models for Integrated Capabilities
ICML 2024
0
citations
MMSum: A Dataset for Multimodal Summarization and Thumbnail Generation of Videos
CVPR 2024
0
citations
LiVOS: Light Video Object Segmentation with Gated Linear Matching
CVPR 2025
0
citations
Segment and Caption Anything
CVPR 2024
0
citations