Xiaojie Jin
9
Papers
40
Total Citations
Papers (9)
VideoWorld: Exploring Knowledge Learning from Unlabeled Videos
CVPR 2025
28
citations
Flash-VStream: Efficient Real-Time Understanding for Long Video Streams
ICCV 2025
11
citations
PreFM: Online Audio-Visual Event Parsing via Predictive Future Modeling
NeurIPS 2025arXiv
1
citations
Stitching Segments and Sentences towards Generalization in Video-Text Pre-training
AAAI 2024
0
citations
Exploring Domain Incremental Video Highlights Detection with the LiveFood Benchmark
AAAI 2024
0
citations
MV-Adapter: Multimodal Video Transfer Learning for Video Text Retrieval
CVPR 2024
0
citations
PixelLM: Pixel Reasoning with Large Multimodal Model
CVPR 2024
0
citations
Video Recognition in Portrait Mode
CVPR 2024
0
citations
VISTA-LLAMA: Reducing Hallucination in Video Language Models via Equal Distance to Visual Tokens
CVPR 2024
0
citations