Kevin Lin
7
Papers
59
Total Citations
Papers (7)
MM-Narrator: Narrating Long-form Videos with Multimodal In-Context Learning
CVPR 2024
49
citations
BizGen: Advancing Article-level Visual Text Rendering for Infographics Generation
CVPR 2025arXiv
10
citations
LiVOS: Light Video Object Segmentation with Gated Linear Matching
CVPR 2025
0
citations
DisCo: Disentangled Control for Realistic Human Dance Generation
CVPR 2024
0
citations
MM-Vet: Evaluating Large Multimodal Models for Integrated Capabilities
ICML 2024
0
citations
ImageGen-CoT: Enhancing Text-to-Image In-context Learning with Chain-of-Thought Reasoning
ICCV 2025
0
citations
Scaling Inference-Time Search with Vision Value Model for Improved Visual Comprehension
ICCV 2025
0
citations