Kenneth Li
6 Papers
187 Total Citations
Papers (6)
VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction
NeurIPS 2025, arXiv
130 citations
Towards Multimodal Sentiment Analysis Debiasing via Bias Purification
ECCV 2024, arXiv
35 citations
VITA-Audio: Fast Interleaved Audio-Text Token Generation for Efficient Large Speech-Language Model
NeurIPS 2025
17 citations
Incentivizing Reasoning for Advanced Instruction-Following of Large Language Models
NeurIPS 2025, arXiv
4 citations
Zooming from Context to Cue: Hierarchical Preference Optimization for Multi-Image MLLMs
NeurIPS 2025, arXiv
1 citation
Augmenting Biological Fitness Prediction Benchmarks with Landscapes Features from GraphFLA
NeurIPS 2025, arXiv
0 citations