Leonid Karlinsky
8
Papers
260
Total Citations
Papers (8)
Listen, Think, and Understand
ICLR 2024
221
citations
Self-MoE: Towards Compositional Large Language Models with Self-Specialized Experts
ICLR 2025
19
citations
LiveXiv - A Multi-Modal live benchmark based on Arxiv papers content
ICLR 2025arXiv
11
citations
Enhancing Few-Shot Vision-Language Classification with Large Multimodal Model Features
ICCV 2025
3
citations
CAV-MAE Sync: Improving Contrastive Audio-Visual Mask Autoencoders via Fine-Grained Alignment
CVPR 2025
2
citations
Sample- and Parameter-Efficient Auto-Regressive Image Models
CVPR 2025
2
citations
Teaching VLMs to Localize Specific Objects from In-context Examples
ICCV 2025
2
citations
BATCLIP: Bimodal Online Test-Time Adaptation for CLIP
ICCV 2025
0
citations