Danny Yin
Papers: 6
Total Citations: 714
Papers (6)
VILA: On Pre-training for Visual Language Models
CVPR 2024, 685 citations
VILA-M3: Enhancing Vision-Language Models with Medical Expert Knowledge
CVPR 2025, 29 citations
Scaling Vision Pre-Training to 4K Resolution
CVPR 2025, 0 citations
NVILA: Efficient Frontier Visual Language Models
CVPR 2025, 0 citations
RADIOv2.5: Improved Baselines for Agglomerative Vision Foundation Models
CVPR 2025, 0 citations
RegionGPT: Towards Region Understanding Vision Language Model
CVPR 2024, 0 citations