Yao Lu
11
Papers
953
Total Citations
Papers (11)
VILA: On Pre-training for Visual Language Models
CVPR 2024
685
citations
CoT-VLA: Visual Chain-of-Thought Reasoning for Vision-Language-Action Models
CVPR 2025
203
citations
WorldModelBench: Judging Video Generation Models As World Models
NeurIPS 2025
31
citations
VILA-M3: Enhancing Vision-Language Models with Medical Expert Knowledge
CVPR 2025
29
citations
DC-AR: Efficient Masked Autoregressive Image Generation with Deep Compression Hybrid Tokenizer
ICCV 2025arXiv
4
citations
SparseVILA: Decoupling Visual Sparsity for Efficient VLM Inference
ICCV 2025
1
citations
NVILA: Efficient Frontier Visual Language Models
CVPR 2025
0
citations
Scaling Vision Pre-Training to 4K Resolution
CVPR 2025
0
citations
RADIOv2.5: Improved Baselines for Agglomerative Vision Foundation Models
CVPR 2025
0
citations
A Set of Generalized Components to Achieve Effective Poison-only Clean-label Backdoor Attacks with Collaborative Sample Selection and Triggers
NeurIPS 2025
0
citations
ALRMR-GEC: Adjusting Learning Rate Based on Memory Rate to Optimize the Edit Scorer for Grammatical Error Correction
AAAI 2025
0
citations