Yao Lu

Papers: 21
Total Citations: 953

Papers (21)

VILA: On Pre-training for Visual Language Models

CVPR 2024
685
citations

CoT-VLA: Visual Chain-of-Thought Reasoning for Vision-Language-Action Models

CVPR 2025
203
citations

WorldModelBench: Judging Video Generation Models As World Models

NeurIPS 2025
31
citations

VILA-M3: Enhancing Vision-Language Models with Medical Expert Knowledge

CVPR 2025
29
citations

DC-AR: Efficient Masked Autoregressive Image Generation with Deep Compression Hybrid Tokenizer

ICCV 2025
4
citations

SparseVILA: Decoupling Visual Sparsity for Efficient VLM Inference

ICCV 2025
1
citations

Scaling Vision Pre-Training to 4K Resolution

CVPR 2025
0
citations

Coherent Parametric Contours for Interactive Video Object Segmentation

CVPR 2016
0
citations

Learning Optical Flow From a Few Matches

CVPR 2021
0
citations

Taskology: Utilizing Task Relations at Scale

CVPR 2021
0
citations

Token Turing Machines

CVPR 2023
0
citations

Contour Flow: Middle-Level Motion Estimation by Combining Motion Segmentation and Contour Alignment

ICCV 2015
0
citations

Learning To Estimate Hidden Motions With Global Motion Aggregation

ICCV 2021
0
citations

Understanding the Dynamics of DNNs Using Graph Modularity

ECCV 2022
0
citations

NVILA: Efficient Frontier Visual Language Models

CVPR 2025
0
citations

RADIOv2.5: Improved Baselines for Agglomerative Vision Foundation Models

CVPR 2025
0
citations

A Set of Generalized Components to Achieve Effective Poison-only Clean-label Backdoor Attacks with Collaborative Sample Selection and Triggers

NeurIPS 2025
0
citations

ALRMR-GEC: Adjusting Learning Rate Based on Memory Rate to Optimize the Edit Scorer for Grammatical Error Correction

AAAI 2025
0
citations

Delayed Gradient Averaging: Tolerate the Communication Latency for Federated Learning

NeurIPS 2021
0
citations

Waymax: An Accelerated, Data-Driven Simulator for Large-Scale Autonomous Driving Research

NeurIPS 2023
0
citations

Grounded Decoding: Guiding Text Generation with Grounded Models for Embodied Agents

NeurIPS 2023
0
citations