"vision-and-language navigation" Papers
8 papers found
Do Visual Imaginations Improve Vision-and-Language Navigation Agents?
Akhil Perincherry, Jacob Krantz, Stefan Lee
CVPR 2025, poster, arXiv:2503.16394
7 citations
Dynam3D: Dynamic Layered 3D Tokens Empower VLM for Vision-and-Language Navigation
Zihan Wang, Seungjun Lee, Gim Hee Lee
NeurIPS 2025, oral, arXiv:2505.11383
5 citations
General Scene Adaptation for Vision-and-Language Navigation
Haodong Hong, Yanyuan Qiao, Sen Wang et al.
ICLR 2025, poster, arXiv:2501.17403
10 citations
Augmented Commonsense Knowledge for Remote Object Grounding
Bahram Mohammadi, Yicong Hong, Yuankai Qi et al.
AAAI 2024, paper, arXiv:2406.01256
Fast-Slow Test-Time Adaptation for Online Vision-and-Language Navigation
Junyu Gao, Xuan Yao, Changsheng Xu
ICML 2024, poster
NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large Language Models
Gengze Zhou, Yicong Hong, Qi Wu
AAAI 2024, paper, arXiv:2305.16986
276 citations
VLN-Video: Utilizing Driving Videos for Outdoor Vision-and-Language Navigation
Jialu Li, Aishwarya Padmakumar, Gaurav Sukhatme et al.
AAAI 2024, paper, arXiv:2402.03561
10 citations
WebVLN: Vision-and-Language Navigation on Websites
Qi Chen, Dileepa Pitawela, Chongyang Zhao et al.
AAAI 2024, paper, arXiv:2312.15820
19 citations