"vision-and-language navigation" Papers
5 papers found
Augmented Commonsense Knowledge for Remote Object Grounding
Bahram Mohammadi, Yicong Hong, Yuankai Qi et al.
AAAI 2024 paper, arXiv:2406.01256
Fast-Slow Test-Time Adaptation for Online Vision-and-Language Navigation
Junyu Gao, Xuan Yao, Changsheng Xu
ICML 2024 poster
NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large Language Models
Gengze Zhou, Yicong Hong, Qi Wu
AAAI 2024 paper, arXiv:2305.16986
276 citations
VLN-Video: Utilizing Driving Videos for Outdoor Vision-and-Language Navigation
Jialu Li, Aishwarya Padmakumar, Gaurav Sukhatme et al.
AAAI 2024 paper, arXiv:2402.03561
10 citations
WebVLN: Vision-and-Language Navigation on Websites
Qi Chen, Dileepa Pitawela, Chongyang Zhao et al.
AAAI 2024 paper, arXiv:2312.15820
19 citations