
Environment-agnostic Multitask Learning for Natural Language Grounded Navigation

Xin Eric Wang, Vihan Jain, Eugene Ie, William Yang Wang, Zornitsa Kozareva, Sujith Ravi

2020-03-01 · ECCV 2020 · Vision-Language Navigation
Paper · PDF · Code (official)

Abstract

Recent research efforts enable the study of natural language grounded navigation in photo-realistic environments, e.g., following natural language instructions or dialog. However, existing methods tend to overfit the training data in seen environments and fail to generalize well to previously unseen environments. To close the gap between seen and unseen environments, we aim at learning a generalized navigation model from two novel perspectives: (1) we introduce a multitask navigation model that can be seamlessly trained on both Vision-Language Navigation (VLN) and Navigation from Dialog History (NDH) tasks, which benefits from richer natural language guidance and effectively transfers knowledge across tasks; (2) we propose to learn environment-agnostic representations for the navigation policy that are invariant among the environments seen during training, thus generalizing better to unseen environments. Extensive experiments show that environment-agnostic multitask learning significantly reduces the performance gap between seen and unseen environments, and that the resulting navigation agent outperforms baselines on unseen environments by 16% (relative success rate) on VLN and 120% (goal progress) on NDH. Our submission to the CVDN leaderboard establishes a new state-of-the-art for the NDH task on the holdout test set. Code is available at https://github.com/google-research/valan.
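The second ingredient described in the abstract, representations that are invariant across the training environments, is commonly enforced with domain-adversarial training: an auxiliary classifier tries to identify which seen environment an observation came from, while a gradient reversal layer trains the shared encoder to defeat it. Below is a minimal PyTorch sketch of that pattern. The module names, feature sizes, environment count, and weight lam are illustrative assumptions, not the authors' code (their implementation lives at https://github.com/google-research/valan).

import torch
import torch.nn as nn
import torch.nn.functional as F

class GradReverse(torch.autograd.Function):
    # Identity in the forward pass; multiplies gradients by -lam in backward,
    # so the encoder is pushed to *hurt* the environment classifier.
    @staticmethod
    def forward(ctx, x, lam):
        ctx.lam = lam
        return x.view_as(x)

    @staticmethod
    def backward(ctx, grad_output):
        return -ctx.lam * grad_output, None

class EnvAgnosticEncoder(nn.Module):
    # Shared observation encoder plus an adversarial environment head.
    def __init__(self, obs_dim=2048, hid_dim=512, num_envs=61, lam=0.5):
        super().__init__()
        self.lam = lam
        self.encoder = nn.Sequential(nn.Linear(obs_dim, hid_dim), nn.ReLU())
        self.env_head = nn.Linear(hid_dim, num_envs)  # predicts environment id

    def forward(self, obs):
        h = self.encoder(obs)  # features consumed by the navigation policy
        env_logits = self.env_head(GradReverse.apply(h, self.lam))
        return h, env_logits

# Toy usage: the adversarial cross-entropy is simply added to the navigation
# loss. The env head receives normal gradients (it learns to classify), while
# the encoder receives reversed ones, so h drifts toward features that carry
# no information about which training environment produced the observation.
enc = EnvAgnosticEncoder()
obs = torch.randn(8, 2048)            # e.g. pre-extracted image features
env_ids = torch.randint(0, 61, (8,))  # seen-environment label per sample
h, env_logits = enc(obs)
adv_loss = F.cross_entropy(env_logits, env_ids)
adv_loss.backward()

In the multitask setting, the same encoder (and policy) is shared across VLN and NDH batches, so the adversarial term regularizes both tasks at once.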

Results

Task | Dataset | Metric | Value | Model
Visual Navigation | Cooperative Vision-and-Dialogue Navigation | dist_to_end_reduction | 3.91 | Environment-agnostic Multitask Learning
Visual Navigation | Cooperative Vision-and-Dialogue Navigation | spl | 0.17 | Environment-agnostic Multitask Learning
Vision and Language Navigation | VLN Challenge | error | 6.03 | Environment-agnostic Multitask Learning
Vision and Language Navigation | VLN Challenge | length | 13.35 | Environment-agnostic Multitask Learning
Vision and Language Navigation | VLN Challenge | oracle success | 0.56 | Environment-agnostic Multitask Learning
Vision and Language Navigation | VLN Challenge | spl | 0.4 | Environment-agnostic Multitask Learning
Vision and Language Navigation | VLN Challenge | success | 0.45 | Environment-agnostic Multitask Learning
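For reference when reading the table: success and oracle success are success rates reported on a 0-1 scale, dist_to_end_reduction on CVDN measures goal progress (reduction in distance to the goal, in meters), and spl is Success weighted by Path Length (Anderson et al., 2018). The standard SPL definition, not given on this page, is:

$$\mathrm{SPL} = \frac{1}{N}\sum_{i=1}^{N} S_i\,\frac{\ell_i}{\max(p_i,\ \ell_i)}$$

where, for episode $i$, $S_i \in \{0, 1\}$ marks success, $\ell_i$ is the shortest-path distance from start to goal, and $p_i$ is the length of the path the agent actually took.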

Related Papers

SE-VLN: A Self-Evolving Vision-Language Navigation Framework Based on Multimodal Large Language Models (2025-07-17)
VLN-R1: Vision-Language Navigation via Reinforcement Fine-Tuning (2025-06-20)
Grounded Vision-Language Navigation for UAVs with Open-Vocabulary Goal Understanding (2025-06-12)
Generating Vision-Language Navigation Instructions Incorporated Fine-Grained Alignment Annotations (2025-06-10)
Active Test-time Vision-Language Navigation (2025-06-07)
EvolveNav: Self-Improving Embodied Reasoning for LLM-Based Vision-Language Navigation (2025-06-02)
Automated Data Curation Using GPS & NLP to Generate Instruction-Action Pairs for Autonomous Vehicle Vision-Language Navigation Datasets (2025-05-06)
UAV-VLN: End-to-End Vision Language guided Navigation for UAVs (2025-04-30)