Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.


Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


Tactical Rewind: Self-Correction via Backtracking in Vision-and-Language Navigation

Liyiming Ke, Xiujun Li, Yonatan Bisk, Ari Holtzman, Zhe Gan, Jingjing Liu, Jianfeng Gao, Yejin Choi, Siddhartha Srinivasa

2019-03-06 · CVPR 2019 · Vision-Language Navigation · Vision and Language Navigation
Paper · PDF · Code (official)

Abstract

We present the Frontier Aware Search with backTracking (FAST) Navigator, a general framework for action decoding that achieves state-of-the-art results on the Room-to-Room (R2R) Vision-and-Language navigation challenge of Anderson et al. (2018). Given a natural language instruction and photo-realistic image views of a previously unseen environment, the agent is tasked with navigating from source to target location as quickly as possible. While all current approaches make local action decisions or score entire trajectories using beam search, ours balances local and global signals when exploring an unobserved environment. Importantly, this lets us act greedily but use global signals to backtrack when necessary. Applying the FAST framework to existing state-of-the-art models achieved a 17% relative gain, an absolute 6% gain on Success rate weighted by Path Length (SPL).
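The core idea of the abstract — act greedily on local signals, but keep a frontier of partial trajectories and backtrack to the globally best one when the current path stops looking promising — can be sketched with a best-first search. This is a toy illustration, not the paper's method: the names (`fast_search`, `neighbors`, `local_score`) are hypothetical, and the real FAST Navigator scores trajectories with learned progress monitors rather than the fixed edge scores assumed here.

```python
import heapq

def fast_search(start, goal, neighbors, local_score, max_steps=100):
    """Toy frontier search with backtracking.

    Assumptions (not from the paper): `neighbors(node)` returns adjacent
    viewpoints, `local_score(a, b)` is a per-step action score, and a
    partial trajectory's global score is the sum of its local scores.
    """
    # Frontier holds (negated global score, path) so heapq pops the best.
    frontier = [(-0.0, [start])]
    visited = set()
    for _ in range(max_steps):
        if not frontier:
            break
        # Backtracking is implicit: we always resume from the globally
        # best partial trajectory, which need not extend the last one.
        neg_g, path = heapq.heappop(frontier)
        node = path[-1]
        if node == goal:
            return path
        if node in visited:
            continue
        visited.add(node)
        for nxt in neighbors(node):
            if nxt not in visited:
                g = -neg_g + local_score(node, nxt)
                heapq.heappush(frontier, (-g, path + [nxt]))
    return None

# Tiny example: the locally best first step ("A" -> "B") is a dead end,
# so the search backtracks to the frontier node "C" and reaches "G".
graph = {"A": ["B", "C"], "B": [], "C": ["G"], "G": []}
scores = {("A", "B"): 2.0, ("A", "C"): 1.0, ("C", "G"): 1.0}
path = fast_search("A", "G", graph.__getitem__, lambda a, b: scores[(a, b)])
```

A purely greedy decoder would commit to "B" and fail; a full beam search would score all trajectories. The frontier-with-backtracking middle ground is what the abstract describes.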

Results

Task | Dataset | Metric | Value | Model
Vision-Language Navigation | Room2Room | spl | 0.41 | Tactical Rewind - short
Vision and Language Navigation | VLN Challenge | error | 4.29 | Tactical Rewind - long
Vision and Language Navigation | VLN Challenge | length | 196.53 | Tactical Rewind - long
Vision and Language Navigation | VLN Challenge | oracle success | 0.9 | Tactical Rewind - long
Vision and Language Navigation | VLN Challenge | spl | 0.03 | Tactical Rewind - long
Vision and Language Navigation | VLN Challenge | success | 0.61 | Tactical Rewind - long
Vision and Language Navigation | VLN Challenge | error | 5.14 | Tactical Rewind - short
Vision and Language Navigation | VLN Challenge | length | 22.08 | Tactical Rewind - short
Vision and Language Navigation | VLN Challenge | oracle success | 0.64 | Tactical Rewind - short
Vision and Language Navigation | VLN Challenge | spl | 0.41 | Tactical Rewind - short
Vision and Language Navigation | VLN Challenge | success | 0.54 | Tactical Rewind - short
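The headline metric above, SPL (Success weighted by Path Length), was introduced by Anderson et al. (2018) and penalizes successful episodes that take longer paths than necessary:

```latex
\mathrm{SPL} = \frac{1}{N} \sum_{i=1}^{N} S_i \, \frac{l_i}{\max(p_i, l_i)}
```

where $N$ is the number of episodes, $S_i$ is a binary success indicator, $l_i$ is the shortest-path length from start to goal, and $p_i$ is the length of the agent's path. Note how it separates the two model variants here: the long-trajectory model raises success (0.61 vs. 0.54) but its path length of 196.53 collapses SPL to 0.03, while the short variant keeps SPL at 0.41.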

Related Papers

SE-VLN: A Self-Evolving Vision-Language Navigation Framework Based on Multimodal Large Language Models (2025-07-17)
Rethinking the Embodied Gap in Vision-and-Language Navigation: A Holistic Study of Physical and Visual Disparities (2025-07-17)
NavMorph: A Self-Evolving World Model for Vision-and-Language Navigation in Continuous Environments (2025-06-30)
VLN-R1: Vision-Language Navigation via Reinforcement Fine-Tuning (2025-06-20)
Grounded Vision-Language Navigation for UAVs with Open-Vocabulary Goal Understanding (2025-06-12)
A Navigation Framework Utilizing Vision-Language Models (2025-06-11)
Generating Vision-Language Navigation Instructions Incorporated Fine-Grained Alignment Annotations (2025-06-10)
Active Test-time Vision-Language Navigation (2025-06-07)