Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.


Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


Trajectory Forecasts in Unknown Environments Conditioned on Grid-Based Plans

Nachiket Deo, Mohan M. Trivedi

2020-01-03 · Reinforcement Learning · Trajectory Forecasting · Trajectory Prediction

Paper · PDF · Code (official)

Abstract

We address the problem of forecasting pedestrian and vehicle trajectories in unknown environments, conditioned on their past motion and scene structure. Trajectory forecasting is a challenging problem due to the large variation in scene structure and the multimodal distribution of future trajectories. Unlike prior approaches that directly learn one-to-many mappings from observed context to multiple future trajectories, we propose to condition trajectory forecasts on plans sampled from a grid-based policy learned using maximum entropy inverse reinforcement learning (MaxEnt IRL). We reformulate MaxEnt IRL to allow the policy to jointly infer plausible agent goals, and paths to those goals, on a coarse 2-D grid defined over the scene. We propose an attention-based trajectory generator that generates continuous-valued future trajectories conditioned on state sequences sampled from the MaxEnt policy. Quantitative and qualitative evaluation on the publicly available Stanford Drone and nuScenes datasets shows that our model generates trajectories that are diverse, representing the multimodal predictive distribution, and precise, conforming to the underlying scene structure over long prediction horizons.

Results

| Task | Dataset | Metric | Value | Model |
| --- | --- | --- | --- | --- |
| Trajectory Prediction | Stanford Drone | ADE-8/12 @ K=20 | 12.58 | P2TIRL |
| Trajectory Prediction | Stanford Drone | FDE-8/12 @ K=20 | 22.07 | P2TIRL |
| Trajectory Prediction | nuScenes | MinADE_10 | 1.16 | P2T |
| Trajectory Prediction | nuScenes | MinADE_5 | 1.45 | P2T |
| Trajectory Prediction | nuScenes | MinFDE_1 | 10.5 | P2T |
| Trajectory Prediction | nuScenes | MissRateTopK_2_10 | 0.46 | P2T |
| Trajectory Prediction | nuScenes | MissRateTopK_2_5 | 0.64 | P2T |
| Trajectory Prediction | nuScenes | OffRoadRate | 0.03 | P2T |

Related Papers

- Multi-Strategy Improved Snake Optimizer Accelerated CNN-LSTM-Attention-Adaboost for Trajectory Prediction (2025-07-21)
- CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning (2025-07-18)
- VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning (2025-07-17)
- Spectral Bellman Method: Unifying Representation and Exploration in RL (2025-07-17)
- Aligning Humans and Robots via Reinforcement Learning from Implicit Human Feedback (2025-07-17)
- VAR-MATH: Probing True Mathematical Reasoning in Large Language Models via Symbolic Multi-Instance Benchmarks (2025-07-17)
- QuestA: Expanding Reasoning Capacity in LLMs via Question Augmentation (2025-07-17)
- Inverse Reinforcement Learning Meets Large Language Model Post-Training: Basics, Advances, and Opportunities (2025-07-17)