Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

FIERY: Future Instance Prediction in Bird's-Eye View from Surround Monocular Cameras

Anthony Hu, Zak Murez, Nikhil Mohan, Sofía Dudas, Jeffrey Hawke, Vijay Badrinarayanan, Roberto Cipolla, Alex Kendall

2021-04-21 | ICCV 2021 10

Tasks: Sensor Fusion, Navigate, Future prediction, Autonomous Driving, Semantic Segmentation, Prediction, Bird's-Eye View Semantic Segmentation, Instance Segmentation

Abstract

Driving requires interacting with road agents and predicting their future behaviour in order to navigate safely. We present FIERY: a probabilistic future prediction model in bird's-eye view from monocular cameras. Our model predicts future instance segmentation and motion of dynamic agents that can be transformed into non-parametric future trajectories. Our approach combines the perception, sensor fusion and prediction components of a traditional autonomous driving stack by estimating bird's-eye-view prediction directly from surround RGB monocular camera inputs. FIERY learns to model the inherent stochastic nature of the future solely from camera driving data in an end-to-end manner, without relying on HD maps, and predicts multimodal future trajectories. We show that our model outperforms previous prediction baselines on the NuScenes and Lyft datasets. The code and trained models are available at https://github.com/wayveai/fiery.

Results

Task | Dataset | Metric | Value | Model
Semantic Segmentation | nuScenes | IoU ped - 224x480 - Vis filter. - 100x100 at 0.5 | 17.2 | FIERY (static)
Semantic Segmentation | nuScenes | IoU veh - 224x480 - No vis filter - 100x100 at 0.5 | 35.8 | FIERY (static)
Semantic Segmentation | nuScenes | IoU veh - 224x480 - Vis filter. - 100x100 at 0.5 | 39.8 | FIERY (static)
Semantic Segmentation | nuScenes | IoU veh - 224x480 - No vis filter - 100x100 at 0.5 | 38.2 | FIERY
Semantic Segmentation | nuScenes | IoU veh - 224x480 - No vis filter - 100x50 at 0.25 | 41.1 | FIERY
Semantic Segmentation | nuScenes | IoU vehicle - Setting 3 | 58.5 | FIERY
Semantic Segmentation | Lyft Level 5 | IoU vehicle - 224x480 - Long | 36.7 | FIERY
Semantic Segmentation | Lyft Level 5 | IoU vehicle - 224x480 - Short | 59.4 | FIERY
10-shot image generation | nuScenes | IoU ped - 224x480 - Vis filter. - 100x100 at 0.5 | 17.2 | FIERY (static)
10-shot image generation | nuScenes | IoU veh - 224x480 - No vis filter - 100x100 at 0.5 | 35.8 | FIERY (static)
10-shot image generation | nuScenes | IoU veh - 224x480 - Vis filter. - 100x100 at 0.5 | 39.8 | FIERY (static)
10-shot image generation | nuScenes | IoU veh - 224x480 - No vis filter - 100x100 at 0.5 | 38.2 | FIERY
10-shot image generation | nuScenes | IoU veh - 224x480 - No vis filter - 100x50 at 0.25 | 41.1 | FIERY
10-shot image generation | nuScenes | IoU vehicle - Setting 3 | 58.5 | FIERY
10-shot image generation | Lyft Level 5 | IoU vehicle - 224x480 - Long | 36.7 | FIERY
10-shot image generation | Lyft Level 5 | IoU vehicle - 224x480 - Short | 59.4 | FIERY
Bird's-Eye View Semantic Segmentation | nuScenes | IoU ped - 224x480 - Vis filter. - 100x100 at 0.5 | 17.2 | FIERY (static)
Bird's-Eye View Semantic Segmentation | nuScenes | IoU veh - 224x480 - No vis filter - 100x100 at 0.5 | 35.8 | FIERY (static)
Bird's-Eye View Semantic Segmentation | nuScenes | IoU veh - 224x480 - Vis filter. - 100x100 at 0.5 | 39.8 | FIERY (static)
Bird's-Eye View Semantic Segmentation | nuScenes | IoU veh - 224x480 - No vis filter - 100x100 at 0.5 | 38.2 | FIERY
Bird's-Eye View Semantic Segmentation | nuScenes | IoU veh - 224x480 - No vis filter - 100x50 at 0.25 | 41.1 | FIERY
Bird's-Eye View Semantic Segmentation | nuScenes | IoU vehicle - Setting 3 | 58.5 | FIERY
Bird's-Eye View Semantic Segmentation | Lyft Level 5 | IoU vehicle - 224x480 - Long | 36.7 | FIERY
Bird's-Eye View Semantic Segmentation | Lyft Level 5 | IoU vehicle - 224x480 - Short | 59.4 | FIERY
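
Every metric in the results above is an intersection-over-union (IoU) score on a bird's-eye-view grid. As a rough illustration of how such a score is computed for one binary class mask, here is a minimal sketch. It is not the FIERY evaluation code: the function names `bev_grid_shape` and `binary_iou` are hypothetical, and it assumes that a label such as "100x100 at 0.5" denotes a 100 m by 100 m area rasterised at 0.5 m per cell.

```python
from typing import List, Tuple

Grid = List[List[int]]  # binary occupancy mask, 1 = cell belongs to the class


def bev_grid_shape(extent_x_m: float, extent_y_m: float,
                   resolution_m: float) -> Tuple[int, int]:
    """Number of cells along each axis for a given extent and cell size.

    Assumption: "100x100 at 0.5" means 100 m x 100 m at 0.5 m per cell.
    """
    return (round(extent_x_m / resolution_m), round(extent_y_m / resolution_m))


def binary_iou(pred: Grid, target: Grid) -> float:
    """Intersection-over-union between two binary masks of equal shape."""
    intersection = 0
    union = 0
    for pred_row, target_row in zip(pred, target):
        for p, t in zip(pred_row, target_row):
            if p and t:
                intersection += 1
            if p or t:
                union += 1
    # Convention chosen for this sketch only: two empty masks score 0.0.
    return intersection / union if union else 0.0


if __name__ == "__main__":
    pred = [[1, 1, 0],
            [0, 1, 0]]
    target = [[1, 0, 0],
              [0, 1, 1]]
    print(bev_grid_shape(100, 100, 0.5))  # (200, 200)
    print(binary_iou(pred, target))       # 0.5 (2 shared cells, 4 in the union)
```

The table reports IoU as a percentage, so a value of 0.5 from this sketch would correspond to 50.0 there.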

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction (2025-07-21)
Multi-Strategy Improved Snake Optimizer Accelerated CNN-LSTM-Attention-Adaboost for Trajectory Prediction (2025-07-21)
GEMINUS: Dual-aware Global and Scene-Adaptive Mixture-of-Experts for End-to-End Autonomous Driving (2025-07-19)
AGENTS-LLM: Augmentative GENeration of Challenging Traffic Scenarios with an Agentic LLM Framework (2025-07-18)
World Model-Based End-to-End Scene Generation for Accident Anticipation in Autonomous Driving (2025-07-17)
Orbis: Overcoming Challenges of Long-Horizon Prediction in Driving World Models (2025-07-17)
Channel-wise Motion Features for Efficient Motion Segmentation (2025-07-17)
LaViPlan : Language-Guided Visual Path Planning with RLVR (2025-07-17)