TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/BlitzNet: A Real-Time Deep Network for Scene Understanding

BlitzNet: A Real-Time Deep Network for Scene Understanding

Nikita Dvornik, Konstantin Shmelkov, Julien Mairal, Cordelia Schmid

2017-08-09ICCV 2017 10Real-Time Semantic SegmentationScene UnderstandingSegmentationReal-Time Object DetectionAutonomous DrivingSemantic Segmentationobject-detectionObject Detection
PaperPDFCodeCode

Abstract

Real-time scene understanding has become crucial in many applications such as autonomous driving. In this paper, we propose a deep architecture, called BlitzNet, that jointly performs object detection and semantic segmentation in one forward pass, allowing real-time computations. Besides the computational gain of having a single network to perform several tasks, we show that object detection and semantic segmentation benefit from each other in terms of accuracy. Experimental results for VOC and COCO datasets show state-of-the-art performance for object detection and segmentation among real time systems.

Results

TaskDatasetMetricValueModel
Object DetectionPASCAL VOC 2007FPS24BlitzNet512 (s4)
Object DetectionPASCAL VOC 2007FPS19.5BlitzNet512 (s8)
3DPASCAL VOC 2007FPS24BlitzNet512 (s4)
3DPASCAL VOC 2007FPS19.5BlitzNet512 (s8)
2D ClassificationPASCAL VOC 2007FPS24BlitzNet512 (s4)
2D ClassificationPASCAL VOC 2007FPS19.5BlitzNet512 (s8)
2D Object DetectionPASCAL VOC 2007FPS24BlitzNet512 (s4)
2D Object DetectionPASCAL VOC 2007FPS19.5BlitzNet512 (s8)
16kPASCAL VOC 2007FPS24BlitzNet512 (s4)
16kPASCAL VOC 2007FPS19.5BlitzNet512 (s8)

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21GEMINUS: Dual-aware Global and Scene-Adaptive Mixture-of-Experts for End-to-End Autonomous Driving2025-07-19AGENTS-LLM: Augmentative GENeration of Challenging Traffic Scenarios with an Agentic LLM Framework2025-07-18Advancing Complex Wide-Area Scene Understanding with Hierarchical Coresets Selection2025-07-17Argus: Leveraging Multiview Images for Improved 3-D Scene Understanding With Large Language Models2025-07-17City-VLM: Towards Multidomain Perception Scene Understanding via Multimodal Incomplete Learning2025-07-17Deep Learning-Based Fetal Lung Segmentation from Diffusion-weighted MRI Images and Lung Maturity Evaluation for Fetal Growth Restriction2025-07-17DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17