TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Fully Convolutional Networks for Semantic Segmentation

Fully Convolutional Networks for Semantic Segmentation

Evan Shelhamer, Jonathan Long, Trevor Darrell

2016-05-20CVPR 2015Real-Time Semantic SegmentationScene SegmentationSegmentationSemantic SegmentationVideo Semantic Segmentation
PaperPDFCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCode

Abstract

Convolutional networks are powerful visual models that yield hierarchies of features. We show that convolutional networks by themselves, trained end-to-end, pixels-to-pixels, improve on the previous best result in semantic segmentation. Our key insight is to build "fully convolutional" networks that take input of arbitrary size and produce correspondingly-sized output with efficient inference and learning. We define and detail the space of fully convolutional networks, explain their application to spatially dense prediction tasks, and draw connections to prior models. We adapt contemporary classification networks (AlexNet, the VGG net, and GoogLeNet) into fully convolutional networks and transfer their learned representations by fine-tuning to the segmentation task. We then define a skip architecture that combines semantic information from a deep, coarse layer with appearance information from a shallow, fine layer to produce accurate and detailed segmentations. Our fully convolutional network achieves improved segmentation of PASCAL VOC (30% relative improvement to 67.2% mean IU on 2012), NYUDv2, SIFT Flow, and PASCAL-Context, while inference takes one tenth of a second for a typical image.

Results

TaskDatasetMetricValueModel
Scene ParsingCityscapes valmIoU70.1FCN-50 [14]
Semantic SegmentationPASCAL VOC 2011 testMean IoU32FCN-VGG16
Semantic SegmentationPASCAL VOC 2011 testMean IoU22.4FCN-pool4
Semantic SegmentationNYU Depth v2Mean Accuracy44FCN-32s RGB-HHA
Semantic SegmentationSUN-RGBDMean IoU27.39FCN
Video Semantic SegmentationCityscapes valmIoU70.1FCN-50 [14]
Scene UnderstandingCityscapes valmIoU70.1FCN-50 [14]
2D Semantic SegmentationCityscapes valmIoU70.1FCN-50 [14]
Scene SegmentationSUN-RGBDMean IoU27.39FCN
10-shot image generationPASCAL VOC 2011 testMean IoU32FCN-VGG16
10-shot image generationPASCAL VOC 2011 testMean IoU22.4FCN-pool4
10-shot image generationNYU Depth v2Mean Accuracy44FCN-32s RGB-HHA
10-shot image generationSUN-RGBDMean IoU27.39FCN

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21Deep Learning-Based Fetal Lung Segmentation from Diffusion-weighted MRI Images and Lung Maturity Evaluation for Fetal Growth Restriction2025-07-17DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17From Variability To Accuracy: Conditional Bernoulli Diffusion Models with Consensus-Driven Correction for Thin Structure Segmentation2025-07-17Unleashing Vision Foundation Models for Coronary Artery Segmentation: Parallel ViT-CNN Encoding and Variational Fusion2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17