Fully Convolutional Networks for Semantic Segmentation

Jonathan Long, Evan Shelhamer, Trevor Darrell

2014-11-14CVPR 2015 6Multi-tissue Nucleus Segmentation Thermal Image Segmentation Crack Segmentation Segmentation Semantic Segmentation Multispectral Object Detection

Abstract

Convolutional networks are powerful visual models that yield hierarchies of features. We show that convolutional networks by themselves, trained end-to-end, pixels-to-pixels, exceed the state-of-the-art in semantic segmentation. Our key insight is to build "fully convolutional" networks that take input of arbitrary size and produce correspondingly-sized output with efficient inference and learning. We define and detail the space of fully convolutional networks, explain their application to spatially dense prediction tasks, and draw connections to prior models. We adapt contemporary classification networks (AlexNet, the VGG net, and GoogLeNet) into fully convolutional networks and transfer their learned representations by fine-tuning to the segmentation task. We then define a novel architecture that combines semantic information from a deep, coarse layer with appearance information from a shallow, fine layer to produce accurate and detailed segmentations. Our fully convolutional network achieves state-of-the-art segmentation of PASCAL VOC (20% relative improvement to 62.2% mean IU on 2012), NYUDv2, and SIFT Flow, while inference takes one third of a second for a typical image.

Results

Task	Dataset	Metric	Value	Model
Semantic Segmentation	Fine-Grained Grass Segmentation Dataset	mIoU	47.47	FCN
Semantic Segmentation	SELMA	mIoU	68.2	FCN
Semantic Segmentation	Event-based Segmentation Dataset	mIoU	59.6	FCN
Semantic Segmentation	PASCAL Context	mIoU	37.8	FCN-8s
Semantic Segmentation	SkyScapes-Dense	Mean IoU	33.06	FCN8s (ResNet-50)
Semantic Segmentation	SkyScapes-Lane	Mean IoU	13.74	FCN8s (ResNet-50)
Semantic Segmentation	Trans10K	GFLOPs	42.23	FCN
Semantic Segmentation	ADE20K	Validation mIoU	29.39	FCN
Semantic Segmentation	CrackVision12K	mIoU	0.59842	FCN
Multi-tissue Nucleus Segmentation	Kumar	Dice	0.797	FCN8 (e)
Multi-tissue Nucleus Segmentation	Kumar	Hausdorff Distance (mm)	31.2	FCN8 (e)
Multispectral Object Detection	KAIST Multispectral Pedestrian Detection Benchmark	All Miss Rate	51.7	FusionRPN+BF
10-shot image generation	Fine-Grained Grass Segmentation Dataset	mIoU	47.47	FCN
10-shot image generation	SELMA	mIoU	68.2	FCN
10-shot image generation	Event-based Segmentation Dataset	mIoU	59.6	FCN
10-shot image generation	PASCAL Context	mIoU	37.8	FCN-8s
10-shot image generation	SkyScapes-Dense	Mean IoU	33.06	FCN8s (ResNet-50)
10-shot image generation	SkyScapes-Lane	Mean IoU	13.74	FCN8s (ResNet-50)
10-shot image generation	Trans10K	GFLOPs	42.23	FCN
10-shot image generation	ADE20K	Validation mIoU	29.39	FCN
10-shot image generation	CrackVision12K	mIoU	0.59842	FCN

Fully Convolutional Networks for Semantic Segmentation

Abstract

Results

Related Papers

Fully Convolutional Networks for Semantic Segmentation

Abstract

Results

Related Papers