TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Semantic-Aware Scene Recognition

Semantic-Aware Scene Recognition

Alejandro López-Cifuentes, Marcos Escudero-Viñolo, Jesús Bescós, Álvaro García-Martín

2019-09-05Scene ClassificationScene RecognitionSemantic Segmentation
PaperPDFCode(official)

Abstract

Scene recognition is currently one of the top-challenging research fields in computer vision. This may be due to the ambiguity between classes: images of several scene classes may share similar objects, which causes confusion among them. The problem is aggravated when images of a particular scene class are notably different. Convolutional Neural Networks (CNNs) have significantly boosted performance in scene recognition, albeit it is still far below from other recognition tasks (e.g., object or image recognition). In this paper, we describe a novel approach for scene recognition based on an end-to-end multi-modal CNN that combines image and context information by means of an attention module. Context information, in the shape of semantic segmentation, is used to gate features extracted from the RGB image by leveraging on information encoded in the semantic representation: the set of scene objects and stuff, and their relative locations. This gating process reinforces the learning of indicative scene content and enhances scene disambiguation by refocusing the receptive fields of the CNN towards them. Experimental results on four publicly available datasets show that the proposed approach outperforms every other state-of-the-art method while significantly reducing the number of network parameters. All the code and data used along this paper is available at https://github.com/vpulab/Semantic-Aware-Scene-Recognition

Results

TaskDatasetMetricValueModel
Scene ParsingMIT Indoor ScenesAccuracy87.1Semantic-Aware Scene Recognition (ResNet-50)
Scene ParsingPlaces365Top 1 Accuracy56.51Semantic-Aware Scene Recognition (ResNet-18)
Scene ParsingPlaces365Top 5 Accuracy86Semantic-Aware Scene Recognition (ResNet-18)
Scene ParsingADE20KTop 1 Accuracy62.55Semantic-Aware Scene Recogniton (ResNet-18)
Scene ParsingSUN397Accuracy74.04Semantic-Aware Scene Recognition (ResNet-50)
AnimationMIT Indoor ScenesAccuracy87.1Semantic-Aware Scene Recognition (ResNet-50)
AnimationPlaces365Top 1 Accuracy56.51Semantic-Aware Scene Recognition (ResNet-18)
AnimationPlaces365Top 5 Accuracy86Semantic-Aware Scene Recognition (ResNet-18)
AnimationADE20KTop 1 Accuracy62.55Semantic-Aware Scene Recogniton (ResNet-18)
AnimationSUN397Accuracy74.04Semantic-Aware Scene Recognition (ResNet-50)
3D Character Animation From A Single PhotoMIT Indoor ScenesAccuracy87.1Semantic-Aware Scene Recognition (ResNet-50)
3D Character Animation From A Single PhotoPlaces365Top 1 Accuracy56.51Semantic-Aware Scene Recognition (ResNet-18)
3D Character Animation From A Single PhotoPlaces365Top 5 Accuracy86Semantic-Aware Scene Recognition (ResNet-18)
3D Character Animation From A Single PhotoADE20KTop 1 Accuracy62.55Semantic-Aware Scene Recogniton (ResNet-18)
3D Character Animation From A Single PhotoSUN397Accuracy74.04Semantic-Aware Scene Recognition (ResNet-50)
2D Semantic SegmentationMIT Indoor ScenesAccuracy87.1Semantic-Aware Scene Recognition (ResNet-50)
2D Semantic SegmentationPlaces365Top 1 Accuracy56.51Semantic-Aware Scene Recognition (ResNet-18)
2D Semantic SegmentationPlaces365Top 5 Accuracy86Semantic-Aware Scene Recognition (ResNet-18)
2D Semantic SegmentationADE20KTop 1 Accuracy62.55Semantic-Aware Scene Recogniton (ResNet-18)
2D Semantic SegmentationSUN397Accuracy74.04Semantic-Aware Scene Recognition (ResNet-50)

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17SAMST: A Transformer framework based on SAM pseudo label filtering for remote sensing semi-supervised semantic segmentation2025-07-16Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping2025-07-15U-RWKV: Lightweight medical image segmentation with direction-adaptive RWKV2025-07-15