TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Weakly Supervised Cascaded Convolutional Networks

Weakly Supervised Cascaded Convolutional Networks

Ali Diba, Vivek Sharma, Ali Pazandeh, Hamed Pirsiavash, Luc van Gool

2016-11-24CVPR 2017 7Weakly Supervised Object DetectionMultiple Instance LearningSemantic Segmentationobject-detectionObject Detection
PaperPDF

Abstract

Object detection is a challenging task in visual understanding domain, and even more so if the supervision is to be weak. Recently, few efforts to handle the task without expensive human annotations is established by promising deep neural network. A new architecture of cascaded networks is proposed to learn a convolutional neural network (CNN) under such conditions. We introduce two such architectures, with either two cascade stages or three which are trained in an end-to-end pipeline. The first stage of both architectures extracts best candidate of class specific region proposals by training a fully convolutional network. In the case of the three stage architecture, the middle stage provides object segmentation, using the output of the activation maps of first stage. The final stage of both architectures is a part of a convolutional neural network that performs multiple instance learning on proposals extracted in the previous stage(s). Our experiments on the PASCAL VOC 2007, 2010, 2012 and large scale object datasets, ILSVRC 2013, 2014 datasets show improvements in the areas of weakly-supervised object detection, classification and localization.

Results

TaskDatasetMetricValueModel
Object DetectionPASCAL VOC 2007MAP42.8WCCN
Object DetectionPASCAL VOC 2012 testMAP37.9WCCN
Object DetectionCOCO test-devAP5012.3WCCN
Object DetectionImageNetMAP16.3WCCN
3DPASCAL VOC 2007MAP42.8WCCN
3DPASCAL VOC 2012 testMAP37.9WCCN
3DCOCO test-devAP5012.3WCCN
3DImageNetMAP16.3WCCN
2D ClassificationPASCAL VOC 2007MAP42.8WCCN
2D ClassificationPASCAL VOC 2012 testMAP37.9WCCN
2D ClassificationCOCO test-devAP5012.3WCCN
2D ClassificationImageNetMAP16.3WCCN
2D Object DetectionPASCAL VOC 2007MAP42.8WCCN
2D Object DetectionPASCAL VOC 2012 testMAP37.9WCCN
2D Object DetectionCOCO test-devAP5012.3WCCN
2D Object DetectionImageNetMAP16.3WCCN
16kPASCAL VOC 2007MAP42.8WCCN
16kPASCAL VOC 2012 testMAP37.9WCCN
16kCOCO test-devAP5012.3WCCN
16kImageNetMAP16.3WCCN

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17