TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Very Deep Convolutional Networks for Large-Scale Image Rec...

Very Deep Convolutional Networks for Large-Scale Image Recognition

Karen Simonyan, Andrew Zisserman

2014-09-04Activity Recognition In VideosImage ClassificationFace Anti-SpoofingDomain GeneralizationGeneral ClassificationClassification
PaperPDFCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCode

Abstract

In this work we investigate the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting. Our main contribution is a thorough evaluation of networks of increasing depth using an architecture with very small (3x3) convolution filters, which shows that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 weight layers. These findings were the basis of our ImageNet Challenge 2014 submission, where our team secured the first and the second places in the localisation and classification tracks respectively. We also show that our representations generalise well to other datasets, where they achieve state-of-the-art results. We have made our two best-performing ConvNet models publicly available to facilitate further research on the use of deep visual representations in computer vision.

Results

TaskDatasetMetricValueModel
Depth EstimationSiW-Enroll5AUC97.8VGG16
Depth EstimationCelebA-Spoof-Enroll5AUC98VGG16
Facial Recognition and ModellingSiW-Enroll5AUC97.8VGG16
Facial Recognition and ModellingCelebA-Spoof-Enroll5AUC98VGG16
Image-to-Image TranslationGTAV-to-Cityscapes LabelsmIoU41.3VGG16 60.3
Domain AdaptationVizWiz-ClassificationAccuracy - All Images36.7VGG-16 BN
Domain AdaptationVizWiz-ClassificationAccuracy - Clean Images41.1VGG-16 BN
Domain AdaptationVizWiz-ClassificationAccuracy - Corrupted Images31.1VGG-16 BN
Domain AdaptationVizWiz-ClassificationAccuracy - All Images36.2VGG-19 BN
Domain AdaptationVizWiz-ClassificationAccuracy - Clean Images40.8VGG-19 BN
Domain AdaptationVizWiz-ClassificationAccuracy - Corrupted Images29.4VGG-19 BN
Domain AdaptationVizWiz-ClassificationAccuracy - All Images34.7VGG-19
Domain AdaptationVizWiz-ClassificationAccuracy - Clean Images39.3VGG-19
Domain AdaptationVizWiz-ClassificationAccuracy - Corrupted Images29VGG-19
Domain AdaptationVizWiz-ClassificationAccuracy - All Images34.7VGG-16
Domain AdaptationVizWiz-ClassificationAccuracy - Clean Images39.5VGG-16
Domain AdaptationVizWiz-ClassificationAccuracy - Corrupted Images28.5VGG-16
Domain AdaptationVizWiz-ClassificationAccuracy - All Images33.7VGG-13 BN
Domain AdaptationVizWiz-ClassificationAccuracy - Clean Images38.4VGG-13 BN
Domain AdaptationVizWiz-ClassificationAccuracy - Corrupted Images28.3VGG-13 BN
Domain AdaptationVizWiz-ClassificationAccuracy - All Images32.9VGG-11 BN
Domain AdaptationVizWiz-ClassificationAccuracy - Clean Images37.1VGG-11 BN
Domain AdaptationVizWiz-ClassificationAccuracy - Corrupted Images25.8VGG-11 BN
Domain AdaptationVizWiz-ClassificationAccuracy - All Images32.4VGG-13
Domain AdaptationVizWiz-ClassificationAccuracy - Clean Images36.5VGG-13
Domain AdaptationVizWiz-ClassificationAccuracy - Corrupted Images26.4VGG-13
Domain AdaptationVizWiz-ClassificationAccuracy - All Images31.5VGG-11
Domain AdaptationVizWiz-ClassificationAccuracy - Clean Images36.1VGG-11
Domain AdaptationVizWiz-ClassificationAccuracy - Corrupted Images25.2VGG-11
Image GenerationGTAV-to-Cityscapes LabelsmIoU41.3VGG16 60.3
Visual OdometrySiW-Enroll5AUC97.8VGG16
Visual OdometryCelebA-Spoof-Enroll5AUC98VGG16
Face ReconstructionSiW-Enroll5AUC97.8VGG16
Face ReconstructionCelebA-Spoof-Enroll5AUC98VGG16
3DSiW-Enroll5AUC97.8VGG16
3DCelebA-Spoof-Enroll5AUC98VGG16
3D Face ModellingSiW-Enroll5AUC97.8VGG16
3D Face ModellingCelebA-Spoof-Enroll5AUC98VGG16
3D Face ReconstructionSiW-Enroll5AUC97.8VGG16
3D Face ReconstructionCelebA-Spoof-Enroll5AUC98VGG16
Depth And Camera MotionSiW-Enroll5AUC97.8VGG16
Depth And Camera MotionCelebA-Spoof-Enroll5AUC98VGG16
ClassificationXImageNet-12Robustness Score0.8845VGG-16
Domain GeneralizationVizWiz-ClassificationAccuracy - All Images36.7VGG-16 BN
Domain GeneralizationVizWiz-ClassificationAccuracy - Clean Images41.1VGG-16 BN
Domain GeneralizationVizWiz-ClassificationAccuracy - Corrupted Images31.1VGG-16 BN
Domain GeneralizationVizWiz-ClassificationAccuracy - All Images36.2VGG-19 BN
Domain GeneralizationVizWiz-ClassificationAccuracy - Clean Images40.8VGG-19 BN
Domain GeneralizationVizWiz-ClassificationAccuracy - Corrupted Images29.4VGG-19 BN
Domain GeneralizationVizWiz-ClassificationAccuracy - All Images34.7VGG-19
Domain GeneralizationVizWiz-ClassificationAccuracy - Clean Images39.3VGG-19
Domain GeneralizationVizWiz-ClassificationAccuracy - Corrupted Images29VGG-19
Domain GeneralizationVizWiz-ClassificationAccuracy - All Images34.7VGG-16
Domain GeneralizationVizWiz-ClassificationAccuracy - Clean Images39.5VGG-16
Domain GeneralizationVizWiz-ClassificationAccuracy - Corrupted Images28.5VGG-16
Domain GeneralizationVizWiz-ClassificationAccuracy - All Images33.7VGG-13 BN
Domain GeneralizationVizWiz-ClassificationAccuracy - Clean Images38.4VGG-13 BN
Domain GeneralizationVizWiz-ClassificationAccuracy - Corrupted Images28.3VGG-13 BN
Domain GeneralizationVizWiz-ClassificationAccuracy - All Images32.9VGG-11 BN
Domain GeneralizationVizWiz-ClassificationAccuracy - Clean Images37.1VGG-11 BN
Domain GeneralizationVizWiz-ClassificationAccuracy - Corrupted Images25.8VGG-11 BN
Domain GeneralizationVizWiz-ClassificationAccuracy - All Images32.4VGG-13
Domain GeneralizationVizWiz-ClassificationAccuracy - Clean Images36.5VGG-13
Domain GeneralizationVizWiz-ClassificationAccuracy - Corrupted Images26.4VGG-13
Domain GeneralizationVizWiz-ClassificationAccuracy - All Images31.5VGG-11
Domain GeneralizationVizWiz-ClassificationAccuracy - Clean Images36.1VGG-11
Domain GeneralizationVizWiz-ClassificationAccuracy - Corrupted Images25.2VGG-11
1 Image, 2*2 StitchingGTAV-to-Cityscapes LabelsmIoU41.3VGG16 60.3

Related Papers

Automatic Classification and Segmentation of Tunnel Cracks Based on Deep Learning and Visual Explanations2025-07-18Adversarial attacks to image classification systems using evolutionary algorithms2025-07-17Efficient Adaptation of Pre-trained Vision Transformer underpinned by Approximately Orthogonal Fine-Tuning Strategy2025-07-17Federated Learning for Commercial Image Sources2025-07-17MUPAX: Multidimensional Problem Agnostic eXplainable AI2025-07-17Simulate, Refocus and Ensemble: An Attention-Refocusing Scheme for Domain Generalization2025-07-17GLAD: Generalizable Tuning for Vision-Language Models2025-07-17MoTM: Towards a Foundation Model for Time Series Imputation based on Continuous Modeling2025-07-17