TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Searching for MobileNetV3

Searching for MobileNetV3

Andrew Howard, Mark Sandler, Grace Chu, Liang-Chieh Chen, Bo Chen, Mingxing Tan, Weijun Wang, Yukun Zhu, Ruoming Pang, Vijay Vasudevan, Quoc V. Le, Hartwig Adam

2019-05-06ICCV 2019 10Dichotomous Image SegmentationImage ClassificationSegmentationSemantic SegmentationNeural Architecture SearchClassificationObject Detection
PaperPDFCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCode

Abstract

We present the next generation of MobileNets based on a combination of complementary search techniques as well as a novel architecture design. MobileNetV3 is tuned to mobile phone CPUs through a combination of hardware-aware network architecture search (NAS) complemented by the NetAdapt algorithm and then subsequently improved through novel architecture advances. This paper starts the exploration of how automated search algorithms and network design can work together to harness complementary approaches improving the overall state of the art. Through this process we create two new MobileNet models for release: MobileNetV3-Large and MobileNetV3-Small which are targeted for high and low resource use cases. These models are then adapted and applied to the tasks of object detection and semantic segmentation. For the task of semantic segmentation (or any dense pixel prediction), we propose a new efficient segmentation decoder Lite Reduced Atrous Spatial Pyramid Pooling (LR-ASPP). We achieve new state of the art results for mobile classification, detection and segmentation. MobileNetV3-Large is 3.2\% more accurate on ImageNet classification while reducing latency by 15\% compared to MobileNetV2. MobileNetV3-Small is 4.6\% more accurate while reducing latency by 5\% compared to MobileNetV2. MobileNetV3-Large detection is 25\% faster at roughly the same accuracy as MobileNetV2 on COCO detection. MobileNetV3-Large LR-ASPP is 30\% faster than MobileNetV2 R-ASPP at similar accuracy for Cityscapes segmentation.

Results

TaskDatasetMetricValueModel
Semantic SegmentationDADA-segmIoU18.2MobileNetV3 (MobileNetV3small)
Object DetectionDIS-TE4E-measure0.848MBV3
Object DetectionDIS-TE4HCE3817MBV3
Object DetectionDIS-TE4MAE0.098MBV3
Object DetectionDIS-TE4S-Measure0.77MBV3
Object DetectionDIS-TE4max F-Measure0.736MBV3
Object DetectionDIS-TE4weighted F-measure0.664MBV3
Object DetectionDIS-VDE-measure0.841MBV3
Object DetectionDIS-VDHCE1625MBV3
Object DetectionDIS-VDMAE0.092MBV3
Object DetectionDIS-VDS-Measure0.758MBV3
Object DetectionDIS-VDmax F-Measure0.714MBV3
Object DetectionDIS-VDweighted F-measure0.642MBV3
Object DetectionDIS-TE2E-measure0.856MBV3
Object DetectionDIS-TE2HCE600MBV3
Object DetectionDIS-TE2MAE0.083MBV3
Object DetectionDIS-TE2S-Measure0.777MBV3
Object DetectionDIS-TE2max F-Measure0.743MBV3
Object DetectionDIS-TE2weighted F-measure0.672MBV3
Object DetectionDIS-TE1E-measure0.818MBV3
Object DetectionDIS-TE1HCE274MBV3
Object DetectionDIS-TE1MAE0.083MBV3
Object DetectionDIS-TE1S-Measure0.74MBV3
Object DetectionDIS-TE1max F-Measure0.669MBV3
Object DetectionDIS-TE1weighted F-measure0.595MBV3
Object DetectionDIS-TE3E-measure0.88MBV3
Object DetectionDIS-TE3HCE1136MBV3
Object DetectionDIS-TE3MAE0.078MBV3
Object DetectionDIS-TE3S-Measure0.764MBV3
Object DetectionDIS-TE3max F-Measure0.772MBV3
Object DetectionDIS-TE3weighted F-measure0.702MBV3
Image ClassificationImageNetGFLOPs0.438MobileNet V3-Large 1.0
3DDIS-TE4E-measure0.848MBV3
3DDIS-TE4HCE3817MBV3
3DDIS-TE4MAE0.098MBV3
3DDIS-TE4S-Measure0.77MBV3
3DDIS-TE4max F-Measure0.736MBV3
3DDIS-TE4weighted F-measure0.664MBV3
3DDIS-VDE-measure0.841MBV3
3DDIS-VDHCE1625MBV3
3DDIS-VDMAE0.092MBV3
3DDIS-VDS-Measure0.758MBV3
3DDIS-VDmax F-Measure0.714MBV3
3DDIS-VDweighted F-measure0.642MBV3
3DDIS-TE2E-measure0.856MBV3
3DDIS-TE2HCE600MBV3
3DDIS-TE2MAE0.083MBV3
3DDIS-TE2S-Measure0.777MBV3
3DDIS-TE2max F-Measure0.743MBV3
3DDIS-TE2weighted F-measure0.672MBV3
3DDIS-TE1E-measure0.818MBV3
3DDIS-TE1HCE274MBV3
3DDIS-TE1MAE0.083MBV3
3DDIS-TE1S-Measure0.74MBV3
3DDIS-TE1max F-Measure0.669MBV3
3DDIS-TE1weighted F-measure0.595MBV3
3DDIS-TE3E-measure0.88MBV3
3DDIS-TE3HCE1136MBV3
3DDIS-TE3MAE0.078MBV3
3DDIS-TE3S-Measure0.764MBV3
3DDIS-TE3max F-Measure0.772MBV3
3DDIS-TE3weighted F-measure0.702MBV3
RGB Salient Object DetectionDIS-TE4E-measure0.848MBV3
RGB Salient Object DetectionDIS-TE4HCE3817MBV3
RGB Salient Object DetectionDIS-TE4MAE0.098MBV3
RGB Salient Object DetectionDIS-TE4S-Measure0.77MBV3
RGB Salient Object DetectionDIS-TE4max F-Measure0.736MBV3
RGB Salient Object DetectionDIS-TE4weighted F-measure0.664MBV3
RGB Salient Object DetectionDIS-VDE-measure0.841MBV3
RGB Salient Object DetectionDIS-VDHCE1625MBV3
RGB Salient Object DetectionDIS-VDMAE0.092MBV3
RGB Salient Object DetectionDIS-VDS-Measure0.758MBV3
RGB Salient Object DetectionDIS-VDmax F-Measure0.714MBV3
RGB Salient Object DetectionDIS-VDweighted F-measure0.642MBV3
RGB Salient Object DetectionDIS-TE2E-measure0.856MBV3
RGB Salient Object DetectionDIS-TE2HCE600MBV3
RGB Salient Object DetectionDIS-TE2MAE0.083MBV3
RGB Salient Object DetectionDIS-TE2S-Measure0.777MBV3
RGB Salient Object DetectionDIS-TE2max F-Measure0.743MBV3
RGB Salient Object DetectionDIS-TE2weighted F-measure0.672MBV3
RGB Salient Object DetectionDIS-TE1E-measure0.818MBV3
RGB Salient Object DetectionDIS-TE1HCE274MBV3
RGB Salient Object DetectionDIS-TE1MAE0.083MBV3
RGB Salient Object DetectionDIS-TE1S-Measure0.74MBV3
RGB Salient Object DetectionDIS-TE1max F-Measure0.669MBV3
RGB Salient Object DetectionDIS-TE1weighted F-measure0.595MBV3
RGB Salient Object DetectionDIS-TE3E-measure0.88MBV3
RGB Salient Object DetectionDIS-TE3HCE1136MBV3
RGB Salient Object DetectionDIS-TE3MAE0.078MBV3
RGB Salient Object DetectionDIS-TE3S-Measure0.764MBV3
RGB Salient Object DetectionDIS-TE3max F-Measure0.772MBV3
RGB Salient Object DetectionDIS-TE3weighted F-measure0.702MBV3
2D ClassificationDIS-TE4E-measure0.848MBV3
2D ClassificationDIS-TE4HCE3817MBV3
2D ClassificationDIS-TE4MAE0.098MBV3
2D ClassificationDIS-TE4S-Measure0.77MBV3
2D ClassificationDIS-TE4max F-Measure0.736MBV3
2D ClassificationDIS-TE4weighted F-measure0.664MBV3
2D ClassificationDIS-VDE-measure0.841MBV3
2D ClassificationDIS-VDHCE1625MBV3
2D ClassificationDIS-VDMAE0.092MBV3
2D ClassificationDIS-VDS-Measure0.758MBV3
2D ClassificationDIS-VDmax F-Measure0.714MBV3
2D ClassificationDIS-VDweighted F-measure0.642MBV3
2D ClassificationDIS-TE2E-measure0.856MBV3
2D ClassificationDIS-TE2HCE600MBV3
2D ClassificationDIS-TE2MAE0.083MBV3
2D ClassificationDIS-TE2S-Measure0.777MBV3
2D ClassificationDIS-TE2max F-Measure0.743MBV3
2D ClassificationDIS-TE2weighted F-measure0.672MBV3
2D ClassificationDIS-TE1E-measure0.818MBV3
2D ClassificationDIS-TE1HCE274MBV3
2D ClassificationDIS-TE1MAE0.083MBV3
2D ClassificationDIS-TE1S-Measure0.74MBV3
2D ClassificationDIS-TE1max F-Measure0.669MBV3
2D ClassificationDIS-TE1weighted F-measure0.595MBV3
2D ClassificationDIS-TE3E-measure0.88MBV3
2D ClassificationDIS-TE3HCE1136MBV3
2D ClassificationDIS-TE3MAE0.078MBV3
2D ClassificationDIS-TE3S-Measure0.764MBV3
2D ClassificationDIS-TE3max F-Measure0.772MBV3
2D ClassificationDIS-TE3weighted F-measure0.702MBV3
2D Object DetectionDIS-TE4E-measure0.848MBV3
2D Object DetectionDIS-TE4HCE3817MBV3
2D Object DetectionDIS-TE4MAE0.098MBV3
2D Object DetectionDIS-TE4S-Measure0.77MBV3
2D Object DetectionDIS-TE4max F-Measure0.736MBV3
2D Object DetectionDIS-TE4weighted F-measure0.664MBV3
2D Object DetectionDIS-VDE-measure0.841MBV3
2D Object DetectionDIS-VDHCE1625MBV3
2D Object DetectionDIS-VDMAE0.092MBV3
2D Object DetectionDIS-VDS-Measure0.758MBV3
2D Object DetectionDIS-VDmax F-Measure0.714MBV3
2D Object DetectionDIS-VDweighted F-measure0.642MBV3
2D Object DetectionDIS-TE2E-measure0.856MBV3
2D Object DetectionDIS-TE2HCE600MBV3
2D Object DetectionDIS-TE2MAE0.083MBV3
2D Object DetectionDIS-TE2S-Measure0.777MBV3
2D Object DetectionDIS-TE2max F-Measure0.743MBV3
2D Object DetectionDIS-TE2weighted F-measure0.672MBV3
2D Object DetectionDIS-TE1E-measure0.818MBV3
2D Object DetectionDIS-TE1HCE274MBV3
2D Object DetectionDIS-TE1MAE0.083MBV3
2D Object DetectionDIS-TE1S-Measure0.74MBV3
2D Object DetectionDIS-TE1max F-Measure0.669MBV3
2D Object DetectionDIS-TE1weighted F-measure0.595MBV3
2D Object DetectionDIS-TE3E-measure0.88MBV3
2D Object DetectionDIS-TE3HCE1136MBV3
2D Object DetectionDIS-TE3MAE0.078MBV3
2D Object DetectionDIS-TE3S-Measure0.764MBV3
2D Object DetectionDIS-TE3max F-Measure0.772MBV3
2D Object DetectionDIS-TE3weighted F-measure0.702MBV3
10-shot image generationDADA-segmIoU18.2MobileNetV3 (MobileNetV3small)
16kDIS-TE4E-measure0.848MBV3
16kDIS-TE4HCE3817MBV3
16kDIS-TE4MAE0.098MBV3
16kDIS-TE4S-Measure0.77MBV3
16kDIS-TE4max F-Measure0.736MBV3
16kDIS-TE4weighted F-measure0.664MBV3
16kDIS-VDE-measure0.841MBV3
16kDIS-VDHCE1625MBV3
16kDIS-VDMAE0.092MBV3
16kDIS-VDS-Measure0.758MBV3
16kDIS-VDmax F-Measure0.714MBV3
16kDIS-VDweighted F-measure0.642MBV3
16kDIS-TE2E-measure0.856MBV3
16kDIS-TE2HCE600MBV3
16kDIS-TE2MAE0.083MBV3
16kDIS-TE2S-Measure0.777MBV3
16kDIS-TE2max F-Measure0.743MBV3
16kDIS-TE2weighted F-measure0.672MBV3
16kDIS-TE1E-measure0.818MBV3
16kDIS-TE1HCE274MBV3
16kDIS-TE1MAE0.083MBV3
16kDIS-TE1S-Measure0.74MBV3
16kDIS-TE1max F-Measure0.669MBV3
16kDIS-TE1weighted F-measure0.595MBV3
16kDIS-TE3E-measure0.88MBV3
16kDIS-TE3HCE1136MBV3
16kDIS-TE3MAE0.078MBV3
16kDIS-TE3S-Measure0.764MBV3
16kDIS-TE3max F-Measure0.772MBV3
16kDIS-TE3weighted F-measure0.702MBV3

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21Automatic Classification and Segmentation of Tunnel Cracks Based on Deep Learning and Visual Explanations2025-07-18Adversarial attacks to image classification systems using evolutionary algorithms2025-07-17Efficient Adaptation of Pre-trained Vision Transformer underpinned by Approximately Orthogonal Fine-Tuning Strategy2025-07-17Federated Learning for Commercial Image Sources2025-07-17MUPAX: Multidimensional Problem Agnostic eXplainable AI2025-07-17Deep Learning-Based Fetal Lung Segmentation from Diffusion-weighted MRI Images and Lung Maturity Evaluation for Fetal Growth Restriction2025-07-17DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17