TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/MAE-DET: Revisiting Maximum Entropy Principle in Zero-Shot...

MAE-DET: Revisiting Maximum Entropy Principle in Zero-Shot NAS for Efficient Object Detection

Zhenhong Sun, Ming Lin, Xiuyu Sun, Zhiyu Tan, Hao Li, Rong Jin

2021-11-26Neural Architecture Searchobject-detectionObject Detection
PaperPDFCode(official)

Abstract

In object detection, the detection backbone consumes more than half of the overall inference cost. Recent researches attempt to reduce this cost by optimizing the backbone architecture with the help of Neural Architecture Search (NAS). However, existing NAS methods for object detection require hundreds to thousands of GPU hours of searching, making them impractical in fast-paced research and development. In this work, we propose a novel zero-shot NAS method to address this issue. The proposed method, named MAE-DET, automatically designs efficient detection backbones via the Maximum Entropy Principle without training network parameters, reducing the architecture design cost to nearly zero yet delivering the state-of-the-art (SOTA) performance. Under the hood, MAE-DET maximizes the differential entropy of detection backbones, leading to a better feature extractor for object detection under the same computational budgets. After merely one GPU day of fully automatic design, MAE-DET innovates SOTA detection backbones on multiple detection benchmark datasets with little human intervention. Comparing to ResNet-50 backbone, MAE-DET is $+2.0\%$ better in mAP when using the same amount of FLOPs/parameters, and is $1.54$ times faster on NVIDIA V100 at the same mAP. Code and pre-trained models are available at https://github.com/alibaba/lightweight-neuralarchitecture-search.

Results

TaskDatasetMetricValueModel
Object DetectionCOCO minivalAP5065.5MAE-Det(MAE-Det-L+GFLV2)
Object DetectionCOCO minivalAP7552.2MAE-Det(MAE-Det-L+GFLV2)
Object DetectionCOCO minivalAPL61.1MAE-Det(MAE-Det-L+GFLV2)
Object DetectionCOCO minivalAPM51.9MAE-Det(MAE-Det-L+GFLV2)
Object DetectionCOCO minivalAPS30.3MAE-Det(MAE-Det-L+GFLV2)
Object DetectionCOCO minivalbox AP47.8MAE-Det(MAE-Det-L+GFLV2)
3DCOCO minivalAP5065.5MAE-Det(MAE-Det-L+GFLV2)
3DCOCO minivalAP7552.2MAE-Det(MAE-Det-L+GFLV2)
3DCOCO minivalAPL61.1MAE-Det(MAE-Det-L+GFLV2)
3DCOCO minivalAPM51.9MAE-Det(MAE-Det-L+GFLV2)
3DCOCO minivalAPS30.3MAE-Det(MAE-Det-L+GFLV2)
3DCOCO minivalbox AP47.8MAE-Det(MAE-Det-L+GFLV2)
2D ClassificationCOCO minivalAP5065.5MAE-Det(MAE-Det-L+GFLV2)
2D ClassificationCOCO minivalAP7552.2MAE-Det(MAE-Det-L+GFLV2)
2D ClassificationCOCO minivalAPL61.1MAE-Det(MAE-Det-L+GFLV2)
2D ClassificationCOCO minivalAPM51.9MAE-Det(MAE-Det-L+GFLV2)
2D ClassificationCOCO minivalAPS30.3MAE-Det(MAE-Det-L+GFLV2)
2D ClassificationCOCO minivalbox AP47.8MAE-Det(MAE-Det-L+GFLV2)
2D Object DetectionCOCO minivalAP5065.5MAE-Det(MAE-Det-L+GFLV2)
2D Object DetectionCOCO minivalAP7552.2MAE-Det(MAE-Det-L+GFLV2)
2D Object DetectionCOCO minivalAPL61.1MAE-Det(MAE-Det-L+GFLV2)
2D Object DetectionCOCO minivalAPM51.9MAE-Det(MAE-Det-L+GFLV2)
2D Object DetectionCOCO minivalAPS30.3MAE-Det(MAE-Det-L+GFLV2)
2D Object DetectionCOCO minivalbox AP47.8MAE-Det(MAE-Det-L+GFLV2)
16kCOCO minivalAP5065.5MAE-Det(MAE-Det-L+GFLV2)
16kCOCO minivalAP7552.2MAE-Det(MAE-Det-L+GFLV2)
16kCOCO minivalAPL61.1MAE-Det(MAE-Det-L+GFLV2)
16kCOCO minivalAPM51.9MAE-Det(MAE-Det-L+GFLV2)
16kCOCO minivalAPS30.3MAE-Det(MAE-Det-L+GFLV2)
16kCOCO minivalbox AP47.8MAE-Det(MAE-Det-L+GFLV2)

Related Papers

DASViT: Differentiable Architecture Search for Vision Transformer2025-07-17A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping2025-07-15ECORE: Energy-Conscious Optimized Routing for Deep Learning Models at the Edge2025-07-08