Shifeng Zhang, Xiangyu Zhu, Zhen Lei, Hailin Shi, Xiaobo Wang, Stan Z. Li
This paper presents a real-time face detector, named Single Shot Scale-invariant Face Detector (S$^3$FD), which performs superiorly on various scales of faces with a single deep neural network, especially for small faces. Specifically, we try to solve the common problem that anchor-based detectors deteriorate dramatically as the objects become smaller. We make contributions in the following three aspects: 1) proposing a scale-equitable face detection framework to handle different scales of faces well. We tile anchors on a wide range of layers to ensure that all scales of faces have enough features for detection. Besides, we design anchor scales based on the effective receptive field and a proposed equal proportion interval principle; 2) improving the recall rate of small faces by a scale compensation anchor matching strategy; 3) reducing the false positive rate of small faces via a max-out background label. As a consequence, our method achieves state-of-the-art detection performance on all the common face detection benchmarks, including the AFW, PASCAL face, FDDB and WIDER FACE datasets, and can run at 36 FPS on a Nvidia Titan X (Pascal) for VGA-resolution images.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Facial Recognition and Modelling | WIDER Face (Medium) | AP | 0.924 | S3FD(F+S+M) |
| Facial Recognition and Modelling | WIDER Face (Easy) | AP | 0.937 | S3FD(F+S+M) |
| Facial Recognition and Modelling | PASCAL Face | AP | 0.9849 | S3FD |
| Facial Recognition and Modelling | FDDB | AP | 0.983 | S3FD |
| Facial Recognition and Modelling | WIDER Face (Hard) | AP | 0.852 | S3FD(F+S+M) |
| Face Detection | WIDER Face (Medium) | AP | 0.924 | S3FD(F+S+M) |
| Face Detection | WIDER Face (Easy) | AP | 0.937 | S3FD(F+S+M) |
| Face Detection | PASCAL Face | AP | 0.9849 | S3FD |
| Face Detection | FDDB | AP | 0.983 | S3FD |
| Face Detection | WIDER Face (Hard) | AP | 0.852 | S3FD(F+S+M) |
| Face Reconstruction | WIDER Face (Medium) | AP | 0.924 | S3FD(F+S+M) |
| Face Reconstruction | WIDER Face (Easy) | AP | 0.937 | S3FD(F+S+M) |
| Face Reconstruction | PASCAL Face | AP | 0.9849 | S3FD |
| Face Reconstruction | FDDB | AP | 0.983 | S3FD |
| Face Reconstruction | WIDER Face (Hard) | AP | 0.852 | S3FD(F+S+M) |
| 3D | WIDER Face (Medium) | AP | 0.924 | S3FD(F+S+M) |
| 3D | WIDER Face (Easy) | AP | 0.937 | S3FD(F+S+M) |
| 3D | PASCAL Face | AP | 0.9849 | S3FD |
| 3D | FDDB | AP | 0.983 | S3FD |
| 3D | WIDER Face (Hard) | AP | 0.852 | S3FD(F+S+M) |
| 3D Face Modelling | WIDER Face (Medium) | AP | 0.924 | S3FD(F+S+M) |
| 3D Face Modelling | WIDER Face (Easy) | AP | 0.937 | S3FD(F+S+M) |
| 3D Face Modelling | PASCAL Face | AP | 0.9849 | S3FD |
| 3D Face Modelling | FDDB | AP | 0.983 | S3FD |
| 3D Face Modelling | WIDER Face (Hard) | AP | 0.852 | S3FD(F+S+M) |
| 3D Face Reconstruction | WIDER Face (Medium) | AP | 0.924 | S3FD(F+S+M) |
| 3D Face Reconstruction | WIDER Face (Easy) | AP | 0.937 | S3FD(F+S+M) |
| 3D Face Reconstruction | PASCAL Face | AP | 0.9849 | S3FD |
| 3D Face Reconstruction | FDDB | AP | 0.983 | S3FD |
| 3D Face Reconstruction | WIDER Face (Hard) | AP | 0.852 | S3FD(F+S+M) |