TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Face Detection with End-to-End Integration of a ConvNet an...

Face Detection with End-to-End Integration of a ConvNet and a 3D Model

Yunzhu Li, Benyuan Sun, Tianfu Wu, Yizhou Wang

2016-06-02Face VerificationFace Modelobject-detectionObject DetectionFace Detection
PaperPDFCodeCodeCodeCodeCode(official)

Abstract

This paper presents a method for face detection in the wild, which integrates a ConvNet and a 3D mean face model in an end-to-end multi-task discriminative learning framework. The 3D mean face model is predefined and fixed (e.g., we used the one provided in the AFLW dataset). The ConvNet consists of two components: (i) The face pro- posal component computes face bounding box proposals via estimating facial key-points and the 3D transformation (rotation and translation) parameters for each predicted key-point w.r.t. the 3D mean face model. (ii) The face verification component computes detection results by prun- ing and refining proposals based on facial key-points based configuration pooling. The proposed method addresses two issues in adapting state- of-the-art generic object detection ConvNets (e.g., faster R-CNN) for face detection: (i) One is to eliminate the heuristic design of prede- fined anchor boxes in the region proposals network (RPN) by exploit- ing a 3D mean face model. (ii) The other is to replace the generic RoI (Region-of-Interest) pooling layer with a configuration pooling layer to respect underlying object structures. The multi-task loss consists of three terms: the classification Softmax loss and the location smooth l1 -losses [14] of both the facial key-points and the face bounding boxes. In ex- periments, our ConvNet is trained on the AFLW dataset only and tested on the FDDB benchmark with fine-tuning and on the AFW benchmark without fine-tuning. The proposed method obtains very competitive state-of-the-art performance in the two benchmarks.

Results

TaskDatasetMetricValueModel
Facial Recognition and ModellingAnnotated Faces in the WildAP0.9597Conv3D
Face DetectionAnnotated Faces in the WildAP0.9597Conv3D
Face ReconstructionAnnotated Faces in the WildAP0.9597Conv3D
3DAnnotated Faces in the WildAP0.9597Conv3D
3D Face ModellingAnnotated Faces in the WildAP0.9597Conv3D
3D Face ReconstructionAnnotated Faces in the WildAP0.9597Conv3D

Related Papers

ProxyFusion: Face Feature Aggregation Through Sparse Experts2025-09-24DiffClean: Diffusion-based Makeup Removal for Accurate Age Estimation2025-07-17A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping2025-07-15