TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Joint Face Detection and Alignment using Multi-task Cascad...

Joint Face Detection and Alignment using Multi-task Cascaded Convolutional Networks

Kaipeng Zhang, Zhanpeng Zhang, Zhifeng Li, Yu Qiao

2016-04-11Face AlignmentCode GenerationFace Detection
PaperPDFCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCode

Abstract

Face detection and alignment in unconstrained environment are challenging due to various poses, illuminations and occlusions. Recent studies show that deep learning approaches can achieve impressive performance on these two tasks. In this paper, we propose a deep cascaded multi-task framework which exploits the inherent correlation between them to boost up their performance. In particular, our framework adopts a cascaded structure with three stages of carefully designed deep convolutional networks that predict face and landmark location in a coarse-to-fine manner. In addition, in the learning process, we propose a new online hard sample mining strategy that can improve the performance automatically without manual sample selection. Our method achieves superior accuracy over the state-of-the-art techniques on the challenging FDDB and WIDER FACE benchmark for face detection, and AFLW benchmark for face alignment, while keeps real time performance.

Results

TaskDatasetMetricValueModel
Facial Recognition and ModellingWIDER Face (Medium)AP0.82Multitask Cascade CNN
Facial Recognition and ModellingWIDER Face (Easy)AP0.851Multitask Cascade CNN
Facial Recognition and ModellingWIDER Face (Hard)AP0.607Multitask Cascade CNN
Face DetectionWIDER Face (Medium)AP0.82Multitask Cascade CNN
Face DetectionWIDER Face (Easy)AP0.851Multitask Cascade CNN
Face DetectionWIDER Face (Hard)AP0.607Multitask Cascade CNN
Face ReconstructionWIDER Face (Medium)AP0.82Multitask Cascade CNN
Face ReconstructionWIDER Face (Easy)AP0.851Multitask Cascade CNN
Face ReconstructionWIDER Face (Hard)AP0.607Multitask Cascade CNN
3DWIDER Face (Medium)AP0.82Multitask Cascade CNN
3DWIDER Face (Easy)AP0.851Multitask Cascade CNN
3DWIDER Face (Hard)AP0.607Multitask Cascade CNN
3D Face ModellingWIDER Face (Medium)AP0.82Multitask Cascade CNN
3D Face ModellingWIDER Face (Easy)AP0.851Multitask Cascade CNN
3D Face ModellingWIDER Face (Hard)AP0.607Multitask Cascade CNN
3D Face ReconstructionWIDER Face (Medium)AP0.82Multitask Cascade CNN
3D Face ReconstructionWIDER Face (Easy)AP0.851Multitask Cascade CNN
3D Face ReconstructionWIDER Face (Hard)AP0.607Multitask Cascade CNN

Related Papers

CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning2025-07-18Towards Formal Verification of LLM-Generated Code from Natural Language Prompts2025-07-17MERA Code: A Unified Framework for Evaluating Code Generation Across Tasks2025-07-16Scaling Up RL: Unlocking Diverse Reasoning in LLMs via Prolonged Training2025-07-16The Devil behind the mask: An emergent safety vulnerability of Diffusion LLMs2025-07-15Kodezi Chronos: A Debugging-First Language Model for Repository-Scale, Memory-Driven Code Understanding2025-07-14CodeJudgeBench: Benchmarking LLM-as-a-Judge for Coding Tasks2025-07-14CodeAssistBench (CAB): Dataset & Benchmarking for Multi-turn Chat-Based Code Assistance2025-07-14