TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Supervision-by-Registration: An Unsupervised Approach to I...

Supervision-by-Registration: An Unsupervised Approach to Improve the Precision of Facial Landmark Detectors

Xuanyi Dong, Shoou-I Yu, Xinshuo Weng, Shih-En Wei, Yi Yang, Yaser Sheikh

2018-07-03CVPR 2018 6Optical Flow EstimationFacial Landmark Detection
PaperPDFCode

Abstract

In this paper, we present supervision-by-registration, an unsupervised approach to improve the precision of facial landmark detectors on both images and video. Our key observation is that the detections of the same landmark in adjacent frames should be coherent with registration, i.e., optical flow. Interestingly, the coherency of optical flow is a source of supervision that does not require manual labeling, and can be leveraged during detector training. For example, we can enforce in the training loss function that a detected landmark at frame$_{t-1}$ followed by optical flow tracking from frame$_{t-1}$ to frame$_t$ should coincide with the location of the detection at frame$_t$. Essentially, supervision-by-registration augments the training loss function with a registration loss, thus training the detector to have output that is not only close to the annotations in labeled images, but also consistent with registration on large amounts of unlabeled videos. End-to-end training with the registration loss is made possible by a differentiable Lucas-Kanade operation, which computes optical flow registration in the forward pass, and back-propagates gradients that encourage temporal coherency in the detector. The output of our method is a more precise image-based facial landmark detector, which can be applied to single images or video. With supervision-by-registration, we demonstrate (1) improvements in facial landmark detection on both images (300W, ALFW) and video (300VW, Youtube-Celebrities), and (2) significant reduction of jittering in video detections.

Results

TaskDatasetMetricValueModel
Facial Recognition and Modelling300-VW (C)AUC0.08 private59.39CPM+SBR+PAM
Facial Recognition and Modelling300-VW (C)AUC0.08 private58.22CPM+SBR
Facial Landmark Detection300-VW (C)AUC0.08 private59.39CPM+SBR+PAM
Facial Landmark Detection300-VW (C)AUC0.08 private58.22CPM+SBR
Face Reconstruction300-VW (C)AUC0.08 private59.39CPM+SBR+PAM
Face Reconstruction300-VW (C)AUC0.08 private58.22CPM+SBR
3D300-VW (C)AUC0.08 private59.39CPM+SBR+PAM
3D300-VW (C)AUC0.08 private58.22CPM+SBR
3D Face Modelling300-VW (C)AUC0.08 private59.39CPM+SBR+PAM
3D Face Modelling300-VW (C)AUC0.08 private58.22CPM+SBR
3D Face Reconstruction300-VW (C)AUC0.08 private59.39CPM+SBR+PAM
3D Face Reconstruction300-VW (C)AUC0.08 private58.22CPM+SBR

Related Papers

Channel-wise Motion Features for Efficient Motion Segmentation2025-07-17An Efficient Approach for Muscle Segmentation and 3D Reconstruction Using Keypoint Tracking in MRI Scan2025-07-11Learning to Track Any Points from Human Motion2025-07-08TLB-VFI: Temporal-Aware Latent Brownian Bridge Diffusion for Video Frame Interpolation2025-07-07MEMFOF: High-Resolution Training for Memory-Efficient Multi-Frame Optical Flow Estimation2025-06-29EndoFlow-SLAM: Real-Time Endoscopic SLAM with Flow-Constrained Gaussian Splatting2025-06-26WAFT: Warping-Alone Field Transforms for Optical Flow2025-06-26Feature Hallucination for Self-supervised Action Recognition2025-06-25