M. Saquib Sarfraz, Rainer Stiefelhagen
Cross modal face matching between the thermal and visible spectrum is a much desired capability for night-time surveillance and security applications. Due to a very large modality gap, thermal-to-visible face recognition is one of the most challenging face matching problem. In this paper, we present an approach to bridge this modality gap by a significant margin. Our approach captures the highly non-linear relationship between the two modalities by using a deep neural network. Our model attempts to learn a non-linear mapping from visible to thermal spectrum while preserving the identity information. We show substantive performance improvement on three difficult thermal-visible face datasets. The presented approach improves the state-of-the-art by more than 10\% on UND-X1 dataset and by more than 15-30\% on NVESD dataset in terms of Rank-1 identification. Our method bridges the drop in performance due to the modality gap by more than 40\%.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Facial Recognition and Modelling | UND-X1 | Rank-1 | 83.73 | DPM |
| Facial Recognition and Modelling | Carl | Rank-1 | 71 | DPM |
| Face Reconstruction | UND-X1 | Rank-1 | 83.73 | DPM |
| Face Reconstruction | Carl | Rank-1 | 71 | DPM |
| Face Recognition | UND-X1 | Rank-1 | 83.73 | DPM |
| Face Recognition | Carl | Rank-1 | 71 | DPM |
| 3D | UND-X1 | Rank-1 | 83.73 | DPM |
| 3D | Carl | Rank-1 | 71 | DPM |
| 3D Face Modelling | UND-X1 | Rank-1 | 83.73 | DPM |
| 3D Face Modelling | Carl | Rank-1 | 71 | DPM |
| 3D Face Reconstruction | UND-X1 | Rank-1 | 83.73 | DPM |
| 3D Face Reconstruction | Carl | Rank-1 | 71 | DPM |