Yifan Sun, Changmao Cheng, Yuhan Zhang, Chi Zhang, Liang Zheng, Zhongdao Wang, Yichen Wei
This paper provides a pair similarity optimization viewpoint on deep feature learning, aiming to maximize the within-class similarity $s_p$ and minimize the between-class similarity $s_n$. We find a majority of loss functions, including the triplet loss and the softmax plus cross-entropy loss, embed $s_n$ and $s_p$ into similarity pairs and seek to reduce $(s_n-s_p)$. Such an optimization manner is inflexible, because the penalty strength on every single similarity score is restricted to be equal. Our intuition is that if a similarity score deviates far from the optimum, it should be emphasized. To this end, we simply re-weight each similarity to highlight the less-optimized similarity scores. It results in a Circle loss, which is named due to its circular decision boundary. The Circle loss has a unified formula for two elemental deep feature learning approaches, i.e. learning with class-level labels and pair-wise labels. Analytically, we show that the Circle loss offers a more flexible optimization approach towards a more definite convergence target, compared with the loss functions optimizing $(s_n-s_p)$. Experimentally, we demonstrate the superiority of the Circle loss on a variety of deep feature learning tasks. On face recognition, person re-identification, as well as several fine-grained image retrieval datasets, the achieved performance is on par with the state of the art.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Facial Recognition and Modelling | LFW | Accuracy | 0.9973 | CircleLoss |
| Facial Recognition and Modelling | CFP-FP | Accuracy | 0.9602 | CircleLoss(ours) |
| Person Re-Identification | MSMT17 | Rank-1 | 76.9 | MGN + CircleLoss(ours) |
| Person Re-Identification | MSMT17 | mAP | 52.1 | MGN + CircleLoss(ours) |
| Person Re-Identification | MSMT17 | Rank-1 | 76.3 | ResNet50 + CircleLoss(ours) |
| Person Re-Identification | MSMT17 | mAP | 50.2 | ResNet50 + CircleLoss(ours) |
| Person Re-Identification | Market-1501 | Rank-1 | 96.1 | MGN + CircleLoss(ours) |
| Person Re-Identification | Market-1501 | mAP | 87.4 | MGN + CircleLoss(ours) |
| Person Re-Identification | Market-1501 | Rank-1 | 94.2 | ResNet50 + CircleLoss(ours) |
| Person Re-Identification | Market-1501 | mAP | 84.9 | ResNet50 + CircleLoss(ours) |
| Face Reconstruction | LFW | Accuracy | 0.9973 | CircleLoss |
| Face Reconstruction | CFP-FP | Accuracy | 0.9602 | CircleLoss(ours) |
| Face Recognition | LFW | Accuracy | 0.9973 | CircleLoss |
| Face Recognition | CFP-FP | Accuracy | 0.9602 | CircleLoss(ours) |
| 3D | LFW | Accuracy | 0.9973 | CircleLoss |
| 3D | CFP-FP | Accuracy | 0.9602 | CircleLoss(ours) |
| Metric Learning | CARS196 | R@1 | 83.4 | CircleLoss |
| Metric Learning | Stanford Online Products | R@1 | 78.3 | Circle Loss |
| 3D Face Modelling | LFW | Accuracy | 0.9973 | CircleLoss |
| 3D Face Modelling | CFP-FP | Accuracy | 0.9602 | CircleLoss(ours) |
| 3D Face Reconstruction | LFW | Accuracy | 0.9973 | CircleLoss |
| 3D Face Reconstruction | CFP-FP | Accuracy | 0.9602 | CircleLoss(ours) |