Xianghan Meng, Zhiyuan Huang, wei he, Xianbiao Qi, Rong Xiao, Chun-Guang Li
Subspace clustering is a classical unsupervised learning task, built on a basic assumption that high-dimensional data can be approximated by a union of subspaces (UoS). Nevertheless, the real-world data are often deviating from the UoS assumption. To address this challenge, state-of-the-art deep subspace clustering algorithms attempt to jointly learn UoS representations and self-expressive coefficients. However, the general framework of the existing algorithms suffers from a catastrophic feature collapse and lacks a theoretical guarantee to learn desired UoS representation. In this paper, we present a Principled fRamewOrk for Deep Subspace Clustering (PRO-DSC), which is designed to learn structured representations and self-expressive coefficients in a unified manner. Specifically, in PRO-DSC, we incorporate an effective regularization on the learned representations into the self-expressive model, prove that the regularized self-expressive model is able to prevent feature space collapse, and demonstrate that the learned optimal representations under certain condition lie on a union of orthogonal subspaces. Moreover, we provide a scalable and efficient approach to implement our PRO-DSC and conduct extensive experiments to verify our theoretical findings and demonstrate the superior performance of our proposed deep subspace clustering approach. The code is available at https://github.com/mengxianghan123/PRO-DSC.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Image Clustering | CIFAR-10 | Accuracy | 0.972 | PRO-DSC |
| Image Clustering | CIFAR-10 | NMI | 0.928 | PRO-DSC |
| Image Clustering | Tiny-ImageNet | Accuracy | 0.698 | PRO-DSC |
| Image Clustering | Tiny-ImageNet | NMI | 0.805 | PRO-DSC |
| Image Clustering | CIFAR-100 | Accuracy | 0.773 | PRO-DSC |
| Image Clustering | CIFAR-100 | NMI | 0.824 | PRO-DSC |
| Image Clustering | ImageNet | Accuracy | 65 | PRO-DSC |
| Image Clustering | ImageNet | NMI | 83.4 | PRO-DSC |
| Image Clustering | Imagenet-dog-15 | Accuracy | 0.84 | PRO-DSC |
| Image Clustering | Imagenet-dog-15 | NMI | 0.812 | PRO-DSC |
| Image Classification | CIFAR-20 | Accuracy | 71.6 | PRO-DSC |
| Image Classification | CIFAR-20 | NMI | 73.2 | PRO-DSC |