Fengfu Li, Hong Qiao, Bo Zhang, Xuanyang Xi
Traditional image clustering methods take a two-step approach, feature learning and clustering, sequentially. However, recent research results demonstrated that combining the separated phases in a unified framework and training them jointly can achieve a better performance. In this paper, we first introduce fully convolutional auto-encoders for image feature learning and then propose a unified clustering framework to learn image representations and cluster centers jointly based on a fully convolutional auto-encoder and soft $k$-means scores. At initial stages of the learning procedure, the representations extracted from the auto-encoder may not be very discriminative for latter clustering. We address this issue by adopting a boosted discriminative distribution, where high score assignments are highlighted and low score ones are de-emphasized. With the gradually boosted discrimination, clustering assignment scores are discriminated and cluster purities are enlarged. Experiments on several vision benchmark datasets show that our methods can achieve a state-of-the-art performance.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Image Clustering | MNIST-full | Accuracy | 0.976 | DBC |
| Image Clustering | MNIST-full | NMI | 0.937 | DBC |
| Image Clustering | USPS | Accuracy | 0.743 | DBC |
| Image Clustering | USPS | NMI | 0.724 | DBC |
| Image Clustering | Coil-20 | Accuracy | 0.793 | DBC |
| Image Clustering | Coil-20 | NMI | 0.895 | DBC |
| Image Clustering | coil-100 | Accuracy | 0.775 | DBC |
| Image Clustering | coil-100 | NMI | 0.905 | DBC |