Dhruv Makwana, Subhrajit Nag, Onkar Susladkar, Gayatri Deshmukh, Sai Chandra Teja R, Sparsh Mittal, C Krishna Mohan
We propose a novel deep learning model named ACLNet, for cloud segmentation from ground images. ACLNet uses both deep neural network and machine learning (ML) algorithm to extract complementary features. Specifically, it uses EfficientNet-B0 as the backbone, "`a trous spatial pyramid pooling" (ASPP) to learn at multiple receptive fields, and "global attention module" (GAM) to extract finegrained details from the image. ACLNet also uses k-means clustering to extract cloud boundaries more precisely. ACLNet is effective for both daytime and nighttime images. It provides lower error rate, higher recall and higher F1-score than state-of-art cloud segmentation models. The source-code of ACLNet is available here: https://github.com/ckmvigil/ACLNet.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Semantic Segmentation | SWINySEG | Average Precision | 0.959 | ACLNet |
| Semantic Segmentation | SWINySEG | Average Recall | 0.979 | ACLNet |
| Semantic Segmentation | SWINySEG | F1-Score | 0.968 | ACLNet |
| Semantic Segmentation | SWINySEG | MCC | 0.96 | ACLNet |
| Semantic Segmentation | SWINySEG | Mean IoU | 0.993 | ACLNet |
| Semantic Segmentation | SWINSEG | Average Precision | 0.917 | ACLNet |
| Semantic Segmentation | SWINSEG | Average Recall | 0.982 | ACLNet |
| Semantic Segmentation | SWINSEG | F1-Score | 0.947 | ACLNet |
| Semantic Segmentation | SWINSEG | MCC | 0.93 | ACLNet |
| Semantic Segmentation | SWINSEG | Mean IoU | 0.985 | ACLNet |
| Semantic Segmentation | SWIMSEG | Average Precision | 0.964 | ACLNet |
| Semantic Segmentation | SWIMSEG | Average Recall | 0.979 | ACLNet |
| Semantic Segmentation | SWIMSEG | F1-Score | 0.971 | ACLNet |
| Semantic Segmentation | SWIMSEG | MCC | 0.956 | ACLNet |
| Semantic Segmentation | SWIMSEG | Mean IoU | 0.992 | ACLNet |
| 10-shot image generation | SWINySEG | Average Precision | 0.959 | ACLNet |
| 10-shot image generation | SWINySEG | Average Recall | 0.979 | ACLNet |
| 10-shot image generation | SWINySEG | F1-Score | 0.968 | ACLNet |
| 10-shot image generation | SWINySEG | MCC | 0.96 | ACLNet |
| 10-shot image generation | SWINySEG | Mean IoU | 0.993 | ACLNet |
| 10-shot image generation | SWINSEG | Average Precision | 0.917 | ACLNet |
| 10-shot image generation | SWINSEG | Average Recall | 0.982 | ACLNet |
| 10-shot image generation | SWINSEG | F1-Score | 0.947 | ACLNet |
| 10-shot image generation | SWINSEG | MCC | 0.93 | ACLNet |
| 10-shot image generation | SWINSEG | Mean IoU | 0.985 | ACLNet |
| 10-shot image generation | SWIMSEG | Average Precision | 0.964 | ACLNet |
| 10-shot image generation | SWIMSEG | Average Recall | 0.979 | ACLNet |
| 10-shot image generation | SWIMSEG | F1-Score | 0.971 | ACLNet |
| 10-shot image generation | SWIMSEG | MCC | 0.956 | ACLNet |
| 10-shot image generation | SWIMSEG | Mean IoU | 0.992 | ACLNet |