Fangqin Zhou, Mert Kilickaya, Joaquin Vanschoren
Hyperspectral image classification is gaining popularity for high-precision vision tasks in remote sensing, thanks to their ability to capture visual information available in a wide continuum of spectra. Researchers have been working on automating Hyperspectral image classification, with recent efforts leveraging Vision-Transformers. However, most research models only spectra information and lacks attention to the locality (i.e., neighboring pixels), which may be not sufficiently discriminative, resulting in performance limitations. To address this, we present three contributions: i) We introduce the Hyperspectral Locality-aware Image TransformEr (HyLITE), a vision transformer that models both local and spectral information, ii) A novel regularization function that promotes the integration of local-to-global information, and iii) Our proposed approach outperforms competing baselines by a significant margin, achieving up to 10% gains in accuracy. The trained models and the code are available at HyLITE.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Hyperspectral | Pavia University | OA@15perclass | 91.28 | HyLITE |
| Hyperspectral | Houston | OA@15perclass | 88.49 | HyLITE |
| Hyperspectral | Indian Pines | OA@15perclass | 89.8 | HyLITE |
| Hyperspectral | Indian Pines | Overall Accuracy | 89.8 | HyLITE |
| Image Classification | Pavia University | OA@15perclass | 91.28 | HyLITE |
| Image Classification | Houston | OA@15perclass | 88.49 | HyLITE |
| Image Classification | Indian Pines | OA@15perclass | 89.8 | HyLITE |
| Image Classification | Indian Pines | Overall Accuracy | 89.8 | HyLITE |
| Hyperspectral Image Segmentation | Pavia University | OA@15perclass | 91.28 | HyLITE |
| Hyperspectral Image Segmentation | Houston | OA@15perclass | 88.49 | HyLITE |
| Hyperspectral Image Segmentation | Indian Pines | OA@15perclass | 89.8 | HyLITE |
| Hyperspectral Image Segmentation | Indian Pines | Overall Accuracy | 89.8 | HyLITE |