TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Revisiting the Encoding of Satellite Image Time Series

Revisiting the Encoding of Satellite Image Time Series

Xin Cai, Yaxin Bi, Peter Nicholl, Roy Sterritt

2023-05-03Panoptic SegmentationRepresentation LearningSegmentationSemantic SegmentationTime Seriesobject-detectionObject DetectionImage Segmentation
PaperPDFCode(official)

Abstract

Satellite Image Time Series (SITS) representation learning is complex due to high spatiotemporal resolutions, irregular acquisition times, and intricate spatiotemporal interactions. These challenges result in specialized neural network architectures tailored for SITS analysis. The field has witnessed promising results achieved by pioneering researchers, but transferring the latest advances or established paradigms from Computer Vision (CV) to SITS is still highly challenging due to the existing suboptimal representation learning framework. In this paper, we develop a novel perspective of SITS processing as a direct set prediction problem, inspired by the recent trend in adopting query-based transformer decoders to streamline the object detection or image segmentation pipeline. We further propose to decompose the representation learning process of SITS into three explicit steps: collect-update-distribute, which is computationally efficient and suits for irregularly-sampled and asynchronous temporal satellite observations. Facilitated by the unique reformulation, our proposed temporal learning backbone of SITS, initially pre-trained on the resource efficient pixel-set format and then fine-tuned on the downstream dense prediction tasks, has attained new state-of-the-art (SOTA) results on the PASTIS benchmark dataset. Specifically, the clear separation between temporal and spatial components in the semantic/panoptic segmentation pipeline of SITS makes us leverage the latest advances in CV, such as the universal image segmentation architecture, resulting in a noticeable 2.5 points increase in mIoU and 8.8 points increase in PQ, respectively, compared to the best scores reported so far.

Results

TaskDatasetMetricValueModel
Semantic SegmentationPASTISMean IoU (test)67.9Exchanger+Mask2Former
Semantic SegmentationPASTISMean IoU (test)66.8Exchanger+Unet
Semantic SegmentationPASTISPQ52.6Exchanger+Mask2Former
Semantic SegmentationPASTISRQ61.6Exchanger+Mask2Former
Semantic SegmentationPASTISSQ84.6Exchanger+Mask2Former
Semantic SegmentationPASTISPQ47.8Exchanger+Unet+PaPs
Semantic SegmentationPASTISRQ58.9Exchanger+Unet+PaPs
Semantic SegmentationPASTISSQ80.3Exchanger+Unet+PaPs
10-shot image generationPASTISMean IoU (test)67.9Exchanger+Mask2Former
10-shot image generationPASTISMean IoU (test)66.8Exchanger+Unet
10-shot image generationPASTISPQ52.6Exchanger+Mask2Former
10-shot image generationPASTISRQ61.6Exchanger+Mask2Former
10-shot image generationPASTISSQ84.6Exchanger+Mask2Former
10-shot image generationPASTISPQ47.8Exchanger+Unet+PaPs
10-shot image generationPASTISRQ58.9Exchanger+Unet+PaPs
10-shot image generationPASTISSQ80.3Exchanger+Unet+PaPs
Panoptic SegmentationPASTISPQ52.6Exchanger+Mask2Former
Panoptic SegmentationPASTISRQ61.6Exchanger+Mask2Former
Panoptic SegmentationPASTISSQ84.6Exchanger+Mask2Former
Panoptic SegmentationPASTISPQ47.8Exchanger+Unet+PaPs
Panoptic SegmentationPASTISRQ58.9Exchanger+Unet+PaPs
Panoptic SegmentationPASTISSQ80.3Exchanger+Unet+PaPs

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21Touch in the Wild: Learning Fine-Grained Manipulation with a Portable Visuo-Tactile Gripper2025-07-20Spectral Bellman Method: Unifying Representation and Exploration in RL2025-07-17Boosting Team Modeling through Tempo-Relational Representation Learning2025-07-17Deep Learning-Based Fetal Lung Segmentation from Diffusion-weighted MRI Images and Lung Maturity Evaluation for Fetal Growth Restriction2025-07-17DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17From Variability To Accuracy: Conditional Bernoulli Diffusion Models with Consensus-Driven Correction for Thin Structure Segmentation2025-07-17Unleashing Vision Foundation Models for Coronary Artery Segmentation: Parallel ViT-CNN Encoding and Variational Fusion2025-07-17