GeoJEPAD

GeoJEPA Dataset

GraphsImagesTextsApache 2.0Introduced 2025-02-25

GeoJEPAD is a multimodal dataset combining OpenStreetMap (OSM) data (attributes and geometries) with high-resolution aerial imagery from diverse urban areas.

 Sourced from NAIP and OSM and then processed, tiled, and cropped. Geometries and relations represented as graphs with optional visibility edges.

Motivation:

Created to reduce biases introduced by traditional augmentation and sampling techniques, the dataset supports unbiased self-supervised multimodal geospatial learning.

Potential Use Cases:

  • Self-supervised multimodal representation learning.
  • Semantic segmentation of aerial imagery.
  • Geospatial retrieval and urban analytics tasks.
  • Benchmarking multimodal fusion models like JEPA.