GeoJEPAD
GeoJEPA Dataset
Modalities: Graphs, Images, Texts
License: Apache 2.0
Introduced: 2025-02-25
GeoJEPAD is a multimodal dataset that combines OpenStreetMap (OSM) data (attributes and geometries) with high-resolution aerial imagery of diverse urban areas. The data is sourced from NAIP (National Agriculture Imagery Program) and OSM, then processed, tiled, and cropped. Geometries and relations are represented as graphs with optional visibility edges.
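As an illustration of what a graph with visibility edges can look like, the following minimal Python sketch connects two feature nodes whenever the straight line between them is not blocked by an obstacle segment. This is not the dataset's actual construction procedure, which this card does not specify. The function names (`segments_cross`, `visibility_edges`), the node names, and the coordinates are all hypothetical and chosen only for the example.

```python
from itertools import combinations


def _orient(a, b, c):
    """Sign of the cross product (b - a) x (c - a): 1, -1, or 0 if collinear."""
    v = (b[0] - a[0]) * (c[1] - a[1]) - (b[1] - a[1]) * (c[0] - a[0])
    return (v > 0) - (v < 0)


def segments_cross(p1, p2, q1, q2):
    """True if segments p1-p2 and q1-q2 cross at a point interior to both."""
    return (
        _orient(p1, p2, q1) * _orient(p1, p2, q2) < 0
        and _orient(q1, q2, p1) * _orient(q1, q2, p2) < 0
    )


def visibility_edges(nodes, obstacles):
    """Return pairs of node ids whose connecting segment crosses no obstacle.

    nodes: dict mapping a node id to an (x, y) point (hypothetical layout).
    obstacles: list of ((x1, y1), (x2, y2)) segments, e.g. building walls.
    """
    edges = []
    for (i, a), (j, b) in combinations(nodes.items(), 2):
        if not any(segments_cross(a, b, q1, q2) for q1, q2 in obstacles):
            edges.append((i, j))
    return edges


# Toy example: a wall stands between "cafe" and "park".
nodes = {"cafe": (0.0, 0.0), "park": (4.0, 0.0), "school": (2.0, 3.0)}
obstacles = [((2.0, -1.0), (2.0, 1.0))]
edges = visibility_edges(nodes, obstacles)
print(edges)
```

In this toy layout the wall blocks the cafe-park pair, so only the two edges involving the school remain. A real pipeline would more likely use a geometry library and full polygon footprints rather than single segments.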
Motivation:
The dataset was created to reduce the biases that traditional augmentation and sampling techniques introduce, in support of self-supervised multimodal geospatial learning.
Potential Use Cases:
- Self-supervised multimodal representation learning.
- Semantic segmentation of aerial imagery.
- Geospatial retrieval and urban analytics tasks.
- Benchmarking multimodal fusion models like JEPA.