TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Tangent Images for Mitigating Spherical Distortion

Tangent Images for Mitigating Spherical Distortion

Marc Eder, Mykhailo Shvets, John Lim, Jan-Michael Frahm

2019-12-19CVPR 2020 6Semantic Segmentation
PaperPDFCode(official)

Abstract

In this work, we propose "tangent images," a spherical image representation that facilitates transferable and scalable $360^\circ$ computer vision. Inspired by techniques in cartography and computer graphics, we render a spherical image to a set of distortion-mitigated, locally-planar image grids tangent to a subdivided icosahedron. By varying the resolution of these grids independently of the subdivision level, we can effectively represent high resolution spherical images while still benefiting from the low-distortion icosahedral spherical approximation. We show that training standard convolutional neural networks on tangent images compares favorably to the many specialized spherical convolutional kernels that have been developed, while also scaling efficiently to handle significantly higher spherical resolutions. Furthermore, because our approach does not require specialized kernels, we show that we can transfer networks trained on perspective images to spherical data without fine-tuning and with limited performance drop-off. Finally, we demonstrate that tangent images can be used to improve the quality of sparse feature detection on spherical images, illustrating its usefulness for traditional computer vision tasks like structure-from-motion and SLAM.

Results

TaskDatasetMetricValueModel
Semantic SegmentationStanford2D3D Panoramic - RGBDmAcc69.1Tangent (ResNet-101)
Semantic SegmentationStanford2D3D Panoramic - RGBDmIoU51.9Tangent (ResNet-101)
Semantic SegmentationStanford2D3D PanoramicmAcc65.2Tangent (ResNet-101)
10-shot image generationStanford2D3D Panoramic - RGBDmAcc69.1Tangent (ResNet-101)
10-shot image generationStanford2D3D Panoramic - RGBDmIoU51.9Tangent (ResNet-101)
10-shot image generationStanford2D3D PanoramicmAcc65.2Tangent (ResNet-101)

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17SAMST: A Transformer framework based on SAM pseudo label filtering for remote sensing semi-supervised semantic segmentation2025-07-16Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping2025-07-15U-RWKV: Lightweight medical image segmentation with direction-adaptive RWKV2025-07-15