Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.


Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


SPVLoc: Semantic Panoramic Viewport Matching for 6D Camera Localization in Unseen Environments

Niklas Gard, Anna Hilsmann, Peter Eisert

2024-04-16
Tasks: Visual Localization; Indoor Localization (6-DoF Pose); Indoor Localization; Indoor Localization (3-DoF Pose: X, Y, Yaw); Camera Localization; 6D Pose Estimation

Abstract

In this paper, we present SPVLoc, a global indoor localization method that accurately determines the six-dimensional (6D) camera pose of a query image while requiring minimal scene-specific prior knowledge and no scene-specific training. Our approach employs a novel matching procedure to localize the perspective camera's viewport, given as an RGB image, within a set of panoramic semantic layout representations of the indoor environment. The panoramas are rendered from an untextured 3D reference model, which comprises only approximate structural information about room shapes, along with door and window annotations. We demonstrate that a straightforward convolutional network structure can successfully achieve image-to-panorama and ultimately image-to-model matching. Using a viewport classification score, we rank the reference panoramas and select the best match for the query image. A 6D relative pose is then estimated between the chosen panorama and the query image. Our experiments demonstrate that this approach not only efficiently bridges the domain gap but also generalizes well to previously unseen scenes that are not part of the training data. Moreover, it achieves superior localization accuracy compared to state-of-the-art methods and also estimates more degrees of freedom of the camera pose. Our source code is publicly available at https://fraunhoferhhi.github.io/spvloc.
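
The rank-then-estimate flow described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the toy scores, and the pose convention (4x4 homogeneous transforms, with the relative pose mapping camera coordinates into panorama coordinates) are all assumptions made for illustration. The network that produces the scores and the relative pose is not reproduced; both are treated as given inputs.

```python
import numpy as np

def rank_panoramas(scores):
    """Return panorama indices sorted by descending matching score."""
    scores = np.asarray(scores, dtype=float)
    return np.argsort(-scores, kind="stable")

def make_pose(rotation, translation):
    """Build a 4x4 homogeneous transform from a 3x3 rotation and a 3-vector."""
    pose = np.eye(4)
    pose[:3, :3] = rotation
    pose[:3, 3] = translation
    return pose

def compose_camera_pose(world_from_pano, pano_from_cam):
    """Chain the panorama's known pose with the estimated relative pose."""
    return world_from_pano @ pano_from_cam

# Hypothetical scores for three reference panoramas (higher is better).
scores = [0.12, 0.87, 0.45]
ranking = rank_panoramas(scores)
best = int(ranking[0])

# Hypothetical panorama poses in the reference model (identity rotation).
pano_poses = [
    make_pose(np.eye(3), [0.0, 0.0, 1.5]),
    make_pose(np.eye(3), [2.0, 3.0, 1.5]),
    make_pose(np.eye(3), [5.0, 1.0, 1.5]),
]

# Hypothetical relative pose: 90 degree yaw plus a small offset.
yaw = np.pi / 2
rel_rotation = np.array([
    [np.cos(yaw), -np.sin(yaw), 0.0],
    [np.sin(yaw),  np.cos(yaw), 0.0],
    [0.0,          0.0,         1.0],
])
pano_from_cam = make_pose(rel_rotation, [0.5, -0.2, 0.0])

world_from_cam = compose_camera_pose(pano_poses[best], pano_from_cam)
print("best panorama:", best)
print("camera position:", world_from_cam[:3, 3])
```

Keeping the ranking and the pose composition as separate steps mirrors the two stages named in the abstract: retrieval selects a reference panorama, and the relative pose then places the camera in the model's coordinate frame.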

Results

| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Visual Place Recognition | Structured3D (perspective, furnished) | <1m median rotation error (deg) | 1.26 | SPVLoc |
| Visual Place Recognition | Structured3D (perspective, furnished) | <1m median translation error (cm) | 12.86 | SPVLoc |
| Visual Place Recognition | Structured3D (perspective, furnished) | Recall 10cm (%) | 30.55 | SPVLoc |
| Visual Place Recognition | Structured3D (perspective, furnished) | Recall 1m (%) | 84.5 | SPVLoc |
| Visual Place Recognition | Structured3D (perspective, furnished) | Recall 1m 30 deg (%) | 84.2 | SPVLoc |
| Visual Place Recognition | Structured3D (perspective, furnished) | Recall 50cm (%) | 81.58 | SPVLoc |
| Visual Place Recognition | Structured3D (perspective, empty) | <1m median rotation error (deg) | 1.14 | SPVLoc |
| Visual Place Recognition | Structured3D (perspective, empty) | <1m median translation error (cm) | 11.7 | SPVLoc |
| Visual Place Recognition | Structured3D (perspective, empty) | Recall 10cm (%) | 35.99 | SPVLoc |
| Visual Place Recognition | Structured3D (perspective, empty) | Recall 1m (%) | 86.73 | SPVLoc |
| Visual Place Recognition | Structured3D (perspective, empty) | Recall 1m 30 deg (%) | 86.25 | SPVLoc |
| Visual Place Recognition | Structured3D (perspective, empty) | Recall 50cm (%) | 84.06 | SPVLoc |
| Visual Place Recognition | Structured3D (perspective, furnished) | <1m median rotation error (deg) | 1.62 | SPVLoc |
| Visual Place Recognition | Structured3D (perspective, furnished) | <1m median translation error (cm) | 14.3 | SPVLoc |
| Visual Place Recognition | Structured3D (perspective, furnished) | Recall 10cm (%) | 23.54 | SPVLoc |
| Visual Place Recognition | Structured3D (perspective, furnished) | Recall 1m (%) | 84.5 | SPVLoc |
| Visual Place Recognition | Structured3D (perspective, furnished) | Recall 1m 30 deg (%) | 84.18 | SPVLoc |
| Visual Place Recognition | Structured3D (perspective, furnished) | Recall 50cm (%) | 81.48 | SPVLoc |
| Visual Place Recognition | Structured3D (perspective, empty) | <1m median rotation error (deg) | 1.5 | SPVLoc |
| Visual Place Recognition | Structured3D (perspective, empty) | <1m median translation error (cm) | 13.09 | SPVLoc |
| Visual Place Recognition | Structured3D (perspective, empty) | Recall 10cm (%) | 28.77 | SPVLoc |
| Visual Place Recognition | Structured3D (perspective, empty) | Recall 1m (%) | 86.73 | SPVLoc |
| Visual Place Recognition | Structured3D (perspective, empty) | Recall 1m 30 deg (%) | 86.21 | SPVLoc |
| Visual Place Recognition | Structured3D (perspective, empty) | Recall 50cm (%) | 83.97 | SPVLoc |
| Indoor Localization | Structured3D (perspective, furnished) | <1m median rotation error (deg) | 1.26 | SPVLoc |
| Indoor Localization | Structured3D (perspective, furnished) | <1m median translation error (cm) | 12.86 | SPVLoc |
| Indoor Localization | Structured3D (perspective, furnished) | Recall 10cm (%) | 30.55 | SPVLoc |
| Indoor Localization | Structured3D (perspective, furnished) | Recall 1m (%) | 84.5 | SPVLoc |
| Indoor Localization | Structured3D (perspective, furnished) | Recall 1m 30 deg (%) | 84.2 | SPVLoc |
| Indoor Localization | Structured3D (perspective, furnished) | Recall 50cm (%) | 81.58 | SPVLoc |
| Indoor Localization | Structured3D (perspective, empty) | <1m median rotation error (deg) | 1.14 | SPVLoc |
| Indoor Localization | Structured3D (perspective, empty) | <1m median translation error (cm) | 11.7 | SPVLoc |
| Indoor Localization | Structured3D (perspective, empty) | Recall 10cm (%) | 35.99 | SPVLoc |
| Indoor Localization | Structured3D (perspective, empty) | Recall 1m (%) | 86.73 | SPVLoc |
| Indoor Localization | Structured3D (perspective, empty) | Recall 1m 30 deg (%) | 86.25 | SPVLoc |
| Indoor Localization | Structured3D (perspective, empty) | Recall 50cm (%) | 84.06 | SPVLoc |
| Indoor Localization | Structured3D (perspective, furnished) | <1m median rotation error (deg) | 1.62 | SPVLoc |
| Indoor Localization | Structured3D (perspective, furnished) | <1m median translation error (cm) | 14.3 | SPVLoc |
| Indoor Localization | Structured3D (perspective, furnished) | Recall 10cm (%) | 23.54 | SPVLoc |
| Indoor Localization | Structured3D (perspective, furnished) | Recall 1m (%) | 84.5 | SPVLoc |
| Indoor Localization | Structured3D (perspective, furnished) | Recall 1m 30 deg (%) | 84.18 | SPVLoc |
| Indoor Localization | Structured3D (perspective, furnished) | Recall 50cm (%) | 81.48 | SPVLoc |
| Indoor Localization | Structured3D (perspective, empty) | <1m median rotation error (deg) | 1.5 | SPVLoc |
| Indoor Localization | Structured3D (perspective, empty) | <1m median translation error (cm) | 13.09 | SPVLoc |
| Indoor Localization | Structured3D (perspective, empty) | Recall 10cm (%) | 28.77 | SPVLoc |
| Indoor Localization | Structured3D (perspective, empty) | Recall 1m (%) | 86.73 | SPVLoc |
| Indoor Localization | Structured3D (perspective, empty) | Recall 1m 30 deg (%) | 86.21 | SPVLoc |
| Indoor Localization | Structured3D (perspective, empty) | Recall 50cm (%) | 83.97 | SPVLoc |
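
As a reading aid for the metric names in the table, the sketch below shows one plausible way such numbers could be computed from per-query errors. It rests on assumptions not confirmed by this page: that "Recall X" is the percentage of queries with translation error below X, that "Recall 1m 30 deg" additionally requires rotation error below 30 degrees, and that the "<1m median" errors are medians over only those queries localized within 1 m. The function name and the toy error values are hypothetical.

```python
import numpy as np

def localization_metrics(t_err_m, r_err_deg):
    """Summarize per-query translation (m) and rotation (deg) errors.

    Assumed definitions: strict '<' thresholds, and medians restricted
    to queries with translation error below 1 m.
    """
    t = np.asarray(t_err_m, dtype=float)
    r = np.asarray(r_err_deg, dtype=float)
    within_1m = t < 1.0

    metrics = {
        "recall_10cm": float(100.0 * np.mean(t < 0.10)),
        "recall_50cm": float(100.0 * np.mean(t < 0.50)),
        "recall_1m": float(100.0 * np.mean(within_1m)),
        "recall_1m_30deg": float(100.0 * np.mean(within_1m & (r < 30.0))),
    }
    if within_1m.any():
        # Medians over the successfully localized subset, translation in cm.
        metrics["median_t_cm"] = float(100.0 * np.median(t[within_1m]))
        metrics["median_r_deg"] = float(np.median(r[within_1m]))
    else:
        metrics["median_t_cm"] = float("nan")
        metrics["median_r_deg"] = float("nan")
    return metrics

# Toy errors for four hypothetical queries (not data from the paper).
t_err = [0.05, 0.30, 0.80, 2.50]
r_err = [1.0, 2.0, 45.0, 90.0]
result = localization_metrics(t_err, r_err)
for name, value in result.items():
    print(f"{name}: {value:.2f}")
```

Under these assumptions, a gross failure such as the fourth toy query lowers every recall value but does not affect the medians, which is why the table reports both kinds of metric.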

Related Papers

Kaleidoscopic Background Attack: Disrupting Pose Estimation with Multi-Fold Radial Symmetry Textures (2025-07-14)
Privacy-Preserving Multi-Stage Fall Detection Framework with Semi-supervised Federated Learning and Robotic Vision Confirmation (2025-07-14)
Evaluating Attribute Confusion in Fashion Text-to-Image Generation (2025-07-09)
SenseShift6D: Multimodal RGB-D Benchmarking for Robust 6D Pose Estimation across Environment and Sensor Variations (2025-07-08)
A Privacy-Preserving Indoor Localization System based on Hierarchical Federated Learning (2025-07-02)
MatChA: Cross-Algorithm Matching with Feature Augmentation (2025-06-27)
OracleFusion: Assisting the Decipherment of Oracle Bone Script with Structurally Constrained Semantic Typography (2025-06-26)
RAG-6DPose: Retrieval-Augmented 6D Pose Estimation via Leveraging CAD as Knowledge Base (2025-06-23)