TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/EFM3D: A Benchmark for Measuring Progress Towards 3D Egoce...

EFM3D: A Benchmark for Measuring Progress Towards 3D Egocentric Foundation Models

Julian Straub, Daniel DeTone, Tianwei Shen, Nan Yang, Chris Sweeney, Richard Newcombe

2024-06-14Multi-View 3D Reconstruction3D Reconstructionobject-detection3D Object DetectionObject Detection
PaperPDFCode(official)

Abstract

The advent of wearable computers enables a new source of context for AI that is embedded in egocentric sensor data. This new egocentric data comes equipped with fine-grained 3D location information and thus presents the opportunity for a novel class of spatial foundation models that are rooted in 3D space. To measure progress on what we term Egocentric Foundation Models (EFMs) we establish EFM3D, a benchmark with two core 3D egocentric perception tasks. EFM3D is the first benchmark for 3D object detection and surface regression on high quality annotated egocentric data of Project Aria. We propose Egocentric Voxel Lifting (EVL), a baseline for 3D EFMs. EVL leverages all available egocentric modalities and inherits foundational capabilities from 2D foundation models. This model, trained on a large simulated dataset, outperforms existing methods on the EFM3D benchmark.

Results

TaskDatasetMetricValueModel
3D ReconstructionAria Synthetic EnvironmentsAccuracy5.7EVL
3D ReconstructionAria Synthetic EnvironmentsCompleteness87.7EVL
3D ReconstructionAria Synthetic EnvironmentsPrecision82.2EVL
3D ReconstructionAria Synthetic EnvironmentsRecall10.6EVL
3D ReconstructionAria Digital Twin DatasetAccuracy18.2EVL
3D ReconstructionAria Digital Twin DatasetCompleteness3.105EVL
3D ReconstructionAria Digital Twin DatasetPrecision59.4EVL
Object DetectionAria Everyday ObjectsmAP22EVL
Object DetectionAria Everyday ObjectsmAP163DETR
Object DetectionAria Everyday ObjectsmAP15ImVoxelNet
Object DetectionAria Everyday ObjectsmAP8Cube R-CNN
Object DetectionAria Synthetic EnvironmentsMAP75EVL
Object DetectionAria Synthetic EnvironmentsMAP64ImVoxelNet
Object DetectionAria Synthetic EnvironmentsMAP36Cube R-CNN
Object DetectionAria Synthetic EnvironmentsMAP333DETR
3DAria Everyday ObjectsmAP22EVL
3DAria Everyday ObjectsmAP163DETR
3DAria Everyday ObjectsmAP15ImVoxelNet
3DAria Everyday ObjectsmAP8Cube R-CNN
3DAria Synthetic EnvironmentsMAP75EVL
3DAria Synthetic EnvironmentsMAP64ImVoxelNet
3DAria Synthetic EnvironmentsMAP36Cube R-CNN
3DAria Synthetic EnvironmentsMAP333DETR
3DAria Synthetic EnvironmentsAccuracy5.7EVL
3DAria Synthetic EnvironmentsCompleteness87.7EVL
3DAria Synthetic EnvironmentsPrecision82.2EVL
3DAria Synthetic EnvironmentsRecall10.6EVL
3DAria Digital Twin DatasetAccuracy18.2EVL
3DAria Digital Twin DatasetCompleteness3.105EVL
3DAria Digital Twin DatasetPrecision59.4EVL
3D Object DetectionAria Everyday ObjectsmAP22EVL
3D Object DetectionAria Everyday ObjectsmAP163DETR
3D Object DetectionAria Everyday ObjectsmAP15ImVoxelNet
3D Object DetectionAria Everyday ObjectsmAP8Cube R-CNN
3D Object DetectionAria Synthetic EnvironmentsMAP75EVL
3D Object DetectionAria Synthetic EnvironmentsMAP64ImVoxelNet
3D Object DetectionAria Synthetic EnvironmentsMAP36Cube R-CNN
3D Object DetectionAria Synthetic EnvironmentsMAP333DETR
2D ClassificationAria Everyday ObjectsmAP22EVL
2D ClassificationAria Everyday ObjectsmAP163DETR
2D ClassificationAria Everyday ObjectsmAP15ImVoxelNet
2D ClassificationAria Everyday ObjectsmAP8Cube R-CNN
2D ClassificationAria Synthetic EnvironmentsMAP75EVL
2D ClassificationAria Synthetic EnvironmentsMAP64ImVoxelNet
2D ClassificationAria Synthetic EnvironmentsMAP36Cube R-CNN
2D ClassificationAria Synthetic EnvironmentsMAP333DETR
2D Object DetectionAria Everyday ObjectsmAP22EVL
2D Object DetectionAria Everyday ObjectsmAP163DETR
2D Object DetectionAria Everyday ObjectsmAP15ImVoxelNet
2D Object DetectionAria Everyday ObjectsmAP8Cube R-CNN
2D Object DetectionAria Synthetic EnvironmentsMAP75EVL
2D Object DetectionAria Synthetic EnvironmentsMAP64ImVoxelNet
2D Object DetectionAria Synthetic EnvironmentsMAP36Cube R-CNN
2D Object DetectionAria Synthetic EnvironmentsMAP333DETR
16kAria Everyday ObjectsmAP22EVL
16kAria Everyday ObjectsmAP163DETR
16kAria Everyday ObjectsmAP15ImVoxelNet
16kAria Everyday ObjectsmAP8Cube R-CNN
16kAria Synthetic EnvironmentsMAP75EVL
16kAria Synthetic EnvironmentsMAP64ImVoxelNet
16kAria Synthetic EnvironmentsMAP36Cube R-CNN
16kAria Synthetic EnvironmentsMAP333DETR

Related Papers

AutoPartGen: Autogressive 3D Part Generation and Discovery2025-07-17A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17SpatialTrackerV2: 3D Point Tracking Made Easy2025-07-16BRUM: Robust 3D Vehicle Reconstruction from 360 Sparse Images2025-07-16Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16