TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/IndustReal: A Dataset for Procedure Step Recognition Handl...

IndustReal: A Dataset for Procedure Step Recognition Handling Execution Errors in Egocentric Videos in an Industrial-Like Setting

Tim J. Schoonbeek, Tim Houben, Hans Onvlee, Peter H. N. de With, Fons van der Sommen

2023-10-26Procedure Step RecognitionAction RecognitionObject Detection
PaperPDFCode(official)

Abstract

Although action recognition for procedural tasks has received notable attention, it has a fundamental flaw in that no measure of success for actions is provided. This limits the applicability of such systems especially within the industrial domain, since the outcome of procedural actions is often significantly more important than the mere execution. To address this limitation, we define the novel task of procedure step recognition (PSR), focusing on recognizing the correct completion and order of procedural steps. Alongside the new task, we also present the multi-modal IndustReal dataset. Unlike currently available datasets, IndustReal contains procedural errors (such as omissions) as well as execution errors. A significant part of these errors are exclusively present in the validation and test sets, making IndustReal suitable to evaluate robustness of algorithms to new, unseen mistakes. Additionally, to encourage reproducibility and allow for scalable approaches trained on synthetic data, the 3D models of all parts are publicly available. Annotations and benchmark performance are provided for action recognition and assembly state detection, as well as the new PSR task. IndustReal, along with the code and model weights, is available at: https://github.com/TimSchoonbeek/IndustReal .

Results

TaskDatasetMetricValueModel
Activity RecognitionIndustRealTop-165.25MViT-V2
Activity RecognitionIndustRealTop-587.93MViT-V2
Object DetectionIndustRealmAP64.1YoloV8
Object DetectionIndustRealmAP57.5YoloV8 (synthetic data only)
3DIndustRealmAP64.1YoloV8
3DIndustRealmAP57.5YoloV8 (synthetic data only)
Action RecognitionIndustRealTop-165.25MViT-V2
Action RecognitionIndustRealTop-587.93MViT-V2
2D ClassificationIndustRealmAP64.1YoloV8
2D ClassificationIndustRealmAP57.5YoloV8 (synthetic data only)
2D Object DetectionIndustRealmAP64.1YoloV8
2D Object DetectionIndustRealmAP57.5YoloV8 (synthetic data only)
Procedure Step RecognitionIndustRealDelay (seconds)22.4B3
Procedure Step RecognitionIndustRealF10.883B3
Procedure Step RecognitionIndustRealPOS0.797B3
Procedure Step RecognitionIndustRealDelay (seconds)49.5B3 - Synthetic Only
Procedure Step RecognitionIndustRealF10.597B3 - Synthetic Only
Procedure Step RecognitionIndustRealPOS0.734B3 - Synthetic Only
16kIndustRealmAP64.1YoloV8
16kIndustRealmAP57.5YoloV8 (synthetic data only)

Related Papers

A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping2025-07-15ECORE: Energy-Conscious Optimized Routing for Deep Learning Models at the Edge2025-07-08Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations2025-07-07