TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/PEg TRAnsfer Workflow recognition challenge report: Does m...

PEg TRAnsfer Workflow recognition challenge report: Does multi-modal data improve recognition?

Arnaud Huaulmé, Kanako Harada, Quang-Minh Nguyen, Bogyu Park, Seungbum Hong, Min-Kook Choi, Michael Peven, Yunshuang Li, Yonghao Long, Qi Dou, Satyadwyoom Kumar, Seenivasan Lalithkumar, Ren Hongliang, Hiroki Matsuzaki, Yuto Ishikawa, Yuriko Harai, Satoshi Kondo, Mamoru Mitsuishi, Pierre Jannin

2022-02-11Video Based Workflow RecognitionSemantic SegmentationSegmentation Based Workflow RecognitionVideo & Kinematic Base Workflow RecognitionKinematic Based Workflow RecognitionVideo, Kinematic & Segmentation Base Workflow Recognition
PaperPDF

Abstract

This paper presents the design and results of the "PEg TRAnsfert Workflow recognition" (PETRAW) challenge whose objective was to develop surgical workflow recognition methods based on one or several modalities, among video, kinematic, and segmentation data, in order to study their added value. The PETRAW challenge provided a data set of 150 peg transfer sequences performed on a virtual simulator. This data set was composed of videos, kinematics, semantic segmentation, and workflow annotations which described the sequences at three different granularity levels: phase, step, and activity. Five tasks were proposed to the participants: three of them were related to the recognition of all granularities with one of the available modalities, while the others addressed the recognition with a combination of modalities. Average application-dependent balanced accuracy (AD-Accuracy) was used as evaluation metric to take unbalanced classes into account and because it is more clinically relevant than a frame-by-frame score. Seven teams participated in at least one task and four of them in all tasks. Best results are obtained with the use of the video and the kinematics data with an AD-Accuracy between 93% and 90% for the four teams who participated in all tasks. The improvement between video/kinematic-based methods and the uni-modality ones was significant for all of the teams. However, the difference in testing execution time between the video/kinematic-based and the kinematic-based methods has to be taken into consideration. Is it relevant to spend 20 to 200 times more computing time for less than 3% of improvement? The PETRAW data set is publicly available at www.synapse.org/PETRAW to encourage further research in surgical workflow recognition.

Results

TaskDatasetMetricValueModel
Video & Kinematic Base Workflow RecognitionPETRAWAverage AD-Accuracy93.09NCC Next
Video & Kinematic Base Workflow RecognitionPETRAWAverage AD-Accuracy91.61SK
Video & Kinematic Base Workflow RecognitionPETRAWAverage AD-Accuracy91.33Hutom
Video & Kinematic Base Workflow RecognitionPETRAWAverage AD-Accuracy90.18MediCIS
Video & Kinematic Base Workflow RecognitionPETRAWAverage AD-Accuracy86.98MedAIR
Video & Kinematic Base Workflow RecognitionPETRAWAverage AD-Accuracy84.8MMLAB
Semantic SegmentationPETRAWMean IoU (class)96.9NCC Next
Semantic SegmentationPETRAWMean IoU (class)96.4SK
Semantic SegmentationPETRAWMean IoU (class)94MediCIS
Semantic SegmentationPETRAWMean IoU (class)85Hutom
Video, Kinematic & Segmentation Base Workflow RecognitionPETRAWAverage AD-Accuracy93.09NCC Next
Video, Kinematic & Segmentation Base Workflow RecognitionPETRAWAverage AD-Accuracy91.37SK
Video, Kinematic & Segmentation Base Workflow RecognitionPETRAWAverage AD-Accuracy91.27Hutom
Video, Kinematic & Segmentation Base Workflow RecognitionPETRAWAverage AD-Accuracy89.81MediCIS Task 5
Video Based Workflow RecognitionPETRAWAverage AD-Accuracy90.77SK
Video Based Workflow RecognitionPETRAWAverage AD-Accuracy90.51Hutom
Video Based Workflow RecognitionPETRAWAverage AD-Accuracy89.15MediCIS
Video Based Workflow RecognitionPETRAWAverage AD-Accuracy87.77NCC Next
Video Based Workflow RecognitionPETRAWAverage AD-Accuracy84.31MedAIR
Kinematic Based Workflow RecognitionPETRAWAverage AD-Accuracy90.72MedAIR
Kinematic Based Workflow RecognitionPETRAWAverage AD-Accuracy90.32NCC Next
Kinematic Based Workflow RecognitionPETRAWAverage AD-Accuracy89.71MediCIS
Kinematic Based Workflow RecognitionPETRAWAverage AD-Accuracy89.66SK
Kinematic Based Workflow RecognitionPETRAWAverage AD-Accuracy86.45JHU-CIRL
Kinematic Based Workflow RecognitionPETRAWAverage AD-Accuracy84.31Hutom
Segmentation Based Workflow RecognitionPETRAWAverage AD-Accuracy88.51SK
Segmentation Based Workflow RecognitionPETRAWAverage AD-Accuracy87.71NCC Next
Segmentation Based Workflow RecognitionPETRAWAverage AD-Accuracy87.22MediCIS
Segmentation Based Workflow RecognitionPETRAWAverage AD-Accuracy60.28Hutom
10-shot image generationPETRAWMean IoU (class)96.9NCC Next
10-shot image generationPETRAWMean IoU (class)96.4SK
10-shot image generationPETRAWMean IoU (class)94MediCIS
10-shot image generationPETRAWMean IoU (class)85Hutom

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17SAMST: A Transformer framework based on SAM pseudo label filtering for remote sensing semi-supervised semantic segmentation2025-07-16Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping2025-07-15U-RWKV: Lightweight medical image segmentation with direction-adaptive RWKV2025-07-15