PEg TRAnsfer Workflow recognition challenge report: Does multi-modal data improve recognition?

Arnaud Huaulmé, Kanako Harada, Quang-Minh Nguyen, Bogyu Park, Seungbum Hong, Min-Kook Choi, Michael Peven, Yunshuang Li, Yonghao Long, Qi Dou, Satyadwyoom Kumar, Seenivasan Lalithkumar, Ren Hongliang, Hiroki Matsuzaki, Yuto Ishikawa, Yuriko Harai, Satoshi Kondo, Mamoru Mitsuishi, Pierre Jannin

2022-02-11Video Based Workflow Recognition Semantic Segmentation Segmentation Based Workflow Recognition Video & Kinematic Base Workflow Recognition Kinematic Based Workflow Recognition Video, Kinematic & Segmentation Base Workflow Recognition

Paper PDF

Abstract

This paper presents the design and results of the "PEg TRAnsfert Workflow recognition" (PETRAW) challenge whose objective was to develop surgical workflow recognition methods based on one or several modalities, among video, kinematic, and segmentation data, in order to study their added value. The PETRAW challenge provided a data set of 150 peg transfer sequences performed on a virtual simulator. This data set was composed of videos, kinematics, semantic segmentation, and workflow annotations which described the sequences at three different granularity levels: phase, step, and activity. Five tasks were proposed to the participants: three of them were related to the recognition of all granularities with one of the available modalities, while the others addressed the recognition with a combination of modalities. Average application-dependent balanced accuracy (AD-Accuracy) was used as evaluation metric to take unbalanced classes into account and because it is more clinically relevant than a frame-by-frame score. Seven teams participated in at least one task and four of them in all tasks. Best results are obtained with the use of the video and the kinematics data with an AD-Accuracy between 93% and 90% for the four teams who participated in all tasks. The improvement between video/kinematic-based methods and the uni-modality ones was significant for all of the teams. However, the difference in testing execution time between the video/kinematic-based and the kinematic-based methods has to be taken into consideration. Is it relevant to spend 20 to 200 times more computing time for less than 3% of improvement? The PETRAW data set is publicly available at www.synapse.org/PETRAW to encourage further research in surgical workflow recognition.

Results

Task	Dataset	Metric	Value	Model
Video & Kinematic Base Workflow Recognition	PETRAW	Average AD-Accuracy	93.09	NCC Next
Video & Kinematic Base Workflow Recognition	PETRAW	Average AD-Accuracy	91.61	SK
Video & Kinematic Base Workflow Recognition	PETRAW	Average AD-Accuracy	91.33	Hutom
Video & Kinematic Base Workflow Recognition	PETRAW	Average AD-Accuracy	90.18	MediCIS
Video & Kinematic Base Workflow Recognition	PETRAW	Average AD-Accuracy	86.98	MedAIR
Video & Kinematic Base Workflow Recognition	PETRAW	Average AD-Accuracy	84.8	MMLAB
Semantic Segmentation	PETRAW	Mean IoU (class)	96.9	NCC Next
Semantic Segmentation	PETRAW	Mean IoU (class)	96.4	SK
Semantic Segmentation	PETRAW	Mean IoU (class)	94	MediCIS
Semantic Segmentation	PETRAW	Mean IoU (class)	85	Hutom
Video, Kinematic & Segmentation Base Workflow Recognition	PETRAW	Average AD-Accuracy	93.09	NCC Next
Video, Kinematic & Segmentation Base Workflow Recognition	PETRAW	Average AD-Accuracy	91.37	SK
Video, Kinematic & Segmentation Base Workflow Recognition	PETRAW	Average AD-Accuracy	91.27	Hutom
Video, Kinematic & Segmentation Base Workflow Recognition	PETRAW	Average AD-Accuracy	89.81	MediCIS Task 5
Video Based Workflow Recognition	PETRAW	Average AD-Accuracy	90.77	SK
Video Based Workflow Recognition	PETRAW	Average AD-Accuracy	90.51	Hutom
Video Based Workflow Recognition	PETRAW	Average AD-Accuracy	89.15	MediCIS
Video Based Workflow Recognition	PETRAW	Average AD-Accuracy	87.77	NCC Next
Video Based Workflow Recognition	PETRAW	Average AD-Accuracy	84.31	MedAIR
Kinematic Based Workflow Recognition	PETRAW	Average AD-Accuracy	90.72	MedAIR
Kinematic Based Workflow Recognition	PETRAW	Average AD-Accuracy	90.32	NCC Next
Kinematic Based Workflow Recognition	PETRAW	Average AD-Accuracy	89.71	MediCIS
Kinematic Based Workflow Recognition	PETRAW	Average AD-Accuracy	89.66	SK
Kinematic Based Workflow Recognition	PETRAW	Average AD-Accuracy	86.45	JHU-CIRL
Kinematic Based Workflow Recognition	PETRAW	Average AD-Accuracy	84.31	Hutom
Segmentation Based Workflow Recognition	PETRAW	Average AD-Accuracy	88.51	SK
Segmentation Based Workflow Recognition	PETRAW	Average AD-Accuracy	87.71	NCC Next
Segmentation Based Workflow Recognition	PETRAW	Average AD-Accuracy	87.22	MediCIS
Segmentation Based Workflow Recognition	PETRAW	Average AD-Accuracy	60.28	Hutom
10-shot image generation	PETRAW	Mean IoU (class)	96.9	NCC Next
10-shot image generation	PETRAW	Mean IoU (class)	96.4	SK
10-shot image generation	PETRAW	Mean IoU (class)	94	MediCIS
10-shot image generation	PETRAW	Mean IoU (class)	85	Hutom

Abstract

Results

Task	Dataset	Metric	Value	Model
Video & Kinematic Base Workflow Recognition	PETRAW	Average AD-Accuracy	93.09	NCC Next
Video & Kinematic Base Workflow Recognition	PETRAW	Average AD-Accuracy	91.61	SK
Video & Kinematic Base Workflow Recognition	PETRAW	Average AD-Accuracy	91.33	Hutom
Video & Kinematic Base Workflow Recognition	PETRAW	Average AD-Accuracy	90.18	MediCIS
Video & Kinematic Base Workflow Recognition	PETRAW	Average AD-Accuracy	86.98	MedAIR
Video & Kinematic Base Workflow Recognition	PETRAW	Average AD-Accuracy	84.8	MMLAB
Semantic Segmentation	PETRAW	Mean IoU (class)	96.9	NCC Next
Semantic Segmentation	PETRAW	Mean IoU (class)	96.4	SK
Semantic Segmentation	PETRAW	Mean IoU (class)	94	MediCIS
Semantic Segmentation	PETRAW	Mean IoU (class)	85	Hutom
Video, Kinematic & Segmentation Base Workflow Recognition	PETRAW	Average AD-Accuracy	93.09	NCC Next
Video, Kinematic & Segmentation Base Workflow Recognition	PETRAW	Average AD-Accuracy	91.37	SK
Video, Kinematic & Segmentation Base Workflow Recognition	PETRAW	Average AD-Accuracy	91.27	Hutom
Video, Kinematic & Segmentation Base Workflow Recognition	PETRAW	Average AD-Accuracy	89.81	MediCIS Task 5
Video Based Workflow Recognition	PETRAW	Average AD-Accuracy	90.77	SK
Video Based Workflow Recognition	PETRAW	Average AD-Accuracy	90.51	Hutom
Video Based Workflow Recognition	PETRAW	Average AD-Accuracy	89.15	MediCIS
Video Based Workflow Recognition	PETRAW	Average AD-Accuracy	87.77	NCC Next
Video Based Workflow Recognition	PETRAW	Average AD-Accuracy	84.31	MedAIR
Kinematic Based Workflow Recognition	PETRAW	Average AD-Accuracy	90.72	MedAIR
Kinematic Based Workflow Recognition	PETRAW	Average AD-Accuracy	90.32	NCC Next
Kinematic Based Workflow Recognition	PETRAW	Average AD-Accuracy	89.71	MediCIS
Kinematic Based Workflow Recognition	PETRAW	Average AD-Accuracy	89.66	SK
Kinematic Based Workflow Recognition	PETRAW	Average AD-Accuracy	86.45	JHU-CIRL
Kinematic Based Workflow Recognition	PETRAW	Average AD-Accuracy	84.31	Hutom
Segmentation Based Workflow Recognition	PETRAW	Average AD-Accuracy	88.51	SK
Segmentation Based Workflow Recognition	PETRAW	Average AD-Accuracy	87.71	NCC Next
Segmentation Based Workflow Recognition	PETRAW	Average AD-Accuracy	87.22	MediCIS
Segmentation Based Workflow Recognition	PETRAW	Average AD-Accuracy	60.28	Hutom
10-shot image generation	PETRAW	Mean IoU (class)	96.9	NCC Next
10-shot image generation	PETRAW	Mean IoU (class)	96.4	SK
10-shot image generation	PETRAW	Mean IoU (class)	94	MediCIS
10-shot image generation	PETRAW	Mean IoU (class)	85	Hutom

PEg TRAnsfer Workflow recognition challenge report: Does multi-modal data improve recognition?

Abstract

Results

Related Papers

PEg TRAnsfer Workflow recognition challenge report: Does multi-modal data improve recognition?

Abstract

Results

Related Papers