TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/EndoNet: A Deep Architecture for Recognition Tasks on Lapa...

EndoNet: A Deep Architecture for Recognition Tasks on Laparoscopic Videos

Andru P. Twinanda, Sherif Shehata, Didier Mutter, Jacques Marescaux, Michel de Mathelin, Nicolas Padoy

2016-02-09Online surgical phase recognitionSchedulingSurgical tool detectionOffline surgical phase recognition
PaperPDFCodeCodeCodeCodeCodeCodeCodeCodeCode

Abstract

Surgical workflow recognition has numerous potential medical applications, such as the automatic indexing of surgical video databases and the optimization of real-time operating room scheduling, among others. As a result, phase recognition has been studied in the context of several kinds of surgeries, such as cataract, neurological, and laparoscopic surgeries. In the literature, two types of features are typically used to perform this task: visual features and tool usage signals. However, the visual features used are mostly handcrafted. Furthermore, the tool usage signals are usually collected via a manual annotation process or by using additional equipment. In this paper, we propose a novel method for phase recognition that uses a convolutional neural network (CNN) to automatically learn features from cholecystectomy videos and that relies uniquely on visual information. In previous studies, it has been shown that the tool signals can provide valuable information in performing the phase recognition task. Thus, we present a novel CNN architecture, called EndoNet, that is designed to carry out the phase recognition and tool presence detection tasks in a multi-task manner. To the best of our knowledge, this is the first work proposing to use a CNN for multiple recognition tasks on laparoscopic videos. Extensive experimental comparisons to other methods show that EndoNet yields state-of-the-art results for both tasks.

Results

TaskDatasetMetricValueModel
Object DetectionCholec80mAP81EndoNet
Object DetectionCholec80mAP80.9ToolNet
3DCholec80mAP81EndoNet
3DCholec80mAP80.9ToolNet
2D ClassificationCholec80mAP81EndoNet
2D ClassificationCholec80mAP80.9ToolNet
2D Object DetectionCholec80mAP81EndoNet
2D Object DetectionCholec80mAP80.9ToolNet
16kCholec80mAP81EndoNet
16kCholec80mAP80.9ToolNet

Related Papers

An End-to-End DNN Inference Framework for the SpiNNaker2 Neuromorphic MPSoC2025-07-18Fremer: Lightweight and Effective Frequency Transformer for Workload Forecasting in Cloud Services2025-07-17Transient-Stability-Aware Frequency Provision in IBR-Rich Grids via Information Gap Decision Theory and Deep Learning2025-07-17Leveraging Asynchronous Cross-border Market Data for Improved Day-Ahead Electricity Price Forecasting in European Markets2025-07-17FormulaOne: Measuring the Depth of Algorithmic Reasoning Beyond Competitive Programming2025-07-17Autonomous Resource Management in Microservice Systems via Reinforcement Learning2025-07-17SystolicAttention: Fusing FlashAttention within a Single Systolic Array2025-07-15Optimal Sensor Scheduling and Selection for Continuous-Discrete Kalman Filtering with Auxiliary Dynamics2025-07-15