TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Dissecting Self-Supervised Learning Methods for Surgical C...

Dissecting Self-Supervised Learning Methods for Surgical Computer Vision

Sanat Ramesh, Vinkle Srivastav, Deepak Alapatt, Tong Yu, Aditya Murali, Luca Sestini, Chinedu Innocent Nwoye, Idris Hamoud, Saurav Sharma, Antoine Fleurentin, Georgios Exarchakis, Alexandros Karargyris, Nicolas Padoy

2022-07-01Action Triplet RecognitionSelf-Supervised LearningSurgical phase recognitionSemantic SegmentationSurgical tool detection
PaperPDFCode(official)

Abstract

The field of surgical computer vision has undergone considerable breakthroughs in recent years with the rising popularity of deep neural network-based methods. However, standard fully-supervised approaches for training such models require vast amounts of annotated data, imposing a prohibitively high cost; especially in the clinical domain. Self-Supervised Learning (SSL) methods, which have begun to gain traction in the general computer vision community, represent a potential solution to these annotation costs, allowing to learn useful representations from only unlabeled data. Still, the effectiveness of SSL methods in more complex and impactful domains, such as medicine and surgery, remains limited and unexplored. In this work, we address this critical need by investigating four state-of-the-art SSL methods (MoCo v2, SimCLR, DINO, SwAV) in the context of surgical computer vision. We present an extensive analysis of the performance of these methods on the Cholec80 dataset for two fundamental and popular tasks in surgical context understanding, phase recognition and tool presence detection. We examine their parameterization, then their behavior with respect to training data quantities in semi-supervised settings. Correct transfer of these methods to surgery, as described and conducted in this work, leads to substantial performance gains over generic uses of SSL - up to 7.4% on phase recognition and 20% on tool presence detection - as well as state-of-the-art semi-supervised phase recognition approaches by up to 14%. Further results obtained on a highly diverse selection of surgical datasets exhibit strong generalization properties. The code is available at https://github.com/CAMMA-public/SelfSupSurg.

Results

TaskDatasetMetricValueModel
Activity RecognitionCholecT50 (Challenge)mAP35.7MoCo V2 Surg SSL - Rendezvous head
Semantic SegmentationEndoscapesMean F173.2MoCo V2 Surg SSL - DeepLabv3+ head
Object DetectionHeiChole BenchmarkmAP66.9MoCo V2 Surg SSL - FCN head
Object DetectionCholec80mAP93.5MoCo V2 Surg SSL - FCN head
3DHeiChole BenchmarkmAP66.9MoCo V2 Surg SSL - FCN head
3DCholec80mAP93.5MoCo V2 Surg SSL - FCN head
Action RecognitionCholecT50 (Challenge)mAP35.7MoCo V2 Surg SSL - Rendezvous head
2D ClassificationHeiChole BenchmarkmAP66.9MoCo V2 Surg SSL - FCN head
2D ClassificationCholec80mAP93.5MoCo V2 Surg SSL - FCN head
2D Object DetectionHeiChole BenchmarkmAP66.9MoCo V2 Surg SSL - FCN head
2D Object DetectionCholec80mAP93.5MoCo V2 Surg SSL - FCN head
Surgical phase recognitionHeiChole BenchmarkF164.7MoCo V2 Surg SSL - TCN head
Surgical phase recognitionCholec80F181.6MoCo V2 Surg SSL - TCN head
10-shot image generationEndoscapesMean F173.2MoCo V2 Surg SSL - DeepLabv3+ head
16kHeiChole BenchmarkmAP66.9MoCo V2 Surg SSL - FCN head
16kCholec80mAP93.5MoCo V2 Surg SSL - FCN head

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21A Semi-Supervised Learning Method for the Identification of Bad Exposures in Large Imaging Surveys2025-07-17DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17SAMST: A Transformer framework based on SAM pseudo label filtering for remote sensing semi-supervised semantic segmentation2025-07-16Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping2025-07-15