TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Deep Reinforcement Learning for Surgical Gesture Segmentat...

Deep Reinforcement Learning for Surgical Gesture Segmentation and Classification

Daochang Liu, Tingting Jiang

2018-06-21Action SegmentationSequential Decision MakingReinforcement LearningSegmentationDecision MakingSurgical Gesture RecognitionGeneral ClassificationClassificationreinforcement-learning
PaperPDFCode

Abstract

Recognition of surgical gesture is crucial for surgical skill assessment and efficient surgery training. Prior works on this task are based on either variant graphical models such as HMMs and CRFs, or deep learning models such as Recurrent Neural Networks and Temporal Convolutional Networks. Most of the current approaches usually suffer from over-segmentation and therefore low segment-level edit scores. In contrast, we present an essentially different methodology by modeling the task as a sequential decision-making process. An intelligent agent is trained using reinforcement learning with hierarchical features from a deep model. Temporal consistency is integrated into our action design and reward mechanism to reduce over-segmentation errors. Experiments on JIGSAWS dataset demonstrate that the proposed method performs better than state-of-the-art methods in terms of the edit score and on par in frame-wise accuracy. Our code will be released later.

Results

TaskDatasetMetricValueModel
Action LocalizationJIGSAWSAccuracy81.43RL (full)
Action LocalizationJIGSAWSEdit Distance87.96RL (full)
Action LocalizationJIGSAWSF1@1092RL (full)
Action LocalizationJIGSAWSF1@2590.5RL (full)
Action LocalizationJIGSAWSF1@5082.2RL (full)
Action SegmentationJIGSAWSAccuracy81.43RL (full)
Action SegmentationJIGSAWSEdit Distance87.96RL (full)
Action SegmentationJIGSAWSF1@1092RL (full)
Action SegmentationJIGSAWSF1@2590.5RL (full)
Action SegmentationJIGSAWSF1@5082.2RL (full)

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning2025-07-18Graph-Structured Data Analysis of Component Failure in Autonomous Cargo Ships Based on Feature Fusion2025-07-18VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning2025-07-17Spectral Bellman Method: Unifying Representation and Exploration in RL2025-07-17Aligning Humans and Robots via Reinforcement Learning from Implicit Human Feedback2025-07-17VAR-MATH: Probing True Mathematical Reasoning in Large Language Models via Symbolic Multi-Instance Benchmarks2025-07-17QuestA: Expanding Reasoning Capacity in LLMs via Question Augmentation2025-07-17