


Automatic Gesture Recognition in Robot-assisted Surgery with Reinforcement Learning and Tree Search

Xiaojie Gao, Yueming Jin, Qi Dou, Pheng-Ann Heng

2020-02-20 | Action Segmentation | Reinforcement Learning | Surgical Gesture Recognition | Gesture Recognition

Abstract

Automatic surgical gesture recognition is fundamental for improving intelligence in robot-assisted surgery, for example in complex tasks such as surgical surveillance and skill evaluation. However, current methods treat each frame individually and produce their outputs without effective consideration of future information. In this paper, we propose a framework based on reinforcement learning and tree search for joint surgical gesture segmentation and classification. An agent is trained to segment and classify the surgical video in a human-like manner, and its direct decisions are reconsidered by tree search where appropriate. Our proposed tree search algorithm unites the outputs of two designed neural networks, i.e., a policy network and a value network. By integrating complementary information from distinct models, our framework achieves better performance than baseline methods that use either neural network alone. In an overall evaluation, our approach consistently outperforms existing methods on the suturing task of the JIGSAWS dataset in terms of accuracy, edit score and F1 score. Our study highlights the use of tree search to refine actions in a reinforcement learning framework for surgical robotic applications.
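The abstract does not give the details of the tree search, so the following is only a minimal sketch of the general idea it names: rescoring a policy's candidate actions with a value estimate obtained by a short lookahead. Everything here is an assumption made for illustration, including the function names (`refine_action`, `lookahead`, `top_actions`), the `policy`, `value` and `step` callables, and the linear `mix` weighting. It is not the authors' algorithm.

```python
from typing import Callable, List, Sequence, Tuple

State = Tuple[int, ...]  # hypothetical state: the sequence of actions taken so far


def top_actions(probs: Sequence[float], k: int) -> List[int]:
    """Indices of the k most probable actions under the policy."""
    return sorted(range(len(probs)), key=lambda a: probs[a], reverse=True)[:k]


def lookahead(state: State, policy: Callable, value: Callable, step: Callable,
              depth: int, top_k: int, mix: float) -> float:
    """Depth-limited search score: value estimate at the leaves, best mixed score above."""
    if depth == 0:
        return value(state)
    probs = policy(state)
    return max(
        mix * probs[a]
        + (1.0 - mix) * lookahead(step(state, a), policy, value, step, depth - 1, top_k, mix)
        for a in top_actions(probs, top_k)
    )


def refine_action(state: State, policy: Callable, value: Callable, step: Callable,
                  depth: int = 2, top_k: int = 2, mix: float = 0.5) -> int:
    """Pick the action whose policy prior plus looked-ahead value is highest (depth >= 1)."""
    probs = policy(state)
    scores = {
        a: mix * probs[a]
        + (1.0 - mix) * lookahead(step(state, a), policy, value, step, depth - 1, top_k, mix)
        for a in top_actions(probs, top_k)
    }
    return max(scores, key=scores.get)


# Toy stand-ins for the two networks and the environment (all hypothetical).
toy_policy = lambda s: [0.6, 0.4]                        # always prefers action 0
toy_value = lambda s: 1.0 if s and s[0] == 1 else 0.0    # rewards having started with action 1
toy_step = lambda s, a: s + (a,)

greedy = max(range(2), key=lambda a: toy_policy(())[a])
refined = refine_action((), toy_policy, toy_value, toy_step, depth=2, top_k=2, mix=0.5)
```

In this toy setup the greedy policy choice and the refined choice differ, which is the kind of correction the abstract attributes to tree search. Setting `mix=1.0` ignores the value estimate and reduces the procedure to the policy's own top choice.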

Results

Task | Dataset | Metric | Value | Model
Action Localization | JIGSAWS | Accuracy | 81.67 | RL+Tree
Action Localization | JIGSAWS | Edit Distance | 88.53 | RL+Tree
Action Localization | JIGSAWS | F1@10 | 92.68 | RL+Tree
Action Localization | JIGSAWS | F1@25 | 90.99 | RL+Tree
Action Localization | JIGSAWS | F1@50 | 83.15 | RL+Tree
Action Segmentation | JIGSAWS | Accuracy | 81.67 | RL+Tree
Action Segmentation | JIGSAWS | Edit Distance | 88.53 | RL+Tree
Action Segmentation | JIGSAWS | F1@10 | 92.68 | RL+Tree
Action Segmentation | JIGSAWS | F1@25 | 90.99 | RL+Tree
Action Segmentation | JIGSAWS | F1@50 | 83.15 | RL+Tree
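
The page does not define the metrics above. As a rough guide, the sketch below follows the definitions commonly used in action segmentation work: frame-wise accuracy, a segmental edit score based on normalized Levenshtein distance over segment labels, and a segmental F1 score at an IoU overlap threshold (F1@10 is read here as a 10% threshold). Whether the paper computes them in exactly this way is an assumption, and all function names are illustrative.

```python
from typing import List, Sequence, Tuple


def segments(labels: Sequence[int]) -> List[Tuple[int, int, int]]:
    """Collapse frame-wise labels into (label, start, end) runs; end is exclusive."""
    segs, start = [], 0
    for i in range(1, len(labels) + 1):
        if i == len(labels) or labels[i] != labels[start]:
            segs.append((labels[start], start, i))
            start = i
    return segs


def accuracy(pred: Sequence[int], gt: Sequence[int]) -> float:
    """Percentage of frames whose predicted label matches the ground truth."""
    return 100.0 * sum(p == g for p, g in zip(pred, gt)) / len(gt)


def edit_score(pred: Sequence[int], gt: Sequence[int]) -> float:
    """100 * (1 - normalized Levenshtein distance) between segment label sequences."""
    p = [s[0] for s in segments(pred)]
    g = [s[0] for s in segments(gt)]
    m, n = len(p), len(g)
    if max(m, n) == 0:
        return 100.0
    d = [[0] * (n + 1) for _ in range(m + 1)]
    for i in range(m + 1):
        d[i][0] = i
    for j in range(n + 1):
        d[0][j] = j
    for i in range(1, m + 1):
        for j in range(1, n + 1):
            cost = 0 if p[i - 1] == g[j - 1] else 1
            d[i][j] = min(d[i - 1][j] + 1, d[i][j - 1] + 1, d[i - 1][j - 1] + cost)
    return 100.0 * (1.0 - d[m][n] / max(m, n))


def f1_at_k(pred: Sequence[int], gt: Sequence[int], overlap: float) -> float:
    """Segmental F1: a predicted segment is a true positive if its IoU with an
    unmatched ground-truth segment of the same label reaches the threshold."""
    p_segs, g_segs = segments(pred), segments(gt)
    used = [False] * len(g_segs)
    tp = fp = 0
    for lab, ps, pe in p_segs:
        best_iou, best_j = 0.0, -1
        for j, (glab, gs, ge) in enumerate(g_segs):
            if glab != lab:
                continue
            inter = max(0, min(pe, ge) - max(ps, gs))
            union = max(pe, ge) - min(ps, gs)
            iou = inter / union
            if iou > best_iou:
                best_iou, best_j = iou, j
        if best_j >= 0 and best_iou >= overlap and not used[best_j]:
            tp += 1
            used[best_j] = True
        else:
            fp += 1
    fn = used.count(False)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return 100.0 * 2 * precision * recall / (precision + recall) if precision + recall else 0.0


# Toy example: one mislabeled frame causes over-segmentation.
gt = [0, 0, 0, 0, 1, 1, 1, 1, 2, 2]
pred = [0, 0, 1, 0, 1, 1, 1, 1, 2, 2]
scores = (accuracy(pred, gt), edit_score(pred, gt), f1_at_k(pred, gt, 0.5))
```

In the toy example a single wrong frame leaves frame-wise accuracy high but splits one gesture into three segments, which lowers the edit score and the segmental F1 much more. This is why segmentation results are usually reported with all three kinds of metric.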

Related Papers

Efficient Deployment of Spiking Neural Networks on SpiNNaker2 for DVS Gesture Recognition Using Neuromorphic Intermediate Representation (2025-09-04)
CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning (2025-07-18)
VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning (2025-07-17)
Spectral Bellman Method: Unifying Representation and Exploration in RL (2025-07-17)
Aligning Humans and Robots via Reinforcement Learning from Implicit Human Feedback (2025-07-17)
VAR-MATH: Probing True Mathematical Reasoning in Large Language Models via Symbolic Multi-Instance Benchmarks (2025-07-17)
QuestA: Expanding Reasoning Capacity in LLMs via Question Augmentation (2025-07-17)
Inverse Reinforcement Learning Meets Large Language Model Post-Training: Basics, Advances, and Opportunities (2025-07-17)