TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/YOLOv8-AM: YOLOv8 Based on Effective Attention Mechanisms ...

YOLOv8-AM: YOLOv8 Based on Effective Attention Mechanisms for Pediatric Wrist Fracture Detection

Chun-Tse Chien, Rui-Yang Ju, Kuang-Yi Chou, Enkaer Xieerke, Jen-Shiun Chiang

2024-02-14Medical Object DetectionFracture detectionmedical image detectionOpen Vocabulary Object DetectionObject Detection
PaperPDFCode(official)

Abstract

Wrist trauma and even fractures occur frequently in daily life, particularly among children who account for a significant proportion of fracture cases. Before performing surgery, surgeons often request patients to undergo X-ray imaging first and prepare for it based on the analysis of the radiologist. With the development of neural networks, You Only Look Once (YOLO) series models have been widely used in fracture detection as computer-assisted diagnosis (CAD). In 2023, Ultralytics presented the latest version of the YOLO models, which has been employed for detecting fractures across various parts of the body. Attention mechanism is one of the hottest methods to improve the model performance. This research work proposes YOLOv8-AM, which incorporates the attention mechanism into the original YOLOv8 architecture. Specifically, we respectively employ four attention modules, Convolutional Block Attention Module (CBAM), Global Attention Mechanism (GAM), Efficient Channel Attention (ECA), and Shuffle Attention (SA), to design the improved models and train them on GRAZPEDWRI-DX dataset. Experimental results demonstrate that the mean Average Precision at IoU 50 (mAP 50) of the YOLOv8-AM model based on ResBlock + CBAM (ResCBAM) increased from 63.6% to 65.8%, which achieves the state-of-the-art (SOTA) performance. Conversely, YOLOv8-AM model incorporating GAM obtains the mAP 50 value of 64.2%, which is not a satisfactory enhancement. Therefore, we combine ResBlock and GAM, introducing ResGAM to design another new YOLOv8-AM model, whose mAP 50 value is increased to 65.0%. The implementation code for this study is available on GitHub at https://github.com/RuiyangJu/Fracture_Detection_Improved_YOLOv8.

Results

TaskDatasetMetricValueModel
Object DetectionMSCOCOAP 0.50.5Yolov8
Object DetectionGRAZPEDWRI-DXAP5065.8YOLOv8+ResCBAM
Object DetectionGRAZPEDWRI-DXF1-score0.64YOLOv8+ResCBAM
Object DetectionGRAZPEDWRI-DXAP5065.8YOLOv8+ResCBAM
Object DetectionGRAZPEDWRI-DXF1-score0.64YOLOv8+ResCBAM
Object DetectionGRAZPEDWRI-DXAP5065YOLOv8+ResGAM
Object DetectionGRAZPEDWRI-DXF1-score0.64YOLOv8+ResGAM
Object DetectionGRAZPEDWRI-DXAP5064.3YOLOv8+SA
Object DetectionGRAZPEDWRI-DXF1-score0.63YOLOv8+SA
Object DetectionGRAZPEDWRI-DXAP5064.2YOLOv8+ECA
Object DetectionGRAZPEDWRI-DXF1-score0.65YOLOv8+ECA
Object DetectionGRAZPEDWRI-DXAP5064.2YOLOv8+GAM
Object DetectionGRAZPEDWRI-DXF1-score0.65YOLOv8+GAM
3DMSCOCOAP 0.50.5Yolov8
3DGRAZPEDWRI-DXAP5065.8YOLOv8+ResCBAM
3DGRAZPEDWRI-DXF1-score0.64YOLOv8+ResCBAM
3DGRAZPEDWRI-DXAP5065.8YOLOv8+ResCBAM
3DGRAZPEDWRI-DXF1-score0.64YOLOv8+ResCBAM
3DGRAZPEDWRI-DXAP5065YOLOv8+ResGAM
3DGRAZPEDWRI-DXF1-score0.64YOLOv8+ResGAM
3DGRAZPEDWRI-DXAP5064.3YOLOv8+SA
3DGRAZPEDWRI-DXF1-score0.63YOLOv8+SA
3DGRAZPEDWRI-DXAP5064.2YOLOv8+ECA
3DGRAZPEDWRI-DXF1-score0.65YOLOv8+ECA
3DGRAZPEDWRI-DXAP5064.2YOLOv8+GAM
3DGRAZPEDWRI-DXF1-score0.65YOLOv8+GAM
2D ClassificationMSCOCOAP 0.50.5Yolov8
2D ClassificationGRAZPEDWRI-DXAP5065.8YOLOv8+ResCBAM
2D ClassificationGRAZPEDWRI-DXF1-score0.64YOLOv8+ResCBAM
2D ClassificationGRAZPEDWRI-DXAP5065.8YOLOv8+ResCBAM
2D ClassificationGRAZPEDWRI-DXF1-score0.64YOLOv8+ResCBAM
2D ClassificationGRAZPEDWRI-DXAP5065YOLOv8+ResGAM
2D ClassificationGRAZPEDWRI-DXF1-score0.64YOLOv8+ResGAM
2D ClassificationGRAZPEDWRI-DXAP5064.3YOLOv8+SA
2D ClassificationGRAZPEDWRI-DXF1-score0.63YOLOv8+SA
2D ClassificationGRAZPEDWRI-DXAP5064.2YOLOv8+ECA
2D ClassificationGRAZPEDWRI-DXF1-score0.65YOLOv8+ECA
2D ClassificationGRAZPEDWRI-DXAP5064.2YOLOv8+GAM
2D ClassificationGRAZPEDWRI-DXF1-score0.65YOLOv8+GAM
2D Object DetectionMSCOCOAP 0.50.5Yolov8
2D Object DetectionGRAZPEDWRI-DXAP5065.8YOLOv8+ResCBAM
2D Object DetectionGRAZPEDWRI-DXF1-score0.64YOLOv8+ResCBAM
2D Object DetectionGRAZPEDWRI-DXAP5065.8YOLOv8+ResCBAM
2D Object DetectionGRAZPEDWRI-DXF1-score0.64YOLOv8+ResCBAM
2D Object DetectionGRAZPEDWRI-DXAP5065YOLOv8+ResGAM
2D Object DetectionGRAZPEDWRI-DXF1-score0.64YOLOv8+ResGAM
2D Object DetectionGRAZPEDWRI-DXAP5064.3YOLOv8+SA
2D Object DetectionGRAZPEDWRI-DXF1-score0.63YOLOv8+SA
2D Object DetectionGRAZPEDWRI-DXAP5064.2YOLOv8+ECA
2D Object DetectionGRAZPEDWRI-DXF1-score0.65YOLOv8+ECA
2D Object DetectionGRAZPEDWRI-DXAP5064.2YOLOv8+GAM
2D Object DetectionGRAZPEDWRI-DXF1-score0.65YOLOv8+GAM
Open Vocabulary Object DetectionMSCOCOAP 0.50.5Yolov8
16kMSCOCOAP 0.50.5Yolov8
16kGRAZPEDWRI-DXAP5065.8YOLOv8+ResCBAM
16kGRAZPEDWRI-DXF1-score0.64YOLOv8+ResCBAM
16kGRAZPEDWRI-DXAP5065.8YOLOv8+ResCBAM
16kGRAZPEDWRI-DXF1-score0.64YOLOv8+ResCBAM
16kGRAZPEDWRI-DXAP5065YOLOv8+ResGAM
16kGRAZPEDWRI-DXF1-score0.64YOLOv8+ResGAM
16kGRAZPEDWRI-DXAP5064.3YOLOv8+SA
16kGRAZPEDWRI-DXF1-score0.63YOLOv8+SA
16kGRAZPEDWRI-DXAP5064.2YOLOv8+ECA
16kGRAZPEDWRI-DXF1-score0.65YOLOv8+ECA
16kGRAZPEDWRI-DXAP5064.2YOLOv8+GAM
16kGRAZPEDWRI-DXF1-score0.65YOLOv8+GAM

Related Papers

A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping2025-07-15ECORE: Energy-Conscious Optimized Routing for Deep Learning Models at the Edge2025-07-08Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations2025-07-07