Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Coordinate attention

General · Introduced 2000 · 31 papers

Description

Hou et al. proposed coordinate attention, a novel attention mechanism which embeds positional information into channel attention, so that the network can focus on large important regions at little computational cost.

The coordinate attention mechanism has two consecutive steps: coordinate information embedding and coordinate attention generation. In the first step, pooling kernels with two different spatial extents encode each channel along the horizontal and vertical coordinates. In the second step, a shared $1\times 1$ convolutional transformation is applied to the concatenated outputs of the two pooling layers. Coordinate attention then splits the resulting tensor into two separate tensors, which yield attention vectors with the same number of channels as the input $X$, one for the horizontal and one for the vertical coordinate. This can be written as

\begin{align}
z^h &= \text{GAP}^h(X) \\
z^w &= \text{GAP}^w(X) \\
f &= \delta(\text{BN}(\text{Conv}_1^{1\times 1}([z^h;z^w]))) \\
f^h, f^w &= \text{Split}(f) \\
s^h &= \sigma(\text{Conv}_h^{1\times 1}(f^h)) \\
s^w &= \sigma(\text{Conv}_w^{1\times 1}(f^w)) \\
Y &= X s^h s^w
\end{align}

where $\text{GAP}^h$ and $\text{GAP}^w$ denote pooling functions for vertical and horizontal coordinates, $[\cdot\,;\cdot]$ denotes concatenation, $\text{BN}$ is batch normalization, $\delta$ is a nonlinear activation, $\sigma$ is the sigmoid function, and $s^h \in \mathbb{R}^{C\times 1\times W}$ and $s^w \in \mathbb{R}^{C\times H\times 1}$ represent the corresponding attention weights. The product in the last line is element-wise, with $s^h$ and $s^w$ broadcast along their singleton dimensions to the shape of $X$.
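The equations above can be traced step by step in a small, dependency-free sketch. This is an illustration under stated assumptions, not the authors' implementation: the function names (`coordinate_attention`, `matmul`, `sigmoid`) and the weight arguments `w1`, `w_h`, `w_w` are hypothetical, batch normalization is omitted, $\delta$ is taken to be ReLU because the description does not name the activation, and each $1\times 1$ convolution is written as a plain matrix applied at every position. Following the shapes stated above, $\text{GAP}^h$ is assumed to average over the height axis so that $s^h$ has one value per channel and column ($C\times 1\times W$, stored here as $C\times W$).

```python
import math


def matmul(a, b):
    """Multiply an (m x k) matrix by a (k x n) matrix, both nested lists."""
    return [[sum(p * q for p, q in zip(row, col)) for col in zip(*b)]
            for row in a]


def sigmoid(v):
    return 1.0 / (1.0 + math.exp(-v))


def coordinate_attention(x, w1, w_h, w_w):
    """Apply coordinate attention to one feature map.

    x   : input of shape [C][H][W]
    w1  : shared 1x1 conv weights, shape [C_mid][C]   (hypothetical name)
    w_h : 1x1 conv weights for s^h, shape [C][C_mid]  (hypothetical name)
    w_w : 1x1 conv weights for s^w, shape [C][C_mid]  (hypothetical name)
    Returns (y, s_h, s_w) with y of shape [C][H][W].
    """
    C, H, W = len(x), len(x[0]), len(x[0][0])

    # Step 1: coordinate information embedding.
    # z^h averages over height (assumed convention) -> shape C x W.
    z_h = [[sum(x[c][i][j] for i in range(H)) / H for j in range(W)]
           for c in range(C)]
    # z^w averages over width -> shape C x H.
    z_w = [[sum(x[c][i]) / W for i in range(H)] for c in range(C)]

    # Step 2: coordinate attention generation.
    # Concatenate along the spatial axis: C x (W + H).
    z = [z_h[c] + z_w[c] for c in range(C)]
    # Shared 1x1 conv, then ReLU as delta (assumption); BN is omitted.
    f = [[max(0.0, v) for v in row] for row in matmul(w1, z)]
    # Split back into the two coordinate directions.
    f_h = [row[:W] for row in f]
    f_w = [row[W:] for row in f]
    # Separate 1x1 convs followed by sigmoid give the attention weights.
    s_h = [[sigmoid(v) for v in row] for row in matmul(w_h, f_h)]  # C x W
    s_w = [[sigmoid(v) for v in row] for row in matmul(w_w, f_w)]  # C x H

    # Y = X * s^h * s^w, element-wise with broadcasting.
    y = [[[x[c][i][j] * s_h[c][j] * s_w[c][i] for j in range(W)]
          for i in range(H)] for c in range(C)]
    return y, s_h, s_w
```

With all weights set to zero, both attention maps equal 0.5 everywhere, so the output is the input scaled by 0.25. This gives a quick sanity check that the broadcasting in the last step is wired correctly.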

Using coordinate attention, the network can accurately obtain the position of a targeted object. This approach has a larger receptive field than BAM and CBAM. Like an SE block, it also models cross-channel relationships, effectively enhancing the expressive power of the learned features. Due to its lightweight design and flexibility, it can be easily used in classical building blocks of mobile networks.

Papers Using This Method

Design description of Wisdom Computing Persperctive (2025-05-02)
Enhancing Traffic Sign Recognition On The Performance Based On Yolov8 (2025-04-02)
LightEndoStereo: A Real-time Lightweight Stereo Matching Method for Endoscopy Images (2025-03-02)
A Physics-Inspired Deep Learning Framework with Polar Coordinate Attention for Ptychographic Imaging (2024-11-25)
Hyperspectral Imaging-Based Perception in Autonomous Driving Scenarios: Benchmarking Baseline Semantic Segmentation Models (2024-10-29)
Optimizing YOLO Architectures for Optimal Road Damage Detection and Classification: A Comparative Study from YOLOv7 to YOLOv10 (2024-10-10)
Improved Unet model for brain tumor image segmentation based on ASPP-coordinate attention mechanism (2024-09-13)
RICAU-Net: Residual-block Inspired Coordinate Attention U-Net for Segmentation of Small and Sparse Calcium Lesions in Cardiac CT (2024-09-11)
ALSS-YOLO: An Adaptive Lightweight Channel Split and Shuffling Network for TIR Wildlife Detection in UAV Imagery (2024-09-10)
CSANet: Channel Spatial Attention Network for Robust 3D Face Alignment and Reconstruction (2024-05-30)
ELA: Efficient Local Attention for Deep Convolutional Neural Networks (2024-03-02)
Deep Linear Array Pushbroom Image Restoration: A Degradation Pipeline and Jitter-Aware Restoration Network (2024-01-16)
YOLO algorithm with hybrid attention feature pyramid network for solder joint defect detection (2024-01-02)
YOLOv5s-BC: An improved YOLOv5s-based method for real-time apple detection (2023-11-10)
Marine Debris Detection in Satellite Surveillance using Attention Mechanisms (2023-07-09)
Multi-cropping Contrastive Learning and Domain Consistency for Unsupervised Image-to-Image Translation (2023-04-24)
Two-stage MR Image Segmentation Method for Brain Tumors based on Attention Mechanism (2023-04-17)
Fast vehicle detection algorithm based on lightweight YOLO7-tiny (2023-04-12)
PCCA-Model: an attention module for medical image segmentation (2023-04-01)
TWR-MCAE: A Data Augmentation Method for Through-the-Wall Radar Human Motion Recognition (2023-01-06)