TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Methods/Coordinate attention

Coordinate attention

GeneralIntroduced 200031 papers
Source Paper

Description

Hou et al. proposed coordinate attention, a novel attention mechanism which embeds positional information into channel attention, so that the network can focus on large important regions at little computational cost.

The coordinate attention mechanism has two consecutive steps, coordinate information embedding and coordinate attention generation. First, two spatial extents of pooling kernels encode each channel horizontally and vertically. In the second step, a shared 1×11\times 11×1 convolutional transformation function is applied to the concatenated outputs of the two pooling layers. Then coordinate attention splits the resulting tensor into two separate tensors to yield attention vectors with the same number of channels for horizontal and vertical coordinates of the input XXX along. This can be written as \begin{align} z^h &= \text{GAP}^h(X) \end{align} \begin{align} z^w &= \text{GAP}^w(X) \end{align} \begin{align} f &= \delta(\text{BN}(\text{Conv}_1^{1\times 1}([z^h;z^w]))) \end{align} \begin{align} f^h, f^w &= \text{Split}(f) \end{align} \begin{align} s^h &= \sigma(\text{Conv}_h^{1\times 1}(f^h)) \end{align} \begin{align} s^w &= \sigma(\text{Conv}_w^{1\times 1}(f^w)) \end{align} \begin{align} Y &= X s^h s^w \end{align} where GAPh\text{GAP}^hGAPh and GAPw\text{GAP}^wGAPw denote pooling functions for vertical and horizontal coordinates, and sh∈RC×1×Ws^h \in \mathbb{R}^{C\times 1\times W}sh∈RC×1×W and sw∈RC×H×1s^w \in \mathbb{R}^{C\times H\times 1}sw∈RC×H×1 represent corresponding attention weights.

Using coordinate attention, the network can accurately obtain the position of a targeted object. This approach has a larger receptive field than BAM and CBAM. Like an SE block, it also models cross-channel relationships, effectively enhancing the expressive power of the learned features. Due to its lightweight design and flexibility, it can be easily used in classical building blocks of mobile networks.

Papers Using This Method

Design description of Wisdom Computing Persperctive2025-05-02Enhancing Traffic Sign Recognition On The Performance Based On Yolov82025-04-02LightEndoStereo: A Real-time Lightweight Stereo Matching Method for Endoscopy Images2025-03-02A Physics-Inspired Deep Learning Framework with Polar Coordinate Attention for Ptychographic Imaging2024-11-25Hyperspectral Imaging-Based Perception in Autonomous Driving Scenarios: Benchmarking Baseline Semantic Segmentation Models2024-10-29Optimizing YOLO Architectures for Optimal Road Damage Detection and Classification: A Comparative Study from YOLOv7 to YOLOv102024-10-10Improved Unet model for brain tumor image segmentation based on ASPP-coordinate attention mechanism2024-09-13RICAU-Net: Residual-block Inspired Coordinate Attention U-Net for Segmentation of Small and Sparse Calcium Lesions in Cardiac CT2024-09-11ALSS-YOLO: An Adaptive Lightweight Channel Split and Shuffling Network for TIR Wildlife Detection in UAV Imagery2024-09-10CSANet: Channel Spatial Attention Network for Robust 3D Face Alignment and Reconstruction2024-05-30ELA: Efficient Local Attention for Deep Convolutional Neural Networks2024-03-02Deep Linear Array Pushbroom Image Restoration: A Degradation Pipeline and Jitter-Aware Restoration Network2024-01-16YOLO algorithm with hybrid attention feature pyramid network for solder joint defect detection2024-01-02YOLOv5s-BC: An improved YOLOv5s-based method for real-time apple detection2023-11-10Marine Debris Detection in Satellite Surveillance using Attention Mechanisms2023-07-09Multi-cropping Contrastive Learning and Domain Consistency for Unsupervised Image-to-Image Translation2023-04-24Two-stage MR Image Segmentation Method for Brain Tumors based on Attention Mechanism2023-04-17Fast vehicle detection algorithm based on lightweight YOLO7-tiny2023-04-12PCCA-Model: an attention module for medical image segmentation2023-04-01TWR-MCAE: A Data Augmentation Method for Through-the-Wall Radar Human Motion Recognition2023-01-06