Description
A general multimodal attention unit that works with any number of modalities. It is inspired by graphical models: it infers several attention beliefs by aggregating interaction messages.
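The idea of inferring attention beliefs from aggregated interaction messages can be illustrated with a small sketch. This is not the published formulation. The function name `attention_beliefs`, the random unary weights (standing in for learned parameters), the scaled dot-product pairwise score, and the mean aggregation are all assumptions made for illustration. Each modality is assumed to be an array of shape (number of elements, feature dimension), with the same feature dimension across modalities.

```python
import numpy as np


def softmax(x):
    # Numerically stable softmax over a 1-D score vector.
    e = np.exp(x - x.max())
    return e / e.sum()


def attention_beliefs(modalities, seed=0):
    """Return one attention distribution per modality (illustrative only).

    modalities: list of arrays, each of shape (n_i, d) with a shared d.
    """
    rng = np.random.default_rng(seed)
    d = modalities[0].shape[1]
    # Hypothetical unary weights; in a trained model these would be learned.
    w_unary = [rng.standard_normal(d) / np.sqrt(d) for _ in modalities]

    beliefs = []
    for i, x_i in enumerate(modalities):
        # Local (unary) score of each element in modality i.
        scores = x_i @ w_unary[i]
        for j, x_j in enumerate(modalities):
            if j == i:
                continue
            # Pairwise interaction between every element of i and of j.
            pairwise = x_i @ x_j.T / np.sqrt(d)  # shape (n_i, n_j)
            # Message from modality j to i: average over j's elements.
            scores = scores + pairwise.mean(axis=1)
        # Normalising the aggregated scores gives the belief for modality i.
        beliefs.append(softmax(scores))
    return beliefs
```

Calling `attention_beliefs` on a list of three arrays with 5, 3, and 7 rows returns three probability vectors of those lengths, one per modality, regardless of how many modalities are passed in.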
Papers Using This Method
SAD-Net: a full spectral self-attention detail enhancement network for single image dehazing (2025-04-07)
A Unified Understanding of Adversarial Vulnerability Regarding Unimodal Models and Vision-Language Pre-training Models (2024-07-25)
FGA: Fourier-Guided Attention Network for Crowd Count Estimation (2024-07-08)
Multi Loss-based Feature Fusion and Top Two Voting Ensemble Decision Strategy for Facial Expression Recognition in the Wild (2023-11-06)
Towards Fair Evaluation of Dialogue State Tracking by Flexible Incorporation of Turn-level Performances (2022-04-07)
A Simple Baseline for Audio-Visual Scene-Aware Dialog (2019-06-01)
Factor Graph Attention (2019-04-11)
A Simple Baseline for Audio-Visual Scene-Aware Dialog (2019-04-11)
High-Order Attention Models for Visual Question Answering (2017-11-12)