TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/TranSalNet: Towards perceptually relevant visual saliency ...

TranSalNet: Towards perceptually relevant visual saliency prediction

Jianxun Lou, Hanhe Lin, David Marshall, Dietmar Saupe, Hantao Liu

2021-10-07Saliency PredictionPrediction
PaperPDFCode(official)

Abstract

Visual saliency prediction using transformers - Convolutional neural networks (CNNs) have significantly advanced computational modelling for saliency prediction. However, accurately simulating the mechanisms of visual attention in the human cortex remains an academic challenge. It is critical to integrate properties of human vision into the design of CNN architectures, leading to perceptually more relevant saliency prediction. Due to the inherent inductive biases of CNN architectures, there is a lack of sufficient long-range contextual encoding capacity. This hinders CNN-based saliency models from capturing properties that emulate viewing behaviour of humans. Transformers have shown great potential in encoding long-range information by leveraging the self-attention mechanism. In this paper, we propose a novel saliency model that integrates transformer components to CNNs to capture the long-range contextual visual information. Experimental results show that the transformers provide added value to saliency prediction, enhancing its perceptual relevance in the performance. Our proposed saliency model using transformers has achieved superior results on public benchmarks and competitions for saliency prediction models. The source code of our proposed saliency model TranSalNet is available at: https://github.com/LJOVO/TranSalNet

Results

TaskDatasetMetricValueModel
Saliency DetectionMIT300AUC-Judd0.8734TranSalNet
Saliency DetectionMIT300CC0.807TranSalNet
Saliency DetectionMIT300KLD1.0141TranSalNet
Saliency DetectionMIT300NSS2.4134TranSalNet
Saliency DetectionMIT300SIM0.6895TranSalNet
Saliency DetectionMIT300sAUC0.7467TranSalNet
Saliency DetectionSALECIKL0.873Transalnet
Saliency DetectionSALICONAUC0.868TranSalNet
Saliency DetectionSALICONCC0.907TranSalNet
Saliency DetectionSALICONKLD0.373TranSalNet
Saliency DetectionSALICONNSS2.014TranSalNet
Saliency DetectionSALICONSIM0.803TranSalNet
Saliency DetectionSALICONsAUC0.747TranSalNet
Saliency PredictionMIT300AUC-Judd0.8734TranSalNet
Saliency PredictionMIT300CC0.807TranSalNet
Saliency PredictionMIT300KLD1.0141TranSalNet
Saliency PredictionMIT300NSS2.4134TranSalNet
Saliency PredictionMIT300SIM0.6895TranSalNet
Saliency PredictionMIT300sAUC0.7467TranSalNet
Saliency PredictionSALECIKL0.873Transalnet
Saliency PredictionSALICONAUC0.868TranSalNet
Saliency PredictionSALICONCC0.907TranSalNet
Saliency PredictionSALICONKLD0.373TranSalNet
Saliency PredictionSALICONNSS2.014TranSalNet
Saliency PredictionSALICONSIM0.803TranSalNet
Saliency PredictionSALICONsAUC0.747TranSalNet
Few-Shot Transfer Learning for Saliency PredictionMIT300AUC-Judd0.8734TranSalNet
Few-Shot Transfer Learning for Saliency PredictionMIT300CC0.807TranSalNet
Few-Shot Transfer Learning for Saliency PredictionMIT300KLD1.0141TranSalNet
Few-Shot Transfer Learning for Saliency PredictionMIT300NSS2.4134TranSalNet
Few-Shot Transfer Learning for Saliency PredictionMIT300SIM0.6895TranSalNet
Few-Shot Transfer Learning for Saliency PredictionMIT300sAUC0.7467TranSalNet
Few-Shot Transfer Learning for Saliency PredictionSALECIKL0.873Transalnet
Few-Shot Transfer Learning for Saliency PredictionSALICONAUC0.868TranSalNet
Few-Shot Transfer Learning for Saliency PredictionSALICONCC0.907TranSalNet
Few-Shot Transfer Learning for Saliency PredictionSALICONKLD0.373TranSalNet
Few-Shot Transfer Learning for Saliency PredictionSALICONNSS2.014TranSalNet
Few-Shot Transfer Learning for Saliency PredictionSALICONSIM0.803TranSalNet
Few-Shot Transfer Learning for Saliency PredictionSALICONsAUC0.747TranSalNet

Related Papers

Multi-Strategy Improved Snake Optimizer Accelerated CNN-LSTM-Attention-Adaboost for Trajectory Prediction2025-07-21Generative Click-through Rate Prediction with Applications to Search Advertising2025-07-15Conformation-Aware Structure Prediction of Antigen-Recognizing Immune Proteins2025-07-11Foundation models for time series forecasting: Application in conformal prediction2025-07-09Predicting Graph Structure via Adapted Flux Balance Analysis2025-07-08Speech Quality Assessment Model Based on Mixture of Experts: System-Level Performance Enhancement and Utterance-Level Challenge Analysis2025-07-08A Wireless Foundation Model for Multi-Task Prediction2025-07-08High Order Collaboration-Oriented Federated Graph Neural Network for Accurate QoS Prediction2025-07-07