TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/TASED-Net: Temporally-Aggregating Spatial Encoder-Decoder ...

TASED-Net: Temporally-Aggregating Spatial Encoder-Decoder Network for Video Saliency Detection

Kyle Min, Jason J. Corso

2019-08-15ICCV 2019 10Video Saliency DetectionSaliency Detection
PaperPDFCode(official)

Abstract

TASED-Net is a 3D fully-convolutional network architecture for video saliency detection. It consists of two building blocks: first, the encoder network extracts low-resolution spatiotemporal features from an input clip of several consecutive frames, and then the following prediction network decodes the encoded features spatially while aggregating all the temporal information. As a result, a single prediction map is produced from an input clip of multiple frames. Frame-wise saliency maps can be predicted by applying TASED-Net in a sliding-window fashion to a video. The proposed approach assumes that the saliency map of any frame can be predicted by considering a limited number of past frames. The results of our extensive experiments on video saliency detection validate this assumption and demonstrate that our fully-convolutional model with temporal aggregation method is effective. TASED-Net significantly outperforms previous state-of-the-art approaches on all three major large-scale datasets of video saliency detection: DHF1K, Hollywood2, and UCFSports. After analyzing the results qualitatively, we observe that our model is especially better at attending to salient moving objects.

Results

TaskDatasetMetricValueModel
Saliency DetectionDHF1KNSS2.667TASED-Net
Saliency DetectionMSU Video Saliency PredictionAUC-J0.852TASED-Net
Saliency DetectionMSU Video Saliency PredictionCC0.71TASED-Net
Saliency DetectionMSU Video Saliency PredictionFPS1.85TASED-Net
Saliency DetectionMSU Video Saliency PredictionKLDiv0.538TASED-Net
Saliency DetectionMSU Video Saliency PredictionNSS1.96TASED-Net
Saliency DetectionMSU Video Saliency PredictionSIM0.61TASED-Net

Related Papers

Feature Hallucination for Self-supervised Action Recognition2025-06-25Low-Rate Semantic Communication with Codebook-based Conditional Generative Models2025-04-07Collaborative Temporal Consistency Learning for Point-supervised Natural Language Video Localization2025-03-22A Deep Learning Framework for Visual Attention Prediction and Analysis of News Interfaces2025-03-21Copy-Move Detection in Optical Microscopy: A Segmentation Network and A Dataset2024-12-13Unlocking Comics: The AI4VA Dataset for Visual Understanding2024-10-27AGSENet: A Robust Road Ponding Detection Method for Proactive Traffic Safety2024-10-22VISTA: A Visual and Textual Attention Dataset for Interpreting Multimodal Models2024-10-06