MENSA

Movie Scene Saliency Dataset

Introduced 2024-04-04

MENSA: Movie Scene Saliency Dataset

Dataset Summary

The dataset, MENSA (Movie Scene Saliency Dataset) is from the paper "Select and Summarize: Scene Saliency for Movie Script Summarization", and consists of movie scripts and their corresponding summaries. Each scene in the movie script is annotated with scene saliency labels. The training set contains silver labels, which are automatically generated, while the validation and test sets contain human-annotated gold labels.

Dataset Structure

The dataset is divided into three parts:

  • Training Set: Contains movie scripts and summaries with silver scene saliency labels.
  • Validation Set: Contains movie scripts and summaries with human-annotated gold scene saliency labels.
  • Test Set: Contains movie scripts and summaries with human-annotated gold scene saliency labels.