TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Learning Action Changes by Measuring Verb-Adverb Textual R...

Learning Action Changes by Measuring Verb-Adverb Textual Relationships

Davide Moltisanti, Frank Keller, Hakan Bilen, Laura Sevilla-Lara

2023-03-27CVPR 2023 1Video-Adverb Retrieval
PaperPDFCode(official)

Abstract

The goal of this work is to understand the way actions are performed in videos. That is, given a video, we aim to predict an adverb indicating a modification applied to the action (e.g. cut "finely"). We cast this problem as a regression task. We measure textual relationships between verbs and adverbs to generate a regression target representing the action change we aim to learn. We test our approach on a range of datasets and achieve state-of-the-art results on both adverb prediction and antonym classification. Furthermore, we outperform previous work when we lift two commonly assumed conditions: the availability of action labels during testing and the pairing of adverbs as antonyms. Existing datasets for adverb recognition are either noisy, which makes learning difficult, or contain actions whose appearance is not influenced by adverbs, which makes evaluation less reliable. To address this, we collect a new high quality dataset: Adverbs in Recipes (AIR). We focus on instructional recipes videos, curating a set of actions that exhibit meaningful visual changes when performed differently. Videos in AIR are more tightly trimmed and were manually reviewed by multiple annotators to ensure high labelling quality. Results show that models learn better from AIR given its cleaner videos. At the same time, adverb prediction on AIR is challenging, demonstrating that there is considerable room for improvement.

Results

TaskDatasetMetricValueModel
VideoVATEX AdverbsAcc-A0.755Action Changes (reg)
VideoVATEX AdverbsmAP M0.086Action Changes (reg)
VideoVATEX AdverbsmAP W0.261Action Changes (reg)
VideoVATEX AdverbsAcc-A0.754Action Changes (cls)
VideoVATEX AdverbsmAP M0.108Action Changes (cls)
VideoVATEX AdverbsmAP W0.283Action Changes (cls)
VideoVATEX AdverbsAcc-A0.701Action Changes (reg, fixed δ)
VideoVATEX AdverbsmAP M0.051Action Changes (reg, fixed δ)
VideoVATEX AdverbsmAP W0.175Action Changes (reg, fixed δ)
VideoAIRAcc-A0.837Action Changes (cls)
VideoAIRmAP M0.289Action Changes (cls)
VideoAIRmAP W0.613Action Changes (cls)
VideoAIRAcc-A0.847Action Changes (reg)
VideoAIRmAP M0.244Action Changes (reg)
VideoAIRmAP M0.193Action Changes (reg, fixed δ)
VideoAIRmAP W0.554Action Changes (reg, fixed δ)
VideoHowTo100M AdverbsAcc-A0.799Action Changes (reg)
VideoHowTo100M AdverbsAcc-A0.786Action Changes (cls)
VideoHowTo100M AdverbsmAP M0.423Action Changes (cls)
VideoHowTo100M AdverbsmAP W0.555Action Changes (cls)
VideoHowTo100M AdverbsAcc-A0.706Action Changes (reg, fixed δ)
VideoHowTo100M AdverbsmAP M0.215Action Changes (reg, fixed δ)
VideoHowTo100M AdverbsmAP W0.32Action Changes (reg, fixed δ)
VideoActivityNet AdverbsAcc-A0.741Action Changes (cls)
VideoActivityNet AdverbsmAP M0.096Action Changes (cls)
VideoActivityNet AdverbsmAP W0.13Action Changes (cls)
VideoActivityNet AdverbsAcc-A0.714Action Changes (reg)
VideoActivityNet AdverbsmAP M0.079Action Changes (reg)
VideoActivityNet AdverbsAcc-A0.706Action Changes (reg, fixed δ)
VideoActivityNet AdverbsmAP M0.075Action Changes (reg, fixed δ)
VideoActivityNet AdverbsmAP W0.119Action Changes (reg, fixed δ)
VideoMSR-VTT AdverbsAcc-A0.774Action Changes (reg)
VideoMSR-VTT AdverbsmAP M0.114Action Changes (reg)
VideoMSR-VTT AdverbsmAP W0.282Action Changes (reg)
VideoMSR-VTT AdverbsAcc-A0.751Action Changes (cls)
VideoMSR-VTT AdverbsmAP M0.131Action Changes (cls)
VideoMSR-VTT AdverbsmAP W0.305Action Changes (cls)
VideoMSR-VTT AdverbsAcc-A0.706Action Changes (reg, fixed δ)
VideoMSR-VTT AdverbsmAP M0.1Action Changes (reg, fixed δ)
VideoMSR-VTT AdverbsmAP W0.203Action Changes (reg, fixed δ)
Video RetrievalVATEX AdverbsAcc-A0.755Action Changes (reg)
Video RetrievalVATEX AdverbsmAP M0.086Action Changes (reg)
Video RetrievalVATEX AdverbsmAP W0.261Action Changes (reg)
Video RetrievalVATEX AdverbsAcc-A0.754Action Changes (cls)
Video RetrievalVATEX AdverbsmAP M0.108Action Changes (cls)
Video RetrievalVATEX AdverbsmAP W0.283Action Changes (cls)
Video RetrievalVATEX AdverbsAcc-A0.701Action Changes (reg, fixed δ)
Video RetrievalVATEX AdverbsmAP M0.051Action Changes (reg, fixed δ)
Video RetrievalVATEX AdverbsmAP W0.175Action Changes (reg, fixed δ)
Video RetrievalAIRAcc-A0.837Action Changes (cls)
Video RetrievalAIRmAP M0.289Action Changes (cls)
Video RetrievalAIRmAP W0.613Action Changes (cls)
Video RetrievalAIRAcc-A0.847Action Changes (reg)
Video RetrievalAIRmAP M0.244Action Changes (reg)
Video RetrievalAIRmAP M0.193Action Changes (reg, fixed δ)
Video RetrievalAIRmAP W0.554Action Changes (reg, fixed δ)
Video RetrievalHowTo100M AdverbsAcc-A0.799Action Changes (reg)
Video RetrievalHowTo100M AdverbsAcc-A0.786Action Changes (cls)
Video RetrievalHowTo100M AdverbsmAP M0.423Action Changes (cls)
Video RetrievalHowTo100M AdverbsmAP W0.555Action Changes (cls)
Video RetrievalHowTo100M AdverbsAcc-A0.706Action Changes (reg, fixed δ)
Video RetrievalHowTo100M AdverbsmAP M0.215Action Changes (reg, fixed δ)
Video RetrievalHowTo100M AdverbsmAP W0.32Action Changes (reg, fixed δ)
Video RetrievalActivityNet AdverbsAcc-A0.741Action Changes (cls)
Video RetrievalActivityNet AdverbsmAP M0.096Action Changes (cls)
Video RetrievalActivityNet AdverbsmAP W0.13Action Changes (cls)
Video RetrievalActivityNet AdverbsAcc-A0.714Action Changes (reg)
Video RetrievalActivityNet AdverbsmAP M0.079Action Changes (reg)
Video RetrievalActivityNet AdverbsAcc-A0.706Action Changes (reg, fixed δ)
Video RetrievalActivityNet AdverbsmAP M0.075Action Changes (reg, fixed δ)
Video RetrievalActivityNet AdverbsmAP W0.119Action Changes (reg, fixed δ)
Video RetrievalMSR-VTT AdverbsAcc-A0.774Action Changes (reg)
Video RetrievalMSR-VTT AdverbsmAP M0.114Action Changes (reg)
Video RetrievalMSR-VTT AdverbsmAP W0.282Action Changes (reg)
Video RetrievalMSR-VTT AdverbsAcc-A0.751Action Changes (cls)
Video RetrievalMSR-VTT AdverbsmAP M0.131Action Changes (cls)
Video RetrievalMSR-VTT AdverbsmAP W0.305Action Changes (cls)
Video RetrievalMSR-VTT AdverbsAcc-A0.706Action Changes (reg, fixed δ)
Video RetrievalMSR-VTT AdverbsmAP M0.1Action Changes (reg, fixed δ)
Video RetrievalMSR-VTT AdverbsmAP W0.203Action Changes (reg, fixed δ)
Video-Adverb RetrievalVATEX AdverbsAcc-A0.755Action Changes (reg)
Video-Adverb RetrievalVATEX AdverbsmAP M0.086Action Changes (reg)
Video-Adverb RetrievalVATEX AdverbsmAP W0.261Action Changes (reg)
Video-Adverb RetrievalVATEX AdverbsAcc-A0.754Action Changes (cls)
Video-Adverb RetrievalVATEX AdverbsmAP M0.108Action Changes (cls)
Video-Adverb RetrievalVATEX AdverbsmAP W0.283Action Changes (cls)
Video-Adverb RetrievalVATEX AdverbsAcc-A0.701Action Changes (reg, fixed δ)
Video-Adverb RetrievalVATEX AdverbsmAP M0.051Action Changes (reg, fixed δ)
Video-Adverb RetrievalVATEX AdverbsmAP W0.175Action Changes (reg, fixed δ)
Video-Adverb RetrievalAIRAcc-A0.837Action Changes (cls)
Video-Adverb RetrievalAIRmAP M0.289Action Changes (cls)
Video-Adverb RetrievalAIRmAP W0.613Action Changes (cls)
Video-Adverb RetrievalAIRAcc-A0.847Action Changes (reg)
Video-Adverb RetrievalAIRmAP M0.244Action Changes (reg)
Video-Adverb RetrievalAIRmAP M0.193Action Changes (reg, fixed δ)
Video-Adverb RetrievalAIRmAP W0.554Action Changes (reg, fixed δ)
Video-Adverb RetrievalHowTo100M AdverbsAcc-A0.799Action Changes (reg)
Video-Adverb RetrievalHowTo100M AdverbsAcc-A0.786Action Changes (cls)
Video-Adverb RetrievalHowTo100M AdverbsmAP M0.423Action Changes (cls)
Video-Adverb RetrievalHowTo100M AdverbsmAP W0.555Action Changes (cls)
Video-Adverb RetrievalHowTo100M AdverbsAcc-A0.706Action Changes (reg, fixed δ)
Video-Adverb RetrievalHowTo100M AdverbsmAP M0.215Action Changes (reg, fixed δ)
Video-Adverb RetrievalHowTo100M AdverbsmAP W0.32Action Changes (reg, fixed δ)
Video-Adverb RetrievalActivityNet AdverbsAcc-A0.741Action Changes (cls)
Video-Adverb RetrievalActivityNet AdverbsmAP M0.096Action Changes (cls)
Video-Adverb RetrievalActivityNet AdverbsmAP W0.13Action Changes (cls)
Video-Adverb RetrievalActivityNet AdverbsAcc-A0.714Action Changes (reg)
Video-Adverb RetrievalActivityNet AdverbsmAP M0.079Action Changes (reg)
Video-Adverb RetrievalActivityNet AdverbsAcc-A0.706Action Changes (reg, fixed δ)
Video-Adverb RetrievalActivityNet AdverbsmAP M0.075Action Changes (reg, fixed δ)
Video-Adverb RetrievalActivityNet AdverbsmAP W0.119Action Changes (reg, fixed δ)
Video-Adverb RetrievalMSR-VTT AdverbsAcc-A0.774Action Changes (reg)
Video-Adverb RetrievalMSR-VTT AdverbsmAP M0.114Action Changes (reg)
Video-Adverb RetrievalMSR-VTT AdverbsmAP W0.282Action Changes (reg)
Video-Adverb RetrievalMSR-VTT AdverbsAcc-A0.751Action Changes (cls)
Video-Adverb RetrievalMSR-VTT AdverbsmAP M0.131Action Changes (cls)
Video-Adverb RetrievalMSR-VTT AdverbsmAP W0.305Action Changes (cls)
Video-Adverb RetrievalMSR-VTT AdverbsAcc-A0.706Action Changes (reg, fixed δ)
Video-Adverb RetrievalMSR-VTT AdverbsmAP M0.1Action Changes (reg, fixed δ)
Video-Adverb RetrievalMSR-VTT AdverbsmAP W0.203Action Changes (reg, fixed δ)

Related Papers

Video-adverb retrieval with compositional adverb-action embeddings2023-09-26How Do You Do It? Fine-Grained Action Understanding with Pseudo-Adverbs2022-03-23Action Modifiers: Learning from Adverbs in Instructional Videos2019-12-13