Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Multimodal Text and Image Classification

28 benchmarks, 7 papers

Classification that takes both an image and its associated text as input.

Benchmarks

Multimodal Text and Image Classification on VALSE

average pairwise accuracy, Average Accuracy

Multimodal Text and Image Classification on VALSE actant swap

pairwise accuracy, Accuracy (%)

Multimodal Text and Image Classification on VALSE action replacement

pairwise accuracy, Accuracy (%)

Multimodal Text and Image Classification on VALSE coreference clean

pairwise accuracy, Accuracy (%)

Multimodal Text and Image Classification on VALSE coreference standard

pairwise accuracy, Accuracy (%)

Multimodal Text and Image Classification on VALSE counting adversarial

pairwise accuracy, Accuracy (%)

Multimodal Text and Image Classification on VALSE counting balanced

pairwise accuracy, Accuracy (%)

Multimodal Text and Image Classification on VALSE counting small numbers

pairwise accuracy, Accuracy (%)

Multimodal Text and Image Classification on VALSE existence

pairwise accuracy, Accuracy (%)

Multimodal Text and Image Classification on VALSE foil-it (noun phrases)

pairwise accuracy, Accuracy (%)

Multimodal Text and Image Classification on VALSE plurality

pairwise accuracy, Accuracy (%)

Multimodal Text and Image Classification on VALSE spatial relations

pairwise accuracy, Accuracy (%)

Multimodal Text and Image Classification on Food-101

Multimodal Text and Image Classification on CD18

Accuracy, F-measure (%)

Multimodal Text and Image Classification on CUB-200-2011