TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Natural Language Processing/Multimodal Text and Image Classification

Multimodal Text and Image Classification

28 benchmarks7 papers

Classification with both source Image and Text

Benchmarks

Multimodal Text and Image Classification on VALSE

average pairwise accuracyAverage Accuracy

Multimodal Text and Image Classification on VALSE actant swap

pairwise accuracyAccuracy (%)

Multimodal Text and Image Classification on VALSE action replacement

pairwise accuracyAccuracy (%)

Multimodal Text and Image Classification on VALSE coreference clean

pairwise accuracyAccuracy (%)

Multimodal Text and Image Classification on VALSE coreference standard

pairwise accuracyAccuracy (%)

Multimodal Text and Image Classification on VALSE counting adversarial

pairwise accuracyAccuracy (%)

Multimodal Text and Image Classification on VALSE counting balanced

pairwise accuracyAccuracy (%)

Multimodal Text and Image Classification on VALSE counting small numbers

pairwise accuracyAccuracy (%)

Multimodal Text and Image Classification on VALSE existence

pairwise accuracyAccuracy (%)

Multimodal Text and Image Classification on VALSE foil-it (noun phrases)

pairwise accuracyAccuracy (%)

Multimodal Text and Image Classification on VALSE plurality

pairwise accuracyAccuracy (%)

Multimodal Text and Image Classification on VALSE spatial relations

pairwise accuracyAccuracy (%)

Multimodal Text and Image Classification on Food-101

Accuracy (%)

Multimodal Text and Image Classification on CD18

AccuracyF-measure (%)

Multimodal Text and Image Classification on CUB-200-2011

Accuracy