Tasks
SotA
Datasets
Papers
Methods
Submit
About
SotA
/
Natural Language Processing
/
Multimodal Text and Image Classification
Multimodal Text and Image Classification
28 benchmarks
7 papers
Classification with both source Image and Text
Benchmarks
Multimodal Text and Image Classification on
VALSE
average pairwise accuracy
Average Accuracy
Multimodal Text and Image Classification on
VALSE actant swap
pairwise accuracy
Accuracy (%)
Multimodal Text and Image Classification on
VALSE action replacement
pairwise accuracy
Accuracy (%)
Multimodal Text and Image Classification on
VALSE coreference clean
pairwise accuracy
Accuracy (%)
Multimodal Text and Image Classification on
VALSE coreference standard
pairwise accuracy
Accuracy (%)
Multimodal Text and Image Classification on
VALSE counting adversarial
pairwise accuracy
Accuracy (%)
Multimodal Text and Image Classification on
VALSE counting balanced
pairwise accuracy
Accuracy (%)
Multimodal Text and Image Classification on
VALSE counting small numbers
pairwise accuracy
Accuracy (%)
Multimodal Text and Image Classification on
VALSE existence
pairwise accuracy
Accuracy (%)
Multimodal Text and Image Classification on
VALSE foil-it (noun phrases)
pairwise accuracy
Accuracy (%)
Multimodal Text and Image Classification on
VALSE plurality
pairwise accuracy
Accuracy (%)
Multimodal Text and Image Classification on
VALSE spatial relations
pairwise accuracy
Accuracy (%)
Multimodal Text and Image Classification on
Food-101
Accuracy (%)
Multimodal Text and Image Classification on
CD18
Accuracy
F-measure (%)
Multimodal Text and Image Classification on
CUB-200-2011
Accuracy