Multimodal Text and Image Classification on CUB-200-2011

Metric: Accuracy (higher is better)

LeaderboardDataset
#ModelAccuracyExtra DataPaperDateCode
1Two Branch Network (Text - Bert + Image - Nts-Net)96.81No--Code