Multimodal Machine Translation
10 benchmarks, 108 papers
Multimodal machine translation is the task of performing machine translation with multiple data sources, typically text plus another modality such as an image. For example, the English sentence "a bird is flying over water" is translated into German together with an image of a bird over water.
<span style="color:grey; opacity: 0.6">(Image credit: Findings of the Third Shared Task on Multimodal Machine Translation)</span>