ViDoRe

Visual Document Retrieval Benchmark

Introduced 2024-06-27

This collection groups all the datasets that make up the ViDoRe benchmark. It includes the test sets of several academic datasets (ArXiVQA, DocVQA, InfoVQA, TATDQA, TabFQuAD) and of synthetically generated datasets spanning various themes and industrial applications (Artificial Intelligence, Government Reports, Healthcare Industry, Energy, and Shift Project). Further details can be found on the corresponding dataset cards.
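
As a rough illustration of how the test sets in the collection might be loaded, the sketch below uses the Hugging Face `datasets` library. The repository IDs are assumptions, not taken from this description; check each one against its dataset card before relying on it. The helper name `load_vidore_subset` is hypothetical.

```python
# Minimal sketch: list the benchmark subsets and load one on demand.
# ASSUMPTION: every repository ID below is a guess at the Hub naming and
# should be verified on the corresponding dataset card.
ASSUMED_VIDORE_REPOS = {
    "academic": [
        "vidore/arxivqa_test_subsampled",
        "vidore/docvqa_test_subsampled",
        "vidore/infovqa_test_subsampled",
        "vidore/tatdqa_test",
        "vidore/tabfquad_test_subsampled",
    ],
    "synthetic": [
        "vidore/syntheticDocQA_artificial_intelligence_test",
        "vidore/syntheticDocQA_government_reports_test",
        "vidore/syntheticDocQA_healthcare_industry_test",
        "vidore/syntheticDocQA_energy_test",
        "vidore/shiftproject_test",
    ],
}


def load_vidore_subset(repo_id: str, split: str = "test"):
    """Load one subset (hypothetical helper).

    Requires `pip install datasets` and network access. The split name
    "test" is also an assumption.
    """
    # Imported lazily so the listing above can be used without the library.
    from datasets import load_dataset

    return load_dataset(repo_id, split=split)


if __name__ == "__main__":
    for group, repos in ASSUMED_VIDORE_REPOS.items():
        print(f"{group}: {len(repos)} subsets")
```

Calling `load_vidore_subset(ASSUMED_VIDORE_REPOS["academic"][1])` would then download a single subset, provided the assumed ID and split name match what the dataset card specifies.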