Papers With Code 2 | ML Benchmarks, SotA Results & Code

Media-Text dataset comprising images of banners, posters, covers and another images characterised for media industry.

DATASET DESCRIPTION

400 images
7 744 annotated text instances
973 annotations have been marked as illegible for the task of text recognition
659 texts have been markes as do not care (###) for scene text detection.
Images are represented by 193 unique resolutions. Annotation Format - Each image has corresponding gt_*.txt file, which contains annotations in bounding box format (defined by 4 courners), transcription, and bool flag which determines that text is illegible for OCR. Proposed format is similar to ICDAR15 annotations.

x1, x2, ..., x4, y4, transcription, OCR Flag

**Example: **

37,68,198,49,214,181,52,200,LADIES,False

**Full paper: ** ResearchGate

Please cite the related works in your publications if it helps your research: <br> S. Kalisz, M. Marczyk, J. Polańska, and R. Fagas, “Media-text: a media industry-based dataset for scene text detection,” in Modelling and simulation 2024. The 2024 European Simulation and Modelling Conference, M. Graña and J. D. Nuñez-Gonzalez, Eds., EUROSIS-ETI, 2024, pp. 138–144.