Media-Text
MediaText: a media industry-based dataset for scene text detetcion
ImagesIntroduced 2024-10-23
Media-Text dataset comprising images of banners, posters, covers and another images characterised for media industry.
DATASET DESCRIPTION
- 400 images
- 7 744 annotated text instances
- 973 annotations have been marked as illegible for the task of text recognition
- 659 texts have been markes as do not care (###) for scene text detection.
- Images are represented by 193 unique resolutions. Annotation Format - Each image has corresponding gt_*.txt file, which contains annotations in bounding box format (defined by 4 courners), transcription, and bool flag which determines that text is illegible for OCR. Proposed format is similar to ICDAR15 annotations.
x1, x2, ..., x4, y4, transcription, OCR Flag
**Example: **
37,68,198,49,214,181,52,200,LADIES,False
**Full paper: ** ResearchGate
Please cite the related works in your publications if it helps your research: <br> S. Kalisz, M. Marczyk, J. Polańska, and R. Fagas, “Media-text: a media industry-based dataset for scene text detection,” in Modelling and simulation 2024. The 2024 European Simulation and Modelling Conference, M. Graña and J. D. Nuñez-Gonzalez, Eds., EUROSIS-ETI, 2024, pp. 138–144.