Scene Text Detection is a computer vision task that involves automatically identifying and localizing text within natural images or videos. The goal of scene text detection is to develop algorithms that can robustly detect and and label text with bounding boxes in uncontrolled and complex environments, such as street signs, billboards, or license plates.
<span class="description-source">Source: ContourNet: Taking a Further Step toward Accurate Arbitrary-shaped Scene Text Detection </span>