Description
PP-OCR is an OCR system that consists of three parts: text detection, rectification of the detected boxes, and text recognition. The purpose of text detection is to locate the text areas in the image. In PP-OCR, Differentiable Binarization (DB), which is based on a simple segmentation network, is used as the text detector. The text recognizer is CRNN, which integrates feature extraction and sequence modeling. It adopts the Connectionist Temporal Classification (CTC) loss to avoid the inconsistency between prediction and label.
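The "inconsistency between prediction and label" that CTC addresses is a length mismatch: a recognizer emits one class per image frame, while the label is a shorter character string. A minimal sketch of greedy CTC decoding may make this concrete. It is an illustration only, not PP-OCR's actual code; the function name `ctc_greedy_decode`, the blank index 0, and the toy character set are assumptions made for this example.

```python
from typing import List, Optional, Sequence

BLANK = 0  # assumption: class index 0 is the CTC blank symbol


def ctc_greedy_decode(frame_ids: Sequence[int], charset: str, blank: int = BLANK) -> str:
    """Map per-frame class indices to text: merge consecutive repeats, then drop blanks.

    Class index i (i >= 1) is assumed to correspond to charset[i - 1].
    """
    chars: List[str] = []
    prev: Optional[int] = None
    for idx in frame_ids:
        # Emit a character only when the class changes and is not the blank.
        if idx != prev and idx != blank:
            chars.append(charset[idx - 1])
        prev = idx
    return "".join(chars)


# Nine frames collapse to a four-character string. The blank between the
# two runs of class 1 is what allows the repeated letter "a" to survive.
frames = [0, 1, 1, 0, 1, 2, 2, 0, 3]
print(ctc_greedy_decode(frames, "abc"))  # prints "aabc"
```

Because many frame sequences collapse to the same string, the CTC loss can sum over all of them during training, so no per-frame alignment between image and label is needed.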
Papers Using This Method
NusaAksara: A Multimodal and Multilingual Benchmark for Preserving Indonesian Indigenous Scripts (2025-02-25)
OSPC: Detecting Harmful Memes with Large Language Model as a Catalyst (2024-06-14)
PP-OCRv2: Bag of Tricks for Ultra Lightweight OCR System (2021-09-07)
PP-OCR: A Practical Ultra Lightweight OCR System (2020-09-21)