NCSE v2.0

NCSE v2.0: A Dataset of OCR-Processed 19th Century English Newspapers

ImagesTextsCC BY 4.0Introduced 2025-02-18

The NCSE v2.0 is a digitized collection of six 19th-century English periodicals

The ground truth contains 358 cropped images of text blocks from 31 pages of 19th century newspaper data