BLN600
BLN600: A Parallel Corpus of Machine/Human Transcribed Nineteenth Century Newspaper Texts
Images, Texts · CC BY-NC-ND 4.0 · Introduced 2024-09-24
A publicly available corpus of nineteenth-century newspaper text focused on crime in London, derived from parts 1 and 2 of the Gale British Library Newspapers (BLN) corpus. The corpus comprises 600 newspaper excerpts. For each excerpt it contains the original source image, the machine transcription of that image as found in the BLN, and a gold-standard manual transcription.
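Because each excerpt pairs a machine transcription with a manual one, the two texts can be compared directly. The following is a minimal sketch of one way to do that, using character error rate based on Levenshtein distance. It is an illustration, not part of the corpus or its tooling: the `Excerpt` class, its field names, and the sample strings are hypothetical, and the sketch makes no assumption about how the corpus files are actually named or laid out.

```python
from dataclasses import dataclass


@dataclass
class Excerpt:
    """One corpus entry (hypothetical structure, not the corpus's own schema)."""
    image_path: str  # assumed location of the source image
    ocr_text: str    # machine transcription
    gold_text: str   # manual gold-standard transcription


def levenshtein(a: str, b: str) -> int:
    """Return the edit distance (insert, delete, substitute) between a and b."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            current.append(min(
                previous[j] + 1,                        # delete char_a
                current[j - 1] + 1,                     # insert char_b
                previous[j - 1] + (char_a != char_b),   # substitute or match
            ))
        previous = current
    return previous[-1]


def character_error_rate(excerpt: Excerpt) -> float:
    """Edit distance from machine to gold text, divided by gold length."""
    if not excerpt.gold_text:
        raise ValueError("gold transcription is empty")
    distance = levenshtein(excerpt.ocr_text, excerpt.gold_text)
    return distance / len(excerpt.gold_text)


if __name__ == "__main__":
    # Invented sample strings for illustration only, not taken from the corpus.
    sample = Excerpt(
        image_path="example.png",
        ocr_text="tbe prisoner",
        gold_text="the prisoner",
    )
    print(f"CER: {character_error_rate(sample):.3f}")
```

Dividing by the length of the gold text is a common convention, but other normalizations exist, so the denominator should be stated whenever scores are reported.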