Chinese Gigaword

TextsCustom

Chinese Gigaword corpus consists of 2.2M of headline-document pairs of news stories covering over 284 months from two Chinese newspapers, namely the Xinhua News Agency of China (XIN) and the Central News Agency of Taiwan (CNA).

Source: Order-Preserving Abstractive Summarization for Spoken Content Based on Connectionist Temporal Classification Image Source: https://catalog.ldc.upenn.edu/desc/addenda/LDC2011T13.jpg