ASPEC

Asian Scientific Paper Excerpt Corpus

TextsCustom (non-commercial)Introduced 2016-01-01

ASPEC, Asian Scientific Paper Excerpt Corpus, is constructed by the Japan Science and Technology Agency (JST) in collaboration with the National Institute of Information and Communications Technology (NICT). It consists of a Japanese-English paper abstract corpus of 3M parallel sentences (ASPEC-JE) and a Japanese-Chinese paper excerpt corpus of 680K parallel sentences (ASPEC-JC). This corpus is one of the achievements of the Japanese-Chinese machine translation project which was run in Japan from 2006 to 2010.

Source: ASPEC Image Source: https://www.aclweb.org/anthology/L16-1350.pdf