CLSE
Corpus of Linguistically Significant Entities
TextsCC-BYIntroduced 2022-11-04
CLSE is an augmented version of the Schema-Guided Dialog Dataset. The corpus includes 34 languages and covers 74 different semantic types to support various applications from airline ticketing to video games.
Source: CLSE: Corpus of Linguistically Significant Entities
Image Source: https://arxiv.org/pdf/2211.02423v1.pdf