CLSE

Corpus of Linguistically Significant Entities

Texts | CC-BY | Introduced 2022-11-04

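CLSE is a corpus of linguistically significant entities annotated by linguist experts. It includes 34 languages and covers 74 different semantic types to support applications ranging from airline ticketing to video games. To demonstrate one possible use of CLSE, the authors also produce SG-CLSE, an augmented version of the Schema-Guided Dialog Dataset.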

Source: CLSE: Corpus of Linguistically Significant Entities

Image Source: https://arxiv.org/pdf/2211.02423v1.pdf