Hinglish-TOP
TextsApache-2.0 licenseIntroduced 2022-11-14
Hinglish-TOP is a human annotated code-switched semantic parsing dataset containing 10k human annotations for Hindi-English (HINGLISH) code switched utterances, and over 170K CST5 generated code-switched utterances from the TOPv2 dataset.
Source: CST5: Data Augmentation for Code-Switched Semantic Parsing
Image Source: https://arxiv.org/pdf/2211.07514v1.pdf