MixText

Natural Language ProcessingIntroduced 20003 papers

Description

MixText is a semi-supervised learning method for text classification, which uses a new data augmentation method called TMix. TMix creates a large amount of augmented training samples by interpolating text in hidden space. The technique leverages advances in data augmentation to guess low-entropy labels for unlabeled data, making them as easy to use as labeled data.

Papers Using This Method

FPMT: Enhanced Semi-Supervised Model for Traffic Incident Detection2024-09-12 LLM-as-a-Coauthor: Can Mixed Human-Written and Machine-Generated Text Be Detected?2024-01-11 MixText: Linguistically-Informed Interpolation of Hidden Space for Semi-Supervised Text Classification2020-04-25