Trans-Encoder

General · Introduced 2000 · 1 paper

Description

Unsupervised knowledge distillation from a pretrained language model to itself, by alternating between its bi- and cross-encoder forms.
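
The alternation described above can be illustrated with a toy sketch. Everything in it is an assumption made for illustration and not the published implementation: the "sentences" are random feature vectors, the bi-encoder is a bilinear form over independently encoded inputs, the cross-encoder is a linear model over joint pair features, and each distillation step is a least-squares fit to the other form's pseudo-labels. Unlike the single pretrained language model the description refers to, the two toy forms do not share parameters. The names `bi_score`, `cross_score`, `fit_bi`, `fit_cross`, and `self_distill` are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)
DIM, N_PAIRS = 8, 200

# Toy stand-ins for unlabeled sentence pairs (assumption: random features).
left = rng.normal(size=(N_PAIRS, DIM))
right = left + 0.5 * rng.normal(size=(N_PAIRS, DIM))


def bi_score(M, a, b):
    """Bi-encoder form: each input is encoded on its own, then compared."""
    return np.einsum("nd,de,ne->n", a, M, b)


def cross_features(a, b):
    """Joint features of a pair, standing in for encoding both inputs together."""
    return np.hstack([a * b, np.abs(a - b), np.ones((len(a), 1))])


def cross_score(w, a, b):
    """Cross-encoder form: one score computed from the joint pair features."""
    return cross_features(a, b) @ w


def fit_cross(a, b, targets):
    """Distill pseudo-labels into the cross-encoder by least squares."""
    return np.linalg.lstsq(cross_features(a, b), targets, rcond=None)[0]


def fit_bi(a, b, targets):
    """Distill pseudo-labels into the bi-encoder by least squares."""
    outer = np.einsum("nd,ne->nde", a, b).reshape(len(a), -1)
    flat = np.linalg.lstsq(outer, targets, rcond=None)[0]
    return flat.reshape(DIM, DIM)


def self_distill(a, b, cycles=3):
    """Alternate: bi-encoder labels train the cross-encoder, then the reverse."""
    M = np.eye(DIM) / DIM  # stand-in for the starting "pretrained" bi-encoder
    w = None
    for _ in range(cycles):
        w = fit_cross(a, b, bi_score(M, a, b))     # bi -> cross
        M = fit_bi(a, b, cross_score(w, a, b))     # cross -> bi
    return M, w


M, w = self_distill(left, right)
bi_final = bi_score(M, left, right)
cross_final = cross_score(w, left, right)
print("correlation between the two forms:", np.corrcoef(bi_final, cross_final)[0, 1])
```

No labeled pairs are used at any point: each form is trained only on scores produced by the other, which is the sense in which the distillation is unsupervised.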

Papers Using This Method