Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.


Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners

Seonghyeon Ye, Doyoung Kim, Joel Jang, Joongbo Shin, Minjoon Seo

2022-10-06

Tasks: Question Answering · Sentence Completion · Coreference Resolution · Natural Language Inference · Common Sense Reasoning · Natural Language Inference (Zero-Shot) · Word Sense Disambiguation · Language Modelling

Paper · PDF · Code (official)

Abstract

Meta-training, which fine-tunes the language model (LM) on various downstream tasks by maximizing the likelihood of the target label given the task instruction and input instance, has improved zero-shot task generalization performance. However, meta-trained LMs still struggle to generalize to challenging tasks containing novel labels unseen during meta-training. In this paper, we propose Flipped Learning, an alternative method of meta-training which trains the LM to generate the task instruction given the input instance and label. During inference, the LM trained with Flipped Learning, referred to as Flipped, selects the label option that is most likely to generate the task instruction. On 14 tasks of the BIG-bench benchmark, the 11B-sized Flipped outperforms zero-shot T0-11B and even a 16 times larger 3-shot GPT-3 (175B) on average by 8.4 and 9.7 percentage points, respectively. Flipped gives particularly large improvements on tasks with unseen labels, outperforming T0-11B by up to +20% average F1 score. This indicates that the strong task generalization of Flipped comes from improved generalization to novel labels. We release our code at https://github.com/seonghyeonye/Flipped-Learning.
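The inference rule described in the abstract (score each label option by how likely the instruction is given the input and that label, then pick the argmax) can be sketched as follows. This is a minimal illustration, not the authors' implementation: `instruction_loglik` is a hypothetical stand-in for a real seq2seq LM call (e.g. scoring P(instruction | input, label) with a T5-family model), and the toy word-overlap likelihood exists only to make the sketch runnable.

```python
# Flipped inference sketch: choose the label whose (input, label) pair
# makes the task instruction most likely under the LM.
from typing import Callable, Sequence


def flipped_predict(
    instruction: str,
    input_text: str,
    label_options: Sequence[str],
    instruction_loglik: Callable[[str, str, str], float],
) -> str:
    """Return the label option that best 'explains' the instruction."""
    return max(
        label_options,
        key=lambda label: instruction_loglik(instruction, input_text, label),
    )


# Toy stand-in for log P(instruction | input, label): rewards labels that
# share tokens with the instruction. A real system would replace this with
# an LM forward pass over the instruction tokens.
def toy_loglik(instruction: str, input_text: str, label: str) -> float:
    instr_tokens = set(instruction.lower().split())
    return float(len(instr_tokens & set(label.lower().split())))


pred = flipped_predict(
    "Is the second sentence entailed by the first?",
    "A man is sleeping. A person is asleep.",
    ["entailed", "contradicted"],
    toy_loglik,
)
print(pred)  # the option sharing a token with the instruction wins
```

Note the direction of conditioning is the only change from standard meta-training: the label moves from the output side to the input side, so at test time unseen label strings are conditioned on rather than generated.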

Results

Task                      | Dataset                    | Metric   | Value | Model
--------------------------|----------------------------|----------|-------|-----------
Question Answering        | COPA                       | Accuracy | 89.88 | Flipped-3B
Question Answering        | StoryCloze                 | Accuracy | 95.88 | Flipped-3B
Common Sense Reasoning    | WinoGrande                 | Accuracy | 58.56 | Flipped-3B
Word Sense Disambiguation | Words in Context           | Accuracy | 50.42 | Flipped-3B
Natural Language Inference| ANLI test                  | A1       | 39.99 | Flipped-3B
Natural Language Inference| ANLI test                  | A2       | 37.05 | Flipped-3B
Natural Language Inference| ANLI test                  | A3       | 37.73 | Flipped-3B
Natural Language Inference| RTE                        | Accuracy | 71.05 | Flipped-3B
Coreference Resolution    | Winograd Schema Challenge  | Accuracy | 58.37 | Flipped-3B
Sentence Completion       | HellaSwag                  | Accuracy | 41.60 | Flipped-3B

Related Papers

2025-07-21 · Visual-Language Model Knowledge Distillation Method for Image Quality Assessment
2025-07-17 · From Roots to Rewards: Dynamic Tree Reasoning with RL
2025-07-17 · Enter the Mind Palace: Reasoning and Planning for Long-term Active Embodied Question Answering
2025-07-17 · Vision-and-Language Training Helps Deploy Taxonomic Knowledge but Does Not Fundamentally Alter It
2025-07-17 · City-VLM: Towards Multidomain Perception Scene Understanding via Multimodal Incomplete Learning
2025-07-17 · Comparing Apples to Oranges: A Dataset & Analysis of LLM Humour Understanding from Traditional Puns to Topical Jokes
2025-07-17 · Making Language Model a Hierarchical Classifier and Generator
2025-07-17 · VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning