Papers With Code

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.


Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


On the Effectiveness of Compact Biomedical Transformers

Omid Rohanian, Mohammadmahdi Nouriborji, Samaneh Kouchaki, David A. Clifton

2022-09-07 · Continual Learning · Knowledge Distillation · Named Entity Recognition (NER) · Language Modelling

Abstract

Language models pre-trained on biomedical corpora, such as BioBERT, have recently shown promising results on downstream biomedical tasks. Many existing pre-trained models, on the other hand, are resource-intensive and computationally heavy owing to factors such as embedding size, hidden dimension, and number of layers. The natural language processing (NLP) community has developed numerous strategies to compress these models utilising techniques such as pruning, quantisation, and knowledge distillation, resulting in models that are considerably faster, smaller, and subsequently easier to use in practice. By the same token, in this paper we introduce six lightweight models, namely, BioDistilBERT, BioTinyBERT, BioMobileBERT, DistilBioBERT, TinyBioBERT, and CompactBioBERT, which are obtained either by knowledge distillation from a biomedical teacher or continual learning on the PubMed dataset via the Masked Language Modelling (MLM) objective. We evaluate all of our models on three biomedical tasks and compare them with BioBERT-v1.1 to create efficient lightweight models that perform on par with their larger counterparts. All the models will be publicly available on our Hugging Face profile at https://huggingface.co/nlpie and the code used to run the experiments will be available at https://github.com/nlpie-research/Compact-Biomedical-Transformers.
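The abstract states that some of the models are obtained by knowledge distillation from a biomedical teacher. As a hedged illustration of that idea (not the paper's actual training code, and without its layer-wise or attention-based terms), the standard soft-target distillation loss is a temperature-scaled KL divergence between the teacher's and student's output distributions, scaled by T² so gradient magnitudes stay comparable across temperatures:

```python
import math

def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax over a list of raw logits."""
    scaled = [x / temperature for x in logits]
    m = max(scaled)  # subtract max for numerical stability
    exps = [math.exp(x - m) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def distillation_loss(teacher_logits, student_logits, temperature=2.0):
    """Soft-target distillation loss: T^2 * KL(teacher || student)
    on temperature-softened distributions (Hinton et al. style)."""
    p = softmax(teacher_logits, temperature)  # teacher soft targets
    q = softmax(student_logits, temperature)  # student predictions
    kl = sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)
    return temperature ** 2 * kl
```

A student whose logits match the teacher's incurs zero loss; the more its distribution diverges, the larger the penalty. In practice this term is typically mixed with the hard-label (e.g. MLM) cross-entropy.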

Results

Task | Dataset | Metric | Value | Model
Named Entity Recognition (NER) | NCBI-disease | F1 | 88.67 | CompactBioBERT
Named Entity Recognition (NER) | NCBI-disease | F1 | 87.93 | DistilBioBERT
Named Entity Recognition (NER) | NCBI-disease | F1 | 87.61 | BioDistilBERT
Named Entity Recognition (NER) | NCBI-disease | F1 | 87.21 | BioMobileBERT
Named Entity Recognition (NER) | BC5CDR-chemical | F1 | 94.53 | DistilBioBERT
Named Entity Recognition (NER) | BC5CDR-chemical | F1 | 94.48 | BioDistilBERT
Named Entity Recognition (NER) | BC5CDR-chemical | F1 | 94.31 | CompactBioBERT
Named Entity Recognition (NER) | BC5CDR-chemical | F1 | 94.23 | BioMobileBERT
Named Entity Recognition (NER) | BC5CDR-disease | F1 | 85.61 | BioDistilBERT
Named Entity Recognition (NER) | BC5CDR-disease | F1 | 85.42 | DistilBioBERT
Named Entity Recognition (NER) | BC5CDR-disease | F1 | 85.38 | CompactBioBERT
Named Entity Recognition (NER) | BC5CDR-disease | F1 | 84.62 | BioMobileBERT
Named Entity Recognition (NER) | BC2GM | F1 | 86.97 | BioDistilBERT
Named Entity Recognition (NER) | BC2GM | F1 | 86.71 | CompactBioBERT
Named Entity Recognition (NER) | BC2GM | F1 | 86.60 | DistilBioBERT
Named Entity Recognition (NER) | BC2GM | F1 | 85.26 | BioMobileBERT
Named Entity Recognition (NER) | JNLPBA | F1 | 80.13 | BioMobileBERT
Named Entity Recognition (NER) | JNLPBA | F1 | 79.97 | DistilBioBERT
Named Entity Recognition (NER) | JNLPBA | F1 | 79.88 | CompactBioBERT
Named Entity Recognition (NER) | JNLPBA | F1 | 79.10 | BioDistilBERT

Related Papers

- Visual-Language Model Knowledge Distillation Method for Image Quality Assessment (2025-07-21)
- Uncertainty-Aware Cross-Modal Knowledge Distillation with Prototype Learning for Multimodal Brain-Computer Interfaces (2025-07-17)
- Making Language Model a Hierarchical Classifier and Generator (2025-07-17)
- VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning (2025-07-17)
- The Generative Energy Arena (GEA): Incorporating Energy Awareness in Large Language Model (LLM) Human Evaluations (2025-07-17)
- Inverse Reinforcement Learning Meets Large Language Model Post-Training: Basics, Advances, and Opportunities (2025-07-17)
- RegCL: Continual Adaptation of Segment Anything Model via Model Merging (2025-07-16)
- Information-Theoretic Generalization Bounds of Replay-based Continual Learning (2025-07-16)