TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/SciBERT: A Pretrained Language Model for Scientific Text

SciBERT: A Pretrained Language Model for Scientific Text

Iz Beltagy, Kyle Lo, Arman Cohan

2019-03-26IJCNLP 2019 11Participant Intervention Comparison Outcome ExtractionMedical Named Entity RecognitionRelation ExtractionGeneral ClassificationNamed Entity Recognition (NER)Citation Intent ClassificationSentence ClassificationDependency ParsingLanguage Modelling
PaperPDFCodeCodeCodeCode(official)CodeCode

Abstract

Obtaining large-scale annotated data for NLP tasks in the scientific domain is challenging and expensive. We release SciBERT, a pretrained language model based on BERT (Devlin et al., 2018) to address the lack of high-quality, large-scale labeled scientific data. SciBERT leverages unsupervised pretraining on a large multi-domain corpus of scientific publications to improve performance on downstream scientific NLP tasks. We evaluate on a suite of tasks including sequence tagging, sentence classification and dependency parsing, with datasets from a variety of scientific domains. We demonstrate statistically significant improvements over BERT and achieve new state-of-the-art results on several of these tasks. The code and pretrained models are available at https://github.com/allenai/scibert/.

Results

TaskDatasetMetricValueModel
Relation ExtractionChemProtF183.64SciBert (Finetune)
Relation ExtractionChemProtF173.7SciBERT (Base Vocab)
Relation ExtractionSciERCF174.64SciBERT (SciVocab)
Relation ExtractionSciERCF174.42SciBERT (Base Vocab)
Relation ExtractionJNLPBAF176.09SciBERT (SciVocab)
Information ExtractionEBM-NLPF171.18SciBERT (SciVocab)
Information ExtractionEBM-NLPF170.82SciBERT (Base Vocab)
Dependency ParsingGENIA - UASF192.46SciBERT (SciVocab)
Dependency ParsingGENIA - UASF192.32SciBERT (Base Vocab)
Dependency ParsingGENIA - LASF191.41SciBERT (SciVocab)
Dependency ParsingGENIA - LASF191.26SciBERT (Base Vocab)
Named Entity Recognition (NER)NCBI-diseaseF186.88SciBERT (Base Vocab)
Named Entity Recognition (NER)NCBI-diseaseF186.45SciBERT (SciVocab)
Named Entity Recognition (NER)SciERCF167.57SciBERT (SciVocab)
Named Entity Recognition (NER)SciERCF165.24SciBERT (Base Vocab)
Named Entity Recognition (NER)BC5CDRF188.94SciBERT (SciVocab)
Named Entity Recognition (NER)BC5CDRF188.11SciBERT (Base Vocab)
Named Entity Recognition (NER)JNLPBAF175.77SciBERT (Base Vocab)
Text ClassificationACL-ARCF170.98SciBERT
Text ClassificationPaper FieldF165.71SciBERT (SciVocab)
Text ClassificationPaper FieldF164.02SciBERT (Base Vocab)
Text ClassificationScienceCiteF184.99SciBERT (SciVocab)
Text ClassificationScienceCiteF184.43SciBERT (Base Vocab)
Text ClassificationPubMed 20k RCTF186.81SciBERT (Base Vocab)
Text ClassificationSciCiteF184.9SciBERT
Text ClassificationSciCiteMacro-F186.32SciBERT
Participant Intervention Comparison Outcome ExtractionEBM-NLPF171.18SciBERT (SciVocab)
Participant Intervention Comparison Outcome ExtractionEBM-NLPF170.82SciBERT (Base Vocab)
Sentence ClassificationACL-ARCF170.98SciBERT
Sentence ClassificationPaper FieldF165.71SciBERT (SciVocab)
Sentence ClassificationPaper FieldF164.02SciBERT (Base Vocab)
Sentence ClassificationScienceCiteF184.99SciBERT (SciVocab)
Sentence ClassificationScienceCiteF184.43SciBERT (Base Vocab)
Sentence ClassificationPubMed 20k RCTF186.81SciBERT (Base Vocab)
Sentence ClassificationSciCiteF184.9SciBERT
ClassificationACL-ARCF170.98SciBERT
ClassificationPaper FieldF165.71SciBERT (SciVocab)
ClassificationPaper FieldF164.02SciBERT (Base Vocab)
ClassificationScienceCiteF184.99SciBERT (SciVocab)
ClassificationScienceCiteF184.43SciBERT (Base Vocab)
ClassificationPubMed 20k RCTF186.81SciBERT (Base Vocab)
ClassificationSciCiteF184.9SciBERT
ClassificationSciCiteMacro-F186.32SciBERT

Related Papers

Visual-Language Model Knowledge Distillation Method for Image Quality Assessment2025-07-21Making Language Model a Hierarchical Classifier and Generator2025-07-17VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning2025-07-17The Generative Energy Arena (GEA): Incorporating Energy Awareness in Large Language Model (LLM) Human Evaluations2025-07-17Inverse Reinforcement Learning Meets Large Language Model Post-Training: Basics, Advances, and Opportunities2025-07-17Assay2Mol: large language model-based drug design using BioAssay context2025-07-16Describe Anything Model for Visual Question Answering on Text-rich Images2025-07-16InstructFLIP: Exploring Unified Vision-Language Model for Face Anti-spoofing2025-07-16