Arman Cohan, Sergey Feldman, Iz Beltagy, Doug Downey, Daniel S. Weld
Representation learning is a critical ingredient for natural language processing systems. Recent Transformer language models like BERT learn powerful textual representations, but these models are targeted towards token- and sentence-level training objectives and do not leverage information on inter-document relatedness, which limits their document-level representation power. For applications on scientific documents, such as classification and recommendation, the embeddings power strong performance on end tasks. We propose SPECTER, a new method to generate document-level embeddings of scientific documents based on pretraining a Transformer language model on a powerful signal of document-level relatedness: the citation graph. Unlike existing pretrained language models, SPECTER can be easily applied to downstream applications without task-specific fine-tuning. Additionally, to encourage further research on document-level models, we introduce SciDocs, a new evaluation benchmark consisting of seven document-level tasks ranging from citation prediction to document classification and recommendation. We show that SPECTER outperforms a variety of competitive baselines on the benchmark.
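Because SPECTER produces off-the-shelf document embeddings without task-specific fine-tuning, a single forward pass over a paper's title and abstract is enough to obtain its representation. The sketch below is a minimal illustration (not the authors' official code), assuming the publicly released `allenai/specter` checkpoint on the Hugging Face Hub; the example paper titles and abstracts are placeholders.

```python
# Minimal sketch: embedding papers with the pretrained "allenai/specter"
# checkpoint. Title and abstract are joined with the tokenizer's [SEP] token,
# and the final-layer [CLS] vector is taken as the document embedding.
import torch
from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained("allenai/specter")
model = AutoModel.from_pretrained("allenai/specter")

# Placeholder papers for illustration only.
papers = [
    {"title": "Paper A", "abstract": "A short abstract describing paper A."},
    {"title": "Paper B", "abstract": "A short abstract describing paper B."},
]

# Concatenate title and abstract with the separator token.
texts = [p["title"] + tokenizer.sep_token + p["abstract"] for p in papers]
inputs = tokenizer(texts, padding=True, truncation=True,
                   max_length=512, return_tensors="pt")

with torch.no_grad():
    outputs = model(**inputs)

# The [CLS] token's final hidden state serves as the document-level embedding.
embeddings = outputs.last_hidden_state[:, 0, :]  # shape: (num_papers, 768)
print(embeddings.shape)
```

These embeddings can then be fed directly to a lightweight downstream model (e.g., a linear classifier or nearest-neighbor recommender) without fine-tuning the Transformer itself.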
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Representation Learning | SciDocs | Avg. | 80 | SPECTER |
| Representation Learning | SciDocs | Avg. | 76 | Citeomatic |
| Representation Learning | SciDocs | Avg. | 59.6 | SciBERT |
| Document Classification | SciDocs (MeSH) | F1 (micro) | 86.4 | SPECTER |
| Document Classification | SciDocs (MAG) | F1 (micro) | 82 | SPECTER |