Neural sentence embedding models for semantic similarity estimation in the biomedical domain

Kathrin Blagec, Hong Xu, Asan Agibetov, Matthias Samwald

2021-10-01Sentence Embeddings For Biomedical Texts Sentence Embedding Semantic Similarity Semantic Textual Similarity Sentence-Embedding

Paper PDF Code(official)

Abstract

BACKGROUND: In this study, we investigated the efficacy of current state-of-the-art neural sentence embedding models for semantic similarity estimation of sentences from biomedical literature. We trained different neural embedding models on 1.7 million articles from the PubMed Open Access dataset, and evaluated them based on a biomedical benchmark set containing 100 sentence pairs annotated by human experts and a smaller contradiction subset derived from the original benchmark set. RESULTS: With a Pearson correlation of 0.819, our best unsupervised model based on the Paragraph Vector Distributed Memory algorithm outperforms previous state-of-the-art results achieved on the BIOSSES biomedical benchmark set. Moreover, our proposed supervised model that combines different string-based similarity metrics with a neural embedding model surpasses previous ontology-dependent supervised state-of-the-art approaches in terms of Pearson's r (r=0.871) on the biomedical benchmark set. In contrast to the promising results for the original benchmark, we found our best models' performance on the smaller contradiction subset to be poor. CONCLUSIONS: In this study we highlighted the value of neural network-based models for semantic similarity estimation in the biomedical domain by showing that they can keep up with and even surpass previous state-of-the-art approaches for semantic similarity estimation that depend on the availability of laboriously curated ontologies when evaluated on a biomedical benchmark set. Capturing contradictions and negations in biomedical sentences, however, emerged as an essential area for further work.

Results

Task	Dataset	Metric	Value	Model
Sentence Embeddings	BIOSSES	Pearson Correlation	0.871	Supervised combination of: Jaccard, Q-gram, sent2vec, Paragraph vector DM, skip-thoughts, fastText
Sentence Embeddings	BIOSSES	Pearson Correlation	0.846	Unsupervised combination (mean) of: Jaccard, q-gram, Paragraph vector (PV-DBOW) and sent2vec
Sentence Embeddings	BIOSSES	Pearson Correlation	0.819	Paragraph vector (PV-DM)
Sentence Embeddings	BIOSSES	Pearson Correlation	0.804	Paragraph vector (PV-DBOW)
Sentence Embeddings	BIOSSES	Pearson Correlation	0.798	Sent2vec
Sentence Embeddings	BIOSSES	Pearson Correlation	0.766	fastText (skip-gram, max pooling)
Sentence Embeddings	BIOSSES	Pearson Correlation	0.723	Q-gram (q = 3)
Sentence Embeddings	BIOSSES	Pearson Correlation	0.485	Skip-thoughts
Sentence Embeddings	BIOSSES	Pearson Correlation	0.253	fastText (CBOW, max pooling)
Representation Learning	BIOSSES	Pearson Correlation	0.871	Supervised combination of: Jaccard, Q-gram, sent2vec, Paragraph vector DM, skip-thoughts, fastText
Representation Learning	BIOSSES	Pearson Correlation	0.846	Unsupervised combination (mean) of: Jaccard, q-gram, Paragraph vector (PV-DBOW) and sent2vec
Representation Learning	BIOSSES	Pearson Correlation	0.819	Paragraph vector (PV-DM)
Representation Learning	BIOSSES	Pearson Correlation	0.804	Paragraph vector (PV-DBOW)
Representation Learning	BIOSSES	Pearson Correlation	0.798	Sent2vec
Representation Learning	BIOSSES	Pearson Correlation	0.766	fastText (skip-gram, max pooling)
Representation Learning	BIOSSES	Pearson Correlation	0.723	Q-gram (q = 3)
Representation Learning	BIOSSES	Pearson Correlation	0.485	Skip-thoughts
Representation Learning	BIOSSES	Pearson Correlation	0.253	fastText (CBOW, max pooling)

Abstract

Results

Task	Dataset	Metric	Value	Model
Sentence Embeddings	BIOSSES	Pearson Correlation	0.871	Supervised combination of: Jaccard, Q-gram, sent2vec, Paragraph vector DM, skip-thoughts, fastText
Sentence Embeddings	BIOSSES	Pearson Correlation	0.846	Unsupervised combination (mean) of: Jaccard, q-gram, Paragraph vector (PV-DBOW) and sent2vec
Sentence Embeddings	BIOSSES	Pearson Correlation	0.819	Paragraph vector (PV-DM)
Sentence Embeddings	BIOSSES	Pearson Correlation	0.804	Paragraph vector (PV-DBOW)
Sentence Embeddings	BIOSSES	Pearson Correlation	0.798	Sent2vec
Sentence Embeddings	BIOSSES	Pearson Correlation	0.766	fastText (skip-gram, max pooling)
Sentence Embeddings	BIOSSES	Pearson Correlation	0.723	Q-gram (q = 3)
Sentence Embeddings	BIOSSES	Pearson Correlation	0.485	Skip-thoughts
Sentence Embeddings	BIOSSES	Pearson Correlation	0.253	fastText (CBOW, max pooling)
Representation Learning	BIOSSES	Pearson Correlation	0.871	Supervised combination of: Jaccard, Q-gram, sent2vec, Paragraph vector DM, skip-thoughts, fastText
Representation Learning	BIOSSES	Pearson Correlation	0.846	Unsupervised combination (mean) of: Jaccard, q-gram, Paragraph vector (PV-DBOW) and sent2vec
Representation Learning	BIOSSES	Pearson Correlation	0.819	Paragraph vector (PV-DM)
Representation Learning	BIOSSES	Pearson Correlation	0.804	Paragraph vector (PV-DBOW)
Representation Learning	BIOSSES	Pearson Correlation	0.798	Sent2vec
Representation Learning	BIOSSES	Pearson Correlation	0.766	fastText (skip-gram, max pooling)
Representation Learning	BIOSSES	Pearson Correlation	0.723	Q-gram (q = 3)
Representation Learning	BIOSSES	Pearson Correlation	0.485	Skip-thoughts
Representation Learning	BIOSSES	Pearson Correlation	0.253	fastText (CBOW, max pooling)

Neural sentence embedding models for semantic similarity estimation in the biomedical domain

Abstract

Results

Related Papers

Neural sentence embedding models for semantic similarity estimation in the biomedical domain

Abstract

Results

Related Papers