BIOSSES

Biomedical Semantic Similarity Estimation System

MedicalTextsIntroduced 2017-07-15

The BIOSSES data set comprises total 100 sentence pairs all of which were selected from the "TAC2 Biomedical Summarization Track Training Data Set" .

The sentence pairs were evaluated by five different human experts that judged their similarity and gave scores in a range [0-4]. Our guideline was prepared based on SemEval 2012 Task 6 Guideline.

Image source: BIOSSES