This is not a Dataset

This is not a Dataset: A Large Negation Benchmark to Challenge Large Language Models

Texts · apache-2.0 · Introduced 2023-10-24

We introduce a large, semi-automatically generated dataset of ~400,000 descriptive sentences about commonsense knowledge, each of which can be true or false. Negation is present, in different forms, in about 2/3 of the corpus. We use the dataset to evaluate LLMs.
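As a rough illustration of how a true/false benchmark of this kind might be scored, the sketch below reports accuracy separately for sentences with and without negation. It is a minimal sketch, not the authors' evaluation code: the record fields (`sentence`, `has_negation`, `label`), the toy sentences, and the `always_true` baseline are all hypothetical and are not taken from the dataset.

```python
from collections import defaultdict

def accuracy_by_negation(records, predict):
    """Return accuracy split by whether a sentence contains negation.

    `records` is an iterable of dicts with hypothetical keys
    'sentence' (str), 'has_negation' (bool) and 'label' (bool).
    `predict` is any callable mapping a sentence to True/False.
    """
    totals = defaultdict(int)
    correct = defaultdict(int)
    for rec in records:
        group = "negated" if rec["has_negation"] else "affirmative"
        totals[group] += 1
        if predict(rec["sentence"]) == rec["label"]:
            correct[group] += 1
    return {group: correct[group] / totals[group] for group in totals}

# Toy records invented for illustration only; not drawn from the dataset.
toy_records = [
    {"sentence": "A dog is an animal.", "has_negation": False, "label": True},
    {"sentence": "A dog is not an animal.", "has_negation": True, "label": False},
    {"sentence": "A dog is a plant.", "has_negation": False, "label": False},
    {"sentence": "A dog is not a plant.", "has_negation": True, "label": True},
]

def always_true(sentence):
    """Trivial baseline standing in for a model: answers True for everything."""
    return True

scores = accuracy_by_negation(toy_records, always_true)
print(scores)
```

In practice `predict` would wrap a call to the model under evaluation. Reporting the two groups separately makes it visible when a model does well on affirmative sentences but degrades on negated ones.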