Odd One Out

1 benchmark, 21 papers

This task tests how well a language model can identify the odd word out, i.e. the word that does not belong with the others in a group.

Source: BIG-bench
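
To make the task concrete, here is a minimal sketch of how an odd-one-out item could be scored by accuracy. The items, the `accuracy` helper, and the `last_word_baseline` predictor are all hypothetical and written for illustration; they are not taken from the BIG-bench data or its evaluation code.

```python
from typing import Callable, Dict, List, Sequence

# Hypothetical items for illustration only; not taken from the BIG-bench data.
ITEMS: List[Dict] = [
    {"words": ["apple", "banana", "cherry", "hammer"], "odd": "hammer"},
    {"words": ["run", "jump", "swim", "table"], "odd": "table"},
]


def accuracy(predict: Callable[[Sequence[str]], str], items: List[Dict]) -> float:
    """Fraction of items where the predicted word equals the labeled odd word."""
    if not items:
        return 0.0
    correct = sum(predict(item["words"]) == item["odd"] for item in items)
    return correct / len(items)


def last_word_baseline(words: Sequence[str]) -> str:
    """Trivial stand-in for a language model: always pick the last word."""
    return words[-1]


print(accuracy(last_word_baseline, ITEMS))
```

In practice `predict` would wrap a call to a language model; the baseline here only shows the shape of the interface.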

Benchmarks

Odd One Out on BIG-bench