D-HAT
Reported on 4 benchmarks across 2 tasks
Note: results are matched by exact model name. Different papers may use the same name for different model variants.
Knowledge Base2 results
- F1 (%)53.9best: 95.78 (gpt4-0613_zeroshot)
- F1 (%)67.5best: 85.21 (gpt4-0613_fewshot-10)
Natural Language Processing2 results
- F1 (%)53.9best: 95.78 (gpt4-0613_zeroshot)
- F1 (%)67.5best: 85.21 (gpt4-0613_fewshot-10)