DMRST (2021)
Reported on 4 benchmarks across 1 task · 1 paper
Note: results are matched by exact model name. Different papers may use the same name for different model variants.
Natural Language Processing4 results
- Standard Parseval (Full)· 2021-10-0948.6best: 58.1 (Bottom-up Llama 2 (70B))
- Standard Parseval (Nuclearity)· 2021-10-0959.4best: 70.4 (Bottom-up Llama 2 (70B))
- Standard Parseval (Relation)· 2021-10-0949.4best: 60 (Bottom-up Llama 2 (70B))
- Standard Parseval (Span)· 2021-10-0969.8best: 79.8 (Bottom-up Llama 2 (70B))