Nguyen et al. (2021)

Reported on 4 benchmarks across 1 task · 1 paper

Note: results are matched by exact model name. Different papers may use the same name for different model variants.

Natural Language Processing4 results

Discourse ParsingonRST-DT
Standard Parseval (Full)· 2021-05-23
46.6
best: 58.1 (Bottom-up Llama 2 (70B))
RST Parsing from Scratch arXiv:2105.10861
Discourse ParsingonRST-DT
Standard Parseval (Nuclearity)· 2021-05-23
59.1
best: 70.4 (Bottom-up Llama 2 (70B))
RST Parsing from Scratch arXiv:2105.10861
Discourse ParsingonRST-DT
Standard Parseval (Relation)· 2021-05-23
47.8
best: 60 (Bottom-up Llama 2 (70B))
RST Parsing from Scratch arXiv:2105.10861
Discourse ParsingonRST-DT
Standard Parseval (Span)· 2021-05-23
68.4
best: 79.8 (Bottom-up Llama 2 (70B))
RST Parsing from Scratch arXiv:2105.10861