TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Natural Language Processing/Dialogue Generation

Dialogue Generation

42 benchmarks606 papers

Dialogue generation is the task of "understanding" natural language inputs - within natural language processing in order to produce output. The systems are usually intended for conversing with humans, for instance back and forth dialogue with a conversation agent like a chatbot. Some example benchmarks for this task (see others such as Natural Language Understanding) include FusedChat and Ubuntu DIalogue Corpus (UDC). Models can be evaluated via metrics such as BLEU, ROUGE, and METEOR albeit with challenges in terms of weak correlation with human judgement, that may be addressed by new ones like UnSupervised and Reference-free (USR) and Metric for automatic Unreferenced dialog evaluation (MaUde).

Benchmarks

Dialogue Generation on Persona-Chat

Avg F1BLEU-1CIDrMETEORROUGE-L

Dialogue Generation on OpenViDial 2.0

BLEUDis-1Dis-2Dis-3Dis-4

Dialogue Generation on FusedChat

Slot AccuracyJoint SAInformInform_mctSuccessSuccess_mctBLEUPPLSensiblenessSpecificitySSA

Dialogue Generation on Harry Potter Dialogue Dataset

mauve

Dialogue Generation on Amazon-5

1 in 10 R@2

Dialogue Generation on CMU-DoG

F1MeteorROUGE-1Rouge-L

Dialogue Generation on PG-19

Perplexity

Dialogue Generation on Reddit (multi-ref)

interest (human)relevance (human)

Dialogue Generation on Twitter Dialogue (Noun)

F1PrecisionRecall

Dialogue Generation on Ubuntu Dialogue (Activity)

F1PrecisionRecall

Dialogue Generation on Ubuntu Dialogue (Entity)

F1PrecisionRecall

Dialogue Generation on Twitter Dialogue (Tense)

Accuracy

Dialogue Generation on Ubuntu Dialogue (Cmd)

Accuracy

Dialogue Generation on Ubuntu Dialogue (Tense)

Accuracy