Papers With Code 2 | ML Benchmarks, SotA Results & Code

This shared task will examine automatic evaluation metrics for machine translation. The goals of the shared metrics task are:

To achieve the strongest correlation with human judgement of translation quality; To illustrate the suitability of an automatic evaluation metric as a surrogate for human evaluation; To address problems associated with comparison with a single reference translation; To move automatic evaluation beyond system-level ranking to finer-grained sentence-level ranking.

All datasets for this task are available here.

WMT 2019 Metrics Task