TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Reasoning/Human Judgment Correlation/Flickr8k-Expert

Human Judgment Correlation on Flickr8k-Expert

Metric: Kendall's Tau-c (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕Kendall's Tau-c▼Extra DataPaperDate↕Code
1MID54.9NoMutual Information Divergence: A Unified Metric ...2022-05-25Code
2SoftSPICE54.2NoFACTUAL: A Benchmark for Faithful and Consistent...2023-05-27Code
3RefCLIP-S53NoCLIPScore: A Reference-free Evaluation Metric fo...2021-04-18Code
4CLIP-S51.2NoCLIPScore: A Reference-free Evaluation Metric fo...2021-04-18Code