TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Natural Language Processing/Sentence Ordering/EconLogicQA

Sentence Ordering on EconLogicQA

Metric: Accuracy (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕Accuracy▼Extra DataPaperDate↕Code
1GPT-4-Turbo0.5692NoEconLogicQA: A Question-Answering Benchmark for ...2024-05-13Code
2GPT-40.5538NoEconLogicQA: A Question-Answering Benchmark for ...2024-05-13Code
3GPT-3.5-Turbo0.3769NoEconLogicQA: A Question-Answering Benchmark for ...2024-05-13Code
4Llama-3-8B-Instruct0.3462NoEconLogicQA: A Question-Answering Benchmark for ...2024-05-13Code
5Mistral-7B-Instruct-v0.20.3154NoEconLogicQA: A Question-Answering Benchmark for ...2024-05-13Code
6Mistral-7B-v0.10.2615NoEconLogicQA: A Question-Answering Benchmark for ...2024-05-13Code
7Mistral-7B-v0.20.2615NoEconLogicQA: A Question-Answering Benchmark for ...2024-05-13Code
8Llama-3-8B0.2385NoEconLogicQA: A Question-Answering Benchmark for ...2024-05-13Code
9Zephyr-7B-Alpha0.2308NoEconLogicQA: A Question-Answering Benchmark for ...2024-05-13Code
10Yi-6B-Chat0.2077NoEconLogicQA: A Question-Answering Benchmark for ...2024-05-13Code
11Zephyr-7B-Beta0.1769NoEconLogicQA: A Question-Answering Benchmark for ...2024-05-13Code
12Mistral-7B-Instruct-v0.10.1538NoEconLogicQA: A Question-Answering Benchmark for ...2024-05-13Code
13Llama-2-13B-Chat0.1462NoEconLogicQA: A Question-Answering Benchmark for ...2024-05-13Code
14Llama-2-7B-Chat0.0923NoEconLogicQA: A Question-Answering Benchmark for ...2024-05-13Code
15Gemma-2B-IT0.0846NoEconLogicQA: A Question-Answering Benchmark for ...2024-05-13Code
16Yi-6B0.0385NoEconLogicQA: A Question-Answering Benchmark for ...2024-05-13Code
17Gemma-7B-IT0.0231NoEconLogicQA: A Question-Answering Benchmark for ...2024-05-13Code
18Llama-2-7B0.0077NoEconLogicQA: A Question-Answering Benchmark for ...2024-05-13Code