Metric: F1-Score (higher is better)
| # | Model↕ | F1-Score▼ | Extra Data | Paper | Date↕ | Code |
|---|---|---|---|---|---|---|
| 1 | GPT-4_10_example_values_&_10_demonstrations | 90.54 | No | Using LLMs for the Extraction and Normalization ... | 2024-03-04 | Code |
| 2 | GPT-3.5_10_example_values_&_10_demonstrations | 88.02 | No | Using LLMs for the Extraction and Normalization ... | 2024-03-04 | Code |
| 3 | AVEQA | 80.83 | No | Using LLMs for the Extraction and Normalization ... | 2024-03-04 | Code |
| 4 | MAVEQA | 65.1 | No | Using LLMs for the Extraction and Normalization ... | 2024-03-04 | Code |
| 5 | SU-OpenTag | 60.44 | No | Using LLMs for the Extraction and Normalization ... | 2024-03-04 | Code |