Trajectory Planning on ToolBench
Metric: Win rate (higher is better)
LeaderboardDataset
Loading chart...
Results
Submit a result| # | Model↕ | Win rate▼ | Extra Data | Paper | Date↕ | Code |
|---|---|---|---|---|---|---|
| 1 | GPT4-TOPGUN | 86.54 | No | SwissNYF: Tool Grounded LLM Agents for Black Box... | 2024-02-15 | Code |
| 2 | Attention Bucket | 71.5 | No | Fortify the Shortest Stave in Attention: Enhanci... | 2023-12-07 | Code |
| 3 | GPT4- DFSDT | 70.4 | No | ToolLLM: Facilitating Large Language Models to M... | 2023-07-31 | Code |