Industrial Robots on ToolBench

Metric: Win rate (higher is better)

LeaderboardDataset
Loading chart...
#ModelWin rateExtra DataPaperDateCode
1GPT4-TOPGUN86.54NoSwissNYF: Tool Grounded LLM Agents for Black Box...2024-02-15Code
2Attention Bucket71.5NoFortify the Shortest Stave in Attention: Enhanci...2023-12-07Code
3GPT4- DFSDT70.4NoToolLLM: Facilitating Large Language Models to M...2023-07-31Code