ToT

Reported on 2 benchmarks across 1 task · 2 papers · 1 SOTA

Note: results are matched by exact model name. Different papers may use the same name for different model variants.

Natural Language Processing3 results

Question AnsweringonTruthfulQA
EM· 2023-05-17
66.6
best: 67.3 (CoA)
SOTA
Tree of Thoughts: Deliberate Problem Solving with Large Language Models arXiv:2305.10601
Question AnsweringonWebQuestions
EM· 2024-03-26
26.3
best: 84.6 (PoG-GPT4 (Tan et al., 2024))
Chain-of-Action: Faithful and Multimodal Question Answering through Large Language Models arXiv:2403.17359
Question AnsweringonWebQuestions
EM· 2024-03-26
26.3
best: 84.6 (PoG-GPT4 (Tan et al., 2024))
Chain-of-Action: Faithful and Multimodal Question Answering through Large Language Models arXiv:2403.17359