Tasks
SotA
Datasets
Papers
Methods
Submit
About
SotA
/
Natural Language Processing
/
Conversational Web Navigation
/
WebLINX
Conversational Web Navigation on WebLINX
Metric: Overall score (higher is better)
Leaderboard
Dataset
Loading chart...
Results
Submit a result
Export CSV
Sort:
Overall score (best first)
Overall score (worst first)
Date (newest first)
Date (oldest first)
Model name (A→Z)
#
Model
↕
Overall score
▼
Extra Data
Paper
Date
↕
Code
1
Llama-2-13B
25.21
No
WebLINX: Real-World Website Navigation with Mult...
2024-02-08
Code
2
S-LLaMA-2.7B
25.02
No
WebLINX: Real-World Website Navigation with Mult...
2024-02-08
Code
3
Llama-2-7B
24.57
No
WebLINX: Real-World Website Navigation with Mult...
2024-02-08
Code
4
Flan-T5-3B
23.77
No
WebLINX: Real-World Website Navigation with Mult...
2024-02-08
Code
5
S-LLaMA-1.3B
23.73
No
WebLINX: Real-World Website Navigation with Mult...
2024-02-08
Code
6
GPT-3.5F
21.22
No
WebLINX: Real-World Website Navigation with Mult...
2024-02-08
Code
7
MindAct-3B
20.94
No
WebLINX: Real-World Website Navigation with Mult...
2024-02-08
Code
8
Fuyu-8B
19.97
No
WebLINX: Real-World Website Navigation with Mult...
2024-02-08
Code
9
Flan-T5-780M
17.27
No
WebLINX: Real-World Website Navigation with Mult...
2024-02-08
Code
10
Pix2Act-1.3B
16.88
No
WebLINX: Real-World Website Navigation with Mult...
2024-02-08
Code
11
MindAct-780M
15.13
No
WebLINX: Real-World Website Navigation with Mult...
2024-02-08
Code
12
Flan-T5-250M
14.99
No
WebLINX: Real-World Website Navigation with Mult...
2024-02-08
Code
13
MindAct-250M
12.63
No
WebLINX: Real-World Website Navigation with Mult...
2024-02-08
Code
14
Pix2Act-282M
12.51
No
WebLINX: Real-World Website Navigation with Mult...
2024-02-08
Code
15
GPT-4T (Zero-Shot)
10.72
No
WebLINX: Real-World Website Navigation with Mult...
2024-02-08
Code
16
GPT-4V (Zero-Shot)
10.45
No
WebLINX: Real-World Website Navigation with Mult...
2024-02-08
Code
17
GPT-3.5T (Zero-Shot)
8.51
No
WebLINX: Real-World Website Navigation with Mult...
2024-02-08
Code
#1
Llama-2-13B
SOTA
25.21
Overall score
· 2024-02-08
WebLINX: Real-World Website Navigation with Multi-Turn Dialogue
Code
#2
S-LLaMA-2.7B
25.02
Overall score
· 2024-02-08
WebLINX: Real-World Website Navigation with Multi-Turn Dialogue
Code
#3
Llama-2-7B
24.57
Overall score
· 2024-02-08
WebLINX: Real-World Website Navigation with Multi-Turn Dialogue
Code
#4
Flan-T5-3B
23.77
Overall score
· 2024-02-08
WebLINX: Real-World Website Navigation with Multi-Turn Dialogue
Code
#5
S-LLaMA-1.3B
23.73
Overall score
· 2024-02-08
WebLINX: Real-World Website Navigation with Multi-Turn Dialogue
Code
#6
GPT-3.5F
21.22
Overall score
· 2024-02-08
WebLINX: Real-World Website Navigation with Multi-Turn Dialogue
Code
#7
MindAct-3B
20.94
Overall score
· 2024-02-08
WebLINX: Real-World Website Navigation with Multi-Turn Dialogue
Code
#8
Fuyu-8B
19.97
Overall score
· 2024-02-08
WebLINX: Real-World Website Navigation with Multi-Turn Dialogue
Code
#9
Flan-T5-780M
17.27
Overall score
· 2024-02-08
WebLINX: Real-World Website Navigation with Multi-Turn Dialogue
Code
#10
Pix2Act-1.3B
16.88
Overall score
· 2024-02-08
WebLINX: Real-World Website Navigation with Multi-Turn Dialogue
Code
#11
MindAct-780M
15.13
Overall score
· 2024-02-08
WebLINX: Real-World Website Navigation with Multi-Turn Dialogue
Code
#12
Flan-T5-250M
14.99
Overall score
· 2024-02-08
WebLINX: Real-World Website Navigation with Multi-Turn Dialogue
Code
#13
MindAct-250M
12.63
Overall score
· 2024-02-08
WebLINX: Real-World Website Navigation with Multi-Turn Dialogue
Code
#14
Pix2Act-282M
12.51
Overall score
· 2024-02-08
WebLINX: Real-World Website Navigation with Multi-Turn Dialogue
Code
#15
GPT-4T (Zero-Shot)
10.72
Overall score
· 2024-02-08
WebLINX: Real-World Website Navigation with Multi-Turn Dialogue
Code
#16
GPT-4V (Zero-Shot)
10.45
Overall score
· 2024-02-08
WebLINX: Real-World Website Navigation with Multi-Turn Dialogue
Code
#17
GPT-3.5T (Zero-Shot)
8.51
Overall score
· 2024-02-08
WebLINX: Real-World Website Navigation with Multi-Turn Dialogue
Code