Tasks
SotA
Datasets
Papers
Methods
Submit
About
SotA
/
Natural Language Processing
/
Conversational Web Navigation
/
WebLINX
Conversational Web Navigation on WebLINX
Metric: Text (F1) (higher is better)
Leaderboard
Dataset
Loading chart...
Results
Submit a result
Export CSV
Sort:
Text (F1) (best first)
Text (F1) (worst first)
Date (newest first)
Date (oldest first)
Model name (A→Z)
#
Model
↕
Text (F1)
▼
Extra Data
Paper
Date
↕
Code
1
S-LLaMA-2.7B
27.17
No
WebLINX: Real-World Website Navigation with Mult...
2024-02-08
Code
2
Llama-2-13B
26.6
No
WebLINX: Real-World Website Navigation with Mult...
2024-02-08
Code
3
Llama-2-7B
26.5
No
WebLINX: Real-World Website Navigation with Mult...
2024-02-08
Code
4
S-LLaMA-1.3B
25.85
No
WebLINX: Real-World Website Navigation with Mult...
2024-02-08
Code
5
Flan-T5-3B
25.75
No
WebLINX: Real-World Website Navigation with Mult...
2024-02-08
Code
6
Pix2Act-1.3B
25.21
No
WebLINX: Real-World Website Navigation with Mult...
2024-02-08
Code
7
MindAct-3B
23.16
No
WebLINX: Real-World Website Navigation with Mult...
2024-02-08
Code
8
GPT-3.5F
22.39
No
WebLINX: Real-World Website Navigation with Mult...
2024-02-08
Code
9
Fuyu-8B
22.3
No
WebLINX: Real-World Website Navigation with Mult...
2024-02-08
Code
10
Pix2Act-282M
16.4
No
WebLINX: Real-World Website Navigation with Mult...
2024-02-08
Code
11
Flan-T5-780M
14.05
No
WebLINX: Real-World Website Navigation with Mult...
2024-02-08
Code
12
MindAct-780M
13.58
No
WebLINX: Real-World Website Navigation with Mult...
2024-02-08
Code
13
Flan-T5-250M
9.21
No
WebLINX: Real-World Website Navigation with Mult...
2024-02-08
Code
14
MindAct-250M
7.67
No
WebLINX: Real-World Website Navigation with Mult...
2024-02-08
Code
15
GPT-4T (Zero-Shot)
6.75
No
WebLINX: Real-World Website Navigation with Mult...
2024-02-08
Code
16
GPT-4V (Zero-Shot)
6.21
No
WebLINX: Real-World Website Navigation with Mult...
2024-02-08
Code
17
GPT-3.5T (Zero-Shot)
3.45
No
WebLINX: Real-World Website Navigation with Mult...
2024-02-08
Code
#1
S-LLaMA-2.7B
SOTA
27.17
Text (F1)
· 2024-02-08
WebLINX: Real-World Website Navigation with Multi-Turn Dialogue
Code
#2
Llama-2-13B
26.6
Text (F1)
· 2024-02-08
WebLINX: Real-World Website Navigation with Multi-Turn Dialogue
Code
#3
Llama-2-7B
26.5
Text (F1)
· 2024-02-08
WebLINX: Real-World Website Navigation with Multi-Turn Dialogue
Code
#4
S-LLaMA-1.3B
25.85
Text (F1)
· 2024-02-08
WebLINX: Real-World Website Navigation with Multi-Turn Dialogue
Code
#5
Flan-T5-3B
25.75
Text (F1)
· 2024-02-08
WebLINX: Real-World Website Navigation with Multi-Turn Dialogue
Code
#6
Pix2Act-1.3B
25.21
Text (F1)
· 2024-02-08
WebLINX: Real-World Website Navigation with Multi-Turn Dialogue
Code
#7
MindAct-3B
23.16
Text (F1)
· 2024-02-08
WebLINX: Real-World Website Navigation with Multi-Turn Dialogue
Code
#8
GPT-3.5F
22.39
Text (F1)
· 2024-02-08
WebLINX: Real-World Website Navigation with Multi-Turn Dialogue
Code
#9
Fuyu-8B
22.3
Text (F1)
· 2024-02-08
WebLINX: Real-World Website Navigation with Multi-Turn Dialogue
Code
#10
Pix2Act-282M
16.4
Text (F1)
· 2024-02-08
WebLINX: Real-World Website Navigation with Multi-Turn Dialogue
Code
#11
Flan-T5-780M
14.05
Text (F1)
· 2024-02-08
WebLINX: Real-World Website Navigation with Multi-Turn Dialogue
Code
#12
MindAct-780M
13.58
Text (F1)
· 2024-02-08
WebLINX: Real-World Website Navigation with Multi-Turn Dialogue
Code
#13
Flan-T5-250M
9.21
Text (F1)
· 2024-02-08
WebLINX: Real-World Website Navigation with Multi-Turn Dialogue
Code
#14
MindAct-250M
7.67
Text (F1)
· 2024-02-08
WebLINX: Real-World Website Navigation with Multi-Turn Dialogue
Code
#15
GPT-4T (Zero-Shot)
6.75
Text (F1)
· 2024-02-08
WebLINX: Real-World Website Navigation with Multi-Turn Dialogue
Code
#16
GPT-4V (Zero-Shot)
6.21
Text (F1)
· 2024-02-08
WebLINX: Real-World Website Navigation with Multi-Turn Dialogue
Code
#17
GPT-3.5T (Zero-Shot)
3.45
Text (F1)
· 2024-02-08
WebLINX: Real-World Website Navigation with Multi-Turn Dialogue
Code