Metric: Visual Matching-Open/Close Drawer (lower is better)
| # | Model↕ | Visual Matching-Open/Close Drawer▲ | Extra Data | Paper | Date↕ | Code |
|---|---|---|---|---|---|---|
| 1 | Octo-Base | 0.227 | Yes | Octo: An Open-Source Generalist Robot Policy | 2024-05-20 | - |
| 2 | TraceVLA | 0.24 | Yes | TraceVLA: Visual Trace Prompting Enhances Spatia... | 2024-12-13 | - |
| 3 | RT-2-X | 0.25 | Yes | RT-2: Vision-Language-Action Models Transfer Web... | 2023-07-28 | Code |
| 4 | RoboVLM | 0.268 | Yes | Towards Generalist Robot Policies: What Matters ... | 2024-12-18 | Code |
| 5 | OpenVLA | 0.356 | Yes | OpenVLA: An Open-Source Vision-Language-Action M... | 2024-06-13 | Code |
| 6 | SoFar | 0.403 | No | SoFar: Language-Grounded Orientation Bridges Spa... | 2025-02-18 | Code |
| 7 | Dita-300M | 0.463 | Yes | Dita: Scaling Diffusion Transformer for Generali... | 2025-03-25 | Code |
| 8 | SpatialVLA | 0.593 | Yes | SpatialVLA: Exploring Spatial Representations fo... | 2025-01-27 | - |
| 9 | RT-1-X | 0.597 | Yes | RT-1: Robotics Transformer for Real-World Contro... | 2022-12-13 | Code |