AndroidWorld

Introduced 2024-05-17

AndroidWorld is an environment for building and benchmarking autonomous computer control agents.

It runs on a live Android emulator and contains a highly reproducible benchmark of 116 hand-crafted tasks across 20 apps. Each task is dynamically instantiated with randomly generated parameters, which yields millions of unique task variations.
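The idea of dynamic instantiation can be illustrated with a minimal sketch. This is not AndroidWorld's actual API: the template text, the `NAMES` list, the `TaskInstance` class, and the `instantiate` function are all hypothetical names chosen for illustration. The sketch assumes that each task is a goal template whose parameters are drawn from a seeded random generator, so the same seed always reproduces the same task.

```python
import random
from dataclasses import dataclass

# Hypothetical goal template and parameter pool (not taken from AndroidWorld).
TEMPLATE = "Create a contact named {name} with phone number {phone}."
NAMES = ["Alice", "Bob", "Chen", "Dara"]


@dataclass
class TaskInstance:
    goal: str      # natural-language instruction given to the agent
    params: dict   # concrete parameter values used to fill the template


def instantiate(seed: int) -> TaskInstance:
    """Fill the template with parameters drawn from a seeded generator."""
    rng = random.Random(seed)  # local generator, so the seed fully determines the task
    params = {
        "name": rng.choice(NAMES),
        "phone": "".join(str(rng.randint(0, 9)) for _ in range(10)),
    }
    return TaskInstance(goal=TEMPLATE.format(**params), params=params)


# Number of distinct instances this single toy template can produce:
# 4 names times 10^10 possible ten-digit phone numbers.
NUM_VARIATIONS = len(NAMES) * 10 ** 10
```

Calling `instantiate(7)` twice returns equal instances, which is the property that makes a randomized benchmark reproducible. Even this toy template has far more than a million possible instances, which suggests how a modest number of templates can expand into millions of variations.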

In addition to the built-in tasks, AndroidWorld also supports MiniWoB++, the popular web benchmark from Liu et al.

Key features of AndroidWorld include:

šŸ“ 116 diverse tasks across 20 real-world apps šŸŽ² Dynamic task instantiation for millions of unique variations šŸ† Durable reward signals for reliable evaluation 🌐 Open environment with access to millions of Android apps and websites šŸ’¾ Lightweight footprint (2 GB memory, 8 GB disk) šŸ”§ Extensible design to easily add new tasks and benchmarks šŸ–„ļø Integration with MiniWoB++ web-based tasks