FinalRun AI Agent achieves a 76.7% success rate on the Android World Benchmark across 116 diverse real-world tasks, setting a new state of the art.
Success Rate: 76.7%
Tasks Evaluated: 116
Tasks Passed: 89
Android World Benchmark Leaderboard
116 real-world Android tasks tested with our AI agent
Tasks Passed: 89
Tasks Failed: 27
Success Rate: 76.7%
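The headline figure follows from the pass and fail counts. Here is a minimal sketch of that arithmetic, assuming the rate is passed tasks divided by all evaluated tasks. The function name `success_rate` is illustrative and not part of any FinalRun tooling.

```python
def success_rate(passed: int, failed: int) -> float:
    """Return the pass percentage, rounded to one decimal place."""
    total = passed + failed
    if total == 0:
        raise ValueError("no tasks evaluated")
    return round(100 * passed / total, 1)


# Counts as reported on this page.
passed, failed = 89, 27

print(passed + failed)               # 116 tasks evaluated
print(success_rate(passed, failed))  # 76.7
```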
Success rates across different task types and difficulty levels
| Task Category | Easy | Medium | Hard |
|---|---|---|---|
| Complex UI Understanding | 83% | 60% | 57% |
| Data Edit | 91% | 14% | 0% |
| Data Entry | 93% | 70% | 44% |
| Game Playing | 100% | — | — |
| Information Retrieval | 86% | 67% | 33% |
| Math & Counting | 100% | 67% | 67% |
| Memorization | 100% | 100% | 25% |
| Multi-App Operations | 100% | 100% | 33% |
| Parameterized Tasks | 93% | 62% | 56% |
| Repetition | 100% | 40% | 20% |
| Requires Setup | 100% | 33% | 0% |
| Screen Reading | 92% | 100% | 44% |
| Search | 91% | 60% | 67% |
| Transcription | 0% | 50% | 50% |
| Untagged | 80% | 100% | — |
| Verification | 86% | — | — |