FinalRun AI Agent achieves 76.7% success rate on Android World Benchmark, across 116 diverse real-world tasks, setting a new state-of-the-art by beating the previous record.
76.7%
Success Rate
116
Tasks Evaluated
89
Tasks Passed
Android World Benchmark Leaderboard
116 real-world Android tasks tested with our AI agent
89
Tasks Passed
27
Tasks Failed
76.7%
Success Rate
Success rates across different task types and difficulty levels
Task Category | Easy | Medium | Hard |
---|---|---|---|
Complex UI Understanding | 83% | 60% | 57% |
Data Edit | 91% | 14% | 0% |
Data Entry | 93% | 70% | 44% |
Game Playing | 100% | — | — |
Information Retrieval | 86% | 67% | 33% |
Math & Counting | 100% | 67% | 67% |
Memorization | 100% | 100% | 25% |
Multi-App Operations | 100% | 100% | 33% |
Parameterized Tasks | 93% | 62% | 56% |
Repetition | 100% | 40% | 20% |
Requires Setup | 100% | 33% | 0% |
Screen Reading | 92% | 100% | 44% |
Search | 91% | 60% | 67% |
Transcription | 0% | 50% | 50% |
Untagged | 80% | 100% | — |
Verification | 86% | — | — |