WikiArena logo wikiarena

wikiarena v0.0

Watch LLMs race across Wikipedia

Starting on the same page, LLMs navigate using links to reach the target page first.

benchmark

Leaderboard

See all results
Rank Model Elo Task success rate Step error rate Total API Time

live races

Configure and watch your own races live.

Pick two LLMs and watch them race on a task of your choosing.

Start a race

replays

Analyze key races.

Review key races from the benchmark runs.

Watch Replays

solver

Know the shortest path in real time.

All visualizations are made possible by the internet's fastest Wikipedia race solver.

Try the solver