A new paper, "The Leaderboard Illusion", offers a 68 pages critique of the way the popular Chatbot Arena LLM leaderboard can potentially be gamed by large AI labs with deep pockets. Here's my attempt at adding some extra context to the issues described in the paper.
https://simonwillison.net/2025/Apr/30/criticism-of-the-chatbot-arena/
