This market will resolve to "Yes" if any model on the Arena.AI Leaderboard (arena.ai/leaderboard/text) reaches at least the specified Arena Score on the "Leaderboard" tab for "Math" by December 31, ...
From high school math modeling challenges to formal theorem-proving competitions, large language models (LLMs) are stepping into the competitive math arena. New datasets, benchmarks, and governance ...
This market will resolve to "Yes" if any model on the Arena.AI Leaderboard (arena.ai/leaderboard/text) reaches at least the specified Arena Score on the "Leaderboard" tab for "Math" by June 30, 2026, ...