Gemini 3.1 Pro is now available. It builds on the benchmark progress Gemini 3 established for Google. Model capabilities are ultimately relative, one expert said. Another week, another "smarter" model ...
The most significant advancement in Gemini 3.1 Pro lies in its performance on rigorous logic benchmarks. Most notably, the model achieved a verified score of 77.1% on ARC-AGI-2.
Some results have been hidden because they may be inaccessible to you
Show inaccessible results