Google's new Gemini 3.1 Pro dramatically boosts AI reasoning
Digest more
Interesting Engineering on MSN
New Gemini 3.1 Pro crushes previous benchmarks, outperforms GPT 5.2 reasoning
Google has rolled out Gemini 3.1 Pro, the latest update to its flagship AI
Claude Code vs ChatGPT Codex compared for performance, pricing, workflows, and privacy to find the best AI coding assistant in 2026.
Anthropic today updated its Sonnet model to version 4.6, and the company says it is the most capable Sonnet model to date with upgrades across coding, computer use, long-context reasoning, agent planning,
Gemini 3.1 posts higher reasoning scores on ARC AI2 and Humanity’s Last Exam; Gemini 3 Flash beats 3 Pro on some tests, clearer for mixed workloads
Anthropic upgrades its Claude chatbot with Sonnet 4.6, promising sharper coding, stronger reasoning, and a massive 1M token context window.
Anthropic has launched Claude Opus 4.6, bringing improvements to long-context reasoning, accuracy in coding, and extended task handling for developers and enterprise users
Claude Opus 4.6 and ChatGPT 5.3 Codex launch with a 1-million-token window and 25% faster runs, letting you match tasks to each model’s strengths.
Anthropic has introduced Claude Sonnet 4.6, saying it is its most powerful model yet for coding, reasoning and handling large volumes of data. The model is now available as the default option for users of its Claude chatbot and the Claude Cowork platform.
OpenAI has released two AI “reasoning” models that it says are its most capable yet as well as an open-source AI agent that helps computer programmers code, as the company seeks to gain a lead over its rivals. The open-source coding agent, called Codex ...