Reasoning Coding - Search News

6h on MSN

Google’s new Gemini 3.1 Pro dramatically boosts AI reasoning — but some users say it feels less human in the process

New benchmarks show it more than doubles problem-solving ability vs Gemini 3 Pro ...

Interesting Engineering on MSN

New Gemini 3.1 Pro crushes previous benchmarks, outperforms GPT 5.2 reasoning

Google has rolled out Gemini 3.1 Pro, the latest update to its flagship AI ...

12h

Google Gemini 3.1 Pro first impressions: a 'Deep Think Mini' with adjustable reasoning on demand

The question now is whether this release triggers a response from competitors. Gemini 3 Pro's original launch last November set off a wave of model releases ...

Tech Times

Claude Code vs ChatGPT Codex: Which AI Coding Agent is Actually the Best in 2026

Claude Code vs ChatGPT Codex compared for performance, pricing, workflows, and privacy to find the best AI coding assistant ...

Gemini 3.1 Targets General AI While Rivals Focus on Coding Models

Gemini 3.1 posts higher reasoning scores on ARC-AGI-2 and Humanity’s Last Exam; Gemini 3 Flash beats 3 Pro on some tests, clearer for mixed workloads ...

Gemini 3 Deepthink For Beginners: Supports Research, Coding & Business Analysis with Deep Reasoning

Gemini 3 Deepthink is built for complex reasoning and can take 10–20 minutes per query, helping you get clearer, ...

Claude Sonnet 4.6 Brings Improved Coding, Computer Use, and Office Tasks

Anthropic today updated its Sonnet model to version 4.6, and the company says it is the most capable Sonnet model to date with upgrades across coding, computer use, long-context reasoning, agent ...

Bloomberg L.P.

OpenAI Releases New Reasoning Models for Coding and Visual Tasks

OpenAI is rolling out a pair of new artificial intelligence models that mimic the process of human reasoning to field more complicated coding questions and visual tasks, the latest in a flurry of ...

AOL

OpenAI debuts new ‘reasoning’ models and coding agent as it seeks to stay at the front of the AI pack

OpenAI has released two AI “reasoning” models that it says are its most capable yet as well as an open-source AI agent that helps computer programmers code, as the company seeks to gain a lead over ...

Anthropic Rolls Out Claude Sonnet 4.6 as Default Model, Boosts Coding and Long-Context Reasoning

Anthropic upgrades its Claude chatbot with Sonnet 4.6, promising sharper coding, stronger reasoning, and a massive 1M token context window.

17h

Google launches Gemini 3.1 Pro, retaking AI crown with 2X+ reasoning performance boost

The most significant advancement in Gemini 3.1 Pro lies in its performance on rigorous logic benchmarks. Most notably, the model achieved a verified score of 77.1% on ARC-AGI-2.
