Working Models in Math

Big Models, Bad Math: The GenAI Problem In Finance

The hype around generative AI (GenAI) is undeniable. Tools like ChatGPT have captivated the public imagination, demonstrating an impressive ability to generate human-like text, create content and ...

Hosted on MSN

AI models are starting to crack high-level math problems

Over the weekend, Neel Somani, who is a software engineer, former quant researcher, and a startup founder, was testing the math skills of OpenAI’s new model when he made an unexpected discovery. After ...

VentureBeat

AI’s math problem: FrontierMath benchmark shows how far technology still has to go

Artificial intelligence systems may be good at generating text, recognizing images, and even solving basic math problems—but when it comes to advanced mathematical reasoning, they are hitting a wall.

The Register on MSN

AI models still suck at math

Just less than before, according to the ORCA test exclusive Current-day LLMs are prediction engines and, as such, they can ...

NextBigFuture

AI Large Language Model Math Breakthroughs

AI large language models have been especially weak on math. There are now several papers from Google Deep Mind, Alibaba and other universities where AI large language models are at Math Olympiad ...

TechCrunch

Researchers question AI’s ‘reasoning’ ability as models stumble on math problems with trivial changes

How do machine learning models do what they do? And are they really “thinking” or “reasoning” the way we understand those things? This is a philosophical question as much as a practical one, but a new ...

TechRepublic

OpenAI Model Wins Gold at International Mathematical Olympiad – or Did It?

OpenAI Model Wins Gold at International Mathematical Olympiad – or Did It? Your email has been sent A Google DeepMind researcher and OpenAI’s former CTO are posing questions about the validity of ...

InfoQ

Alibaba Releases Two Open-Weight Language Models for Math and Voice Chat

Unlock the full InfoQ experience by logging in! Stay updated with your favorite authors and topics, engage with content, and download exclusive resources. Prevent AI-generated tech debt with Skeleton ...

Seeking Alpha

DeepSeek updates open-source model that solves math-related problems-report

DeepSeek has reportedly open-sourced Prover-V2 model, a new specialist artificial intelligence model, as competition heated up within China's AI industry. The announcement comes a day after Alibaba ...

Phys.org

Leading AI models struggle to solve original math problems

Mathematics, like many other scientific endeavors, is increasingly using artificial intelligence. Of course, math is the backbone of AI, but mathematicians are also turning to these tools for tasks ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results