Researchers have introduced Light-R1-32B, a new open-source AI model optimized to solve advanced math problems. It is now available on Hugging Face under a permissive Apache 2.0 license — free for ...
The Register on MSN

AI models still suck at math

Just less than before, according to the ORCA test. Exclusive: Current-day LLMs are prediction engines and, as such, they can ...
Alan Veliz-Cuba has received funding from the Simons Foundation and the American Mathematical Society for some of his research. You can probably think of a time when you’ve used math to solve an ...
24-year-old founder and CEO Carina Hong created Axiom Math in March 2025 and has recruited a team of ten employees, most of whom are from Meta, to build a math-focused AI model. Last fall, Carina Hong ...
From writing essays to coding, there’s seemingly nothing modern AI chatbots like ChatGPT and Microsoft Copilot cannot accomplish. But even though they seem limitless on the surface, they’re certainly ...
AI could soon spew out hundreds of mathematical proofs that look "right" but contain hidden flaws, or proofs so complex we can't verify them. How will we know if they're right?
Google DeepMind’s AlphaProof and AlphaGeometry 2 are milestones for AI reasoning. This story originally appeared in The Algorithm, our weekly newsletter on AI.
Microsoft found that small language models can exceed the performance of much larger ones when trained to specialize in a single area. Researchers fine-tuned the Mistral 7B model to create Orca-Math, ...
Students and STEM researchers of the world, rejoice! Particularly if you ...
What if the secrets to the universe’s most perplexing mathematical riddles were no longer locked away, but instead cracked open by an artificial mind? In a new development, OpenAI’s o3-mini model has ...
OpenAI Model Wins Gold at International Mathematical Olympiad – or Did It? A Google DeepMind researcher and OpenAI’s former CTO are posing questions about the validity of ...