VUB's Data Analytics Lab has published new results showing that it is possible to develop original mathematical proofs using commercial language models. In a paper posted to the arXiv preprint server, ...
New benchmark study results show leading AI models, including ChatGPT, Claude, and Gemini, still lag humans in visual math reasoning.
In A Nutshell Researchers used computer modeling and physics-based math to calculate the minimum antibiotic and moisturizer doses needed to push severe eczema into remission and keep it there. The ...
The Register on MSN

AI models still suck at math

Just less than before, according to the ORCA test exclusive Current-day LLMs are prediction engines and, as such, they can only find the most likely solution to problems, which is not necessarily the ...
Alan Veliz-Cuba has received funding from the Simons Foundation and the American Mathematical Society for some of his research. You can probably think of a time when you’ve used math to solve an ...
24-year-old founder and CEO Carina Hong created Axiom Math in March 2025 and has recruited a team of ten employees, most of whom are from Meta, to build a math-focused AI model. Last fall, Carina Hong ...
Chinese AI lab DeepSeek has quietly updated Prover, its AI model that’s designed to solve math-related proofs and theorems. According to South China Morning Post, DeepSeek uploaded the latest version ...
From writing essays to coding, there’s seemingly nothing modern AI chatbots like ChatGPT and Microsoft Copilot cannot accomplish. But even though they seem limitless on the surface, they’re certainly ...