Math Models Projects - Search News

7d on MSN

Mathematical model reveals how collapsing matter and expanding voids shape universe's evolution

A University of Queensland researcher has developed a new mathematical model to explain the evolution of the universe which, ...

Scientific American 5d

Can Writing Math Proofs Teach AI to Reason Like Humans?

OpenAI researchers reveal how their experimental model, devoid of any external aids, powered through hours-long proofs to earn a gold-medal score at the International Math Olympiad—and they discuss th ...

OpenAI Model Earns Gold-Medal Score at International Math Olympiad and Advances Path to Artificial General Intelligence

A few months before the 2025 International Mathematical Olympiad (IMO) in July, a three-person team at OpenAI made a long bet ...

Hosted on MSN 7mon

Microsoft says 'rStar-Math' demonstrates how small language models ...

Microsoft enhances the capabilities of small language models (SLMs) with rStar-Math. The technique allows SLMs to compete with or even surpass the math reasoning ...

Ars Technica 4mon

New study shows why simulated reasoning AI models don’t yet live up ...

New study shows why simulated reasoning AI models don’t yet live up to their billing. Top AI models excel at math problems but lack the reasoning needed for Math Olympiad proofs.

Ars Technica 9mon

New secret math benchmark stumps AI models and PhDs alike

FrontierMath's difficult questions remain unpublished so that AI companies can't train against it.

Forbes 3mon

Big Models, Bad Math: The GenAI Problem In Finance

The reason for this is fundamental: ChatGPT, and many models like it, can't actually do math. They rely on sophisticated pattern recognition and statistical memory, not true mathematical computation.

TechRepublic 1mon

OpenAI Model Wins Gold at International Mathematical Olympiad – or ...

And yet, recently, Gemini 2.5 Pro and OpenAI’s o3 scored 86.7% and 88.9%, respectively, in the American Invitational Mathematics Examination, a key math benchmark for AI models.

TechCrunch 3mon

DeepSeek upgrades its math-focused AI model Prover

The company recently released an upgraded version of V3, a general-purpose model, and is expected to update its R1 “reasoning” model soon.

VentureBeat 4y

Researchers find that large language models struggle with math

To measure the problem-solving ability of large and general-purpose language models, the researchers created a dataset called MATH, which consists of 12,500 problems taken from high school math ...
