Mathematical Net Models

OpenAI Model Wins Gold at International Mathematical Olympiad – or Did It?

A Google DeepMind researcher and OpenAI’s former CTO are posing questions about the validity of OpenAI’s claim about its gold-medal score. OpenAI’s latest model has achieved a gold-level score at the ...

VentureBeat

New open-source math model Light-R1-32B surpasses equivalent DeepSeek performance with only $1000 in training costs

Researchers have introduced Light-R1-32B, a new open-source AI model optimized to solve advanced math problems. It is now available on Hugging Face under a permissive Apache 2.0 license — free for ...

EurekAlert!

MathEval: a comprehensive benchmark for evaluating large language models on mathematical reasoning capabilities

This study introduces MathEval, a comprehensive benchmarking framework designed to systematically evaluate the mathematical reasoning capabilities of large language models (LLMs). Addressing key ...

Ars Technica

New study shows why simulated reasoning AI models don’t yet live up to their billing

There’s a curious contradiction at the heart of today’s most capable AI models that purport to “reason”: They can solve routine math problems with accuracy, yet when faced with formulating deeper ...

Neowin

DeepSeek launches new math-oriented model to solve secrets of the universe

DeepSeek made waves in early 2025, launching one of the world's first free-to-access thinking models. Now, the Chinese firm has just released DeepSeekMath-V2 with the objective of achieving ...

Phys.org

Beyond intuition: Using mathematical models to shape behavior

A new study introduces choice engineering—a powerful new way to guide decisions using math instead of guesswork. By applying carefully designed mathematical models, researchers found they could ...

National Academies of Sciences%2c Engineering%2c and Medicine

Assessing the Reliability of Complex Models: Mathematical and Statistical Foundations of Verification, Validation, and Uncertainty Quantification

In Chapter 1, verification is defined as the process of determining how accurately a computer program (“code”) correctly solves the equations of a mathematical model. This includes code verification ...

Ars Technica

Google claims math breakthrough with proof-solving AI models

On Thursday, Google DeepMind announced that AI systems called AlphaProof and AlphaGeometry 2 reportedly solved four out of six problems from this year’s International Mathematical Olympiad (IMO), ...

Phys.org

Humans beat AI at international math contest despite gold-level AI scores

Humans beat generative AI models made by Google and OpenAI at a top international mathematics competition, despite the programs reaching gold-level scores for the first time. Neither model scored full ...

National Academies of Sciences%2c Engineering%2c and Medicine

Assessing the Reliability of Complex Models: Mathematical and Statistical Foundations of Verification, Validation, and Uncertainty Quantification

NOTICE: The project that is the subject of this report was approved by the Governing Board of the National Research Council, whose members are drawn from the councils of the National Academy of ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results