Examples of Math Models

New study shows why simulated reasoning AI models don’t yet live up to their billing

There’s a curious contradiction at the heart of today’s most capable AI models that purport to “reason”: They can solve routine math problems with accuracy, yet when faced with formulating deeper ...

Phys.org

Beyond intuition: Using mathematical models to shape behavior

A new study introduces choice engineering—a powerful new way to guide decisions using math instead of guesswork. By applying carefully designed mathematical models, researchers found they could ...

VentureBeat

Meet LLEMMA, the math-focused open source AI that outperforms rivals

Join the event trusted by enterprise leaders for nearly two decades. VB Transform brings together the people building real enterprise AI strategy. Learn more In a new paper, researchers from various ...

InfoQ

Microsoft Research Unveils rStar-Math: Advancing Mathematical Reasoning in Small Language Models

A monthly overview of things you need to know as an architect or aspiring architect. Unlock the full InfoQ experience by logging in! Stay updated with your favorite authors and topics, engage with ...

VentureBeat

Microsoft’s new Orca-Math AI outperforms models 10x larger

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More Students and STEM researchers of the world, rejoice! Particularly if you ...

Nature

MathOdyssey: Benchmarking Mathematical Problem-Solving Skills in Large Language Models Using Odyssey Math Data

Large language models (LLMs) have significantly advanced natural language understanding and demonstrated strong problem-solving abilities. Despite these successes, most LLMs still struggle with ...

EurekAlert!

MathEval: a comprehensive benchmark for evaluating large language models on mathematical reasoning capabilities

This study introduces MathEval, a comprehensive benchmarking framework designed to systematically evaluate the mathematical reasoning capabilities of large language models (LLMs). Addressing key ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results