MathOdyssey: Benchmarking Mathematical Problem-Solving Skills in Large Language Models Using Odyssey Math Data

Meng Fang; Xiangpeng Wan; Fei Lu; Fei Xing; Kai Zou

doi:10.1038/s41597-025-05283-3

Scientific Data (Aug 2025)

MathOdyssey: Benchmarking Mathematical Problem-Solving Skills in Large Language Models Using Odyssey Math Data

Meng Fang,
Xiangpeng Wan,
Fei Lu,
Fei Xing,
Kai Zou

Affiliations

Meng Fang: Department of Computer Science, University of Liverpool
Xiangpeng Wan: NetMind.AI
Fei Lu: Department of Mathematics, Johns Hopkins University
Fei Xing: Mathematica Policy Research
Kai Zou: NetMind.AI

DOI: https://doi.org/10.1038/s41597-025-05283-3
Journal volume & issue: Vol. 12, no. 1
pp. 1 – 8

Abstract

Read online

Abstract Large language models (LLMs) have significantly advanced natural language understanding and demonstrated strong problem-solving abilities. Despite these successes, most LLMs still struggle with solving mathematical problems due to the intricate reasoning required. To support rigorous evaluation of mathematical reasoning in LLMs, we introduce the “MathOdyssey” dataset - a curated collection of 387 expert-generated mathematical problems spanning high school, university, and Olympiad-level topics. Each problem is accompanied by a detailed solution and categorized by difficulty level, subject area, and answer type. The dataset was developed through a rigorous multi-stage process involving contributions from subject experts, peer review, and standardized formatting. We provide detailed metadata and a standardized schema to facilitate consistent use in downstream applications. To demonstrate the dataset’s utility, we evaluate several representative LLMs and report their performance across problem types. We release MathOdyssey as an open-access resource to enable reproducible and fine-grained assessment of mathematical capabilities in LLMs and to foster further research in mathematical reasoning and education.

Published in Scientific Data

ISSN: 2052-4463 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Science
Website: https://www.nature.com/sdata/

About the journal