Research Paper (Researchia:202605.22014)

Advancing Mathematics Research with AI-Driven Formal Proof Search

George Tsoukalas

Submitted: May 22, 2026
Subjects: AI; Artificial Intelligence

Description / Details

Large language models (LLMs) increasingly excel at mathematical reasoning, but their unreliability limits their usefulness in mathematics research. One mitigation is to have LLMs generate formal proofs in languages like Lean. We perform the first large-scale evaluation of this method's ability to solve open problems. Our most capable agent autonomously resolved 9 of 353 open Erdős problems at a per-problem cost of a few hundred dollars, proved 44 of 492 OEIS conjectures, and is being deployed in research on combinatorics, optimization, graph theory, algebraic geometry, and quantum optics. A basic agent that alternates LLM-based generation with Lean-based verification replicated the Erdős successes but proved costlier on the hardest problems. These findings demonstrate the power of AI-aided formal proof search and shed light on the agent designs that enable it.
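
The basic agent described above, alternating generation with verification, might look roughly like the following loop. This is only an illustrative sketch and not the paper's implementation: the names `proof_search`, `CheckResult`, `toy_generate`, and `toy_verify` are hypothetical, the attempt budget is an assumed parameter, and the toy functions stand in for a real LLM call and a real Lean check.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class CheckResult:
    ok: bool        # True when the checker accepts the candidate proof
    feedback: str   # checker message passed to the next generation attempt


def proof_search(
    statement: str,
    generate: Callable[[str, str], str],
    verify: Callable[[str, str], CheckResult],
    max_attempts: int = 8,  # assumed budget, not a figure from the paper
) -> Optional[str]:
    """Alternate generation and verification until a proof checks or the budget runs out."""
    feedback = ""
    for _ in range(max_attempts):
        candidate = generate(statement, feedback)
        result = verify(statement, candidate)
        if result.ok:
            return candidate
        feedback = result.feedback  # let the next attempt see what went wrong
    return None


# Hypothetical stand-ins: a real system would call an LLM and the Lean checker here.
def toy_generate(statement: str, feedback: str) -> str:
    return "by rfl" if feedback else "by sorry"


def toy_verify(statement: str, proof: str) -> CheckResult:
    if "sorry" in proof:
        return CheckResult(False, "proof contains sorry")
    return CheckResult(True, "")


found = proof_search("theorem two_eq : 1 + 1 = 2", toy_generate, toy_verify)
print(found)
```

Passing the generator and verifier in as callables keeps the loop independent of any particular model or Lean toolchain, so the same skeleton can be exercised with toy functions as shown here.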


Source: arXiv:2605.22763v1 (http://arxiv.org/abs/2605.22763v1)
PDF: https://arxiv.org/pdf/2605.22763v1
