AlphaProof Nexus Advances AI Formal Verification

DeepMind’s AlphaProof Nexus pairs large language models with the Lean formal verifier to autonomously solve research-level math problems, including 9 open Erdős problems and 44 OEIS conjectures, at modest compute cost. Its iterative agentic loops, in which Lean checks each LLM-generated proof draft and the verifier's feedback drives the next revision, turn competition-grade reasoning into machine-checked theorems. Agent configurations ranged from a simple LLM+Lean setup to evolutionary proof drafting, and even the basic configurations succeeded, which points to stronger base-model capabilities and to the value of verifier feedback. Beyond mathematics, Nexus signals practical advances for formal verification in smart contracts, zero-knowledge proofs, and cryptographic protocol assurance.
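The draft, check, and refine loop described above can be sketched in a few lines. This is a minimal illustration, not DeepMind's implementation: `refine_until_verified`, `CheckResult`, `toy_draft`, and `toy_check` are hypothetical names, and the toy functions stand in for a real LLM call and a real Lean invocation.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class CheckResult:
    ok: bool       # True when the verifier accepts the proof
    message: str   # verifier feedback, e.g. a compiler error


def refine_until_verified(
    statement: str,
    draft: Callable[[str, Optional[str]], str],
    check: Callable[[str, str], CheckResult],
    max_rounds: int = 5,
) -> Optional[str]:
    """Ask `draft` for a proof, verify it, and feed errors back until it checks."""
    feedback: Optional[str] = None
    for _ in range(max_rounds):
        proof = draft(statement, feedback)
        result = check(statement, proof)
        if result.ok:
            return proof           # only verified proofs are ever returned
        feedback = result.message  # the next draft sees the verifier's error
    return None                    # budget exhausted without a verified proof


# Toy stand-ins (assumptions): a real system would call an LLM and Lean here.
def toy_draft(statement: str, feedback: Optional[str]) -> str:
    # The first attempt is deliberately incomplete; the second reacts to feedback.
    return "sorry" if feedback is None else "by simp"


def toy_check(statement: str, proof: str) -> CheckResult:
    if proof == "by simp":
        return CheckResult(True, "")
    return CheckResult(False, "proof contains 'sorry'")


verified = refine_until_verified("theorem t : 1 + 1 = 2", toy_draft, toy_check)
print(verified)  # prints: by simp
```

The key design point is that the loop never trusts the drafter: a proof leaves the function only after the checker accepts it, so a weak or hallucinating drafter costs extra rounds rather than correctness.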

Why It Matters

AlphaProof Nexus demonstrates that coupling LLMs with formal verifiers can turn high-level reasoning into machine-checked proofs, lowering barriers to reliable automated verification. This matters for tech professionals because the same techniques can improve assurance in smart contracts, zero-knowledge systems, and cryptographic protocols.

Latest Changes

AlphaProof Nexus autonomously solved 9 of 353 open Erdős problems (about 2.5%) and proved 44 of 492 OEIS conjectures (about 8.9%).

The system uses iterative agentic loops where LLM drafts are checked and refined by the Lean verifier to produce machine-checked theorems.

Different agent configurations, from simple LLM+Lean to evolutionary drafting, showed even basic setups can succeed, indicating stronger base-model capabilities.

Nexus achieved these results at modest compute cost, highlighting practical feasibility for broader verification tasks.

Related work: ATLAS proposes autoformalizing large textbook corpora into machine-checkable libraries to scale formalization.

Timeline

2026-05-23 — DeepMind reports Nexus solved 9 Erdős problems and proved 44 OEIS conjectures using LLMs paired with Lean.


2026-05-26 — Chinese-language coverage highlights that two of the Erdős problems Nexus solved had been unresolved for 56 years.

2026-05-29 — Researchers introduce ATLAS, proposing pipelines to autoformalize large textbook corpora into formal libraries.

Recent News (4)

ATLAS: Autoformalized Textbook Library At Scale

Researchers introduce ATLAS, a project to autoformalize large textbook corpora into machine-checkable proofs and formal libraries using AI. The paper “Formalizing Mathematics at Scale” proposes pipelines combining LLM-driven translation, interactive theorem provers, and verification tooling to convert informal mathematics into formal languages, aiming to scale formalization across domains and lower the manual burden on proof engineers. Key players include the paper’s authors and the broader proof-assistant and LLM ecosystem; the work ties into systems like Lean, Coq and modern large language models. This matters because scalable autoformalization could accelerate trustworthy mathematical knowledge, improve software verification, and create high-quality training data for reasoning-focused AI.

HN · vrm · 1d ago · 20 pts
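To make "autoformalize" concrete, here is a small hand-written illustration of the kind of output such a pipeline targets: an informal textbook sentence paired with a Lean 4 statement and proof. It is not taken from the ATLAS paper. The theorem name `sum_of_evens_is_even` is hypothetical, and the snippet assumes a recent Mathlib, which supplies `Even` and the `obtain` tactic.

```lean
import Mathlib

-- Informal source sentence: "The sum of two even integers is even."
-- In Mathlib, `Even a` unfolds to `∃ r, a = r + r`.
theorem sum_of_evens_is_even (a b : ℤ) (ha : Even a) (hb : Even b) :
    Even (a + b) := by
  obtain ⟨m, hm⟩ := ha   -- hm : a = m + m
  obtain ⟨n, hn⟩ := hb   -- hn : b = n + n
  exact ⟨m + n, by omega⟩ -- remaining goal: a + b = m + n + (m + n)
```

Once a statement like this compiles, the proof assistant has checked every step, which is what makes a formalized library usable as a trusted foundation and as training data.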

Google AI framework AlphaProof Nexus cracks 2 math problems left open for 56 years

Google DeepMind’s new AI framework AlphaProof Nexus combined large language models with Lean formal verification to autonomously solve 9 open Erdős problems out of 353, including two that had been unresolved for 56 years. The system also proved 44 conjectures in OEIS, solved a 15-year-old Hilbert function problem, and improved bounds in convex optimization, with per-problem costs of a few hundred dollars. AlphaProof Nexus uses four agents of increasing complexity (Agent A using Gemini 3.1 Pro + Lean, Agent B integrating AlphaProof, Agent C adding evolutionary proof drafting, and Agent D combining all features). Researchers found even the simplest agent could solve the nine cases, underscoring stronger base-model capabilities and the anchoring effect of compiler feedback on LLM reasoning. This advances automated formal math and tool-assisted theorem proving.

NewsNow · May 26, 2026
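The coverage does not say how Agent C's evolutionary proof drafting works, so the following is only a generic evolutionary search over proof drafts, written as an assumption-laden sketch. Every name in it is hypothetical: `evolve`, `toy_mutate`, `toy_errors`, `TARGET`, and `VOCAB`. The toy error count stands in for a verifier reporting how far a draft is from compiling.

```python
import random
from typing import Callable, Optional, Sequence, Tuple

Draft = Tuple[str, ...]  # a proof draft as a sequence of tactic names


def evolve(
    seeds: Sequence[Draft],
    mutate: Callable[[Draft, random.Random], Draft],
    errors: Callable[[Draft], int],
    generations: int = 200,
    keep: int = 4,
    children_per_parent: int = 4,
    seed: int = 0,
) -> Optional[Draft]:
    """Keep the drafts with the fewest errors, mutate them, and repeat."""
    rng = random.Random(seed)  # fixed seed so runs are reproducible
    population = list(seeds)
    for _ in range(generations):
        population.sort(key=errors)
        if errors(population[0]) == 0:
            return population[0]          # a draft the checker accepts
        parents = population[:keep]       # elitism: the best drafts survive
        children = [
            mutate(parent, rng)
            for parent in parents
            for _ in range(children_per_parent)
        ]
        population = parents + children
    population.sort(key=errors)
    return population[0] if errors(population[0]) == 0 else None


# Toy stand-ins (assumptions): a real system would mutate drafts with an LLM
# and score them with Lean's feedback instead of comparing against a target.
TARGET: Draft = ("intro", "induction", "simp")
VOCAB = ("intro", "induction", "simp", "ring", "omega")


def toy_mutate(draft: Draft, rng: random.Random) -> Draft:
    i = rng.randrange(len(draft))  # rewrite one randomly chosen step
    return draft[:i] + (rng.choice(VOCAB),) + draft[i + 1:]


def toy_errors(draft: Draft) -> int:
    return sum(1 for a, b in zip(draft, TARGET) if a != b)


best = evolve([("ring", "ring", "ring")], toy_mutate, toy_errors)
print(best)
```

Because the best drafts are carried over unchanged each generation, the lowest error count never gets worse, which is one way verifier feedback can anchor an otherwise random search.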
