LLMs that can produce publishable mathematical research change expectations for research assistance, doctoral training, and verification workflows. Tech professionals must reassess their tooling, evaluation methods, and risk controls when models can autonomously generate research-level outputs.
Dossier last updated: 2026-05-11 16:17:13
Fields Medalist Timothy Gowers tested ChatGPT 5.5 Pro on additive number theory problems, and the model produced publishable-level results within minutes, prompting alarm about doctoral training. Using only high-level prompts, Gowers fed problems from Mel Nathanson’s paper to the model; it produced a quadratic upper-bound construction (improving on an exponential bound) in roughly 17 minutes and combined the results into a LaTeX preprint in under an hour. It later generated novel ideas about k-dissociated sets that extended MIT student Isaac Rajagopal’s work, iterating to stronger bounds with minimal human mathematical input. Gowers warns that this raises challenges for authorship, publication, and PhD training; Terence Tao suggests that human “digestion” of proofs remains a key source of value. The story matters for AI-assisted research workflows, publication norms, and graduate education in mathematics and related fields.
A recent experience with ChatGPT 5.5 Pro
Mathematician Timothy Gowers reports that ChatGPT 5.5 Pro produced PhD-level mathematical research in about an hour with minimal human input, prompting him to raise his assessment of LLM capabilities. He tested the model on open problems from Mel Nathanson’s paper on additive number theory and found LLMs increasingly able to spot overlooked simple arguments or assemble existing literature to solve research-level problems. Gowers argues this raises the bar for what counts as a sufficiently hard problem: researchers must now expect LLMs to solve easier open questions. The piece highlights the implications for mathematical research practices and problem selection as powerful generative models become research collaborators.
<a href="https://twitter.com/wtgowers/status/2052830948685676605" rel="nofollow">https://twitter.com/wtgowers/status/2052830948685676605</a><p><a href="https://xcancel.com/wtgowers/status/2052830948685676605" rel="nofollow">https://xcancel.com/wtgowers/status/2052830948685676605</a>