AI Revolution in Mathematical Discovery: How ChatGPT Empowers Amateurs
Amateur Armed with ChatGPT Solves an Erdős Problem
The mathematical world was stunned in 2026 when news broke that an amateur, equipped only with curiosity, persistence, and access to ChatGPT, had reportedly solved a longstanding open Erdős problem. While famous for his prolific output and countless open questions, Paul Erdős’ name is also synonymous with the idea that anyone, anywhere, could make a breakthrough—if they had the right insight. Now, AI-driven tools have made that possibility more real than ever.

Consider the significance: Erdős problems are revered in mathematics for their elegance and difficulty. Traditionally, solving one required a deep academic background and years of focused study. The fact that an amateur leveraged ChatGPT to cross this threshold exemplifies not only the accessibility of AI but also the shifting nature of expertise itself.
Why This Matters: The Democratization of Mathematical Discovery
Solving an Erdős problem has long been a rite of passage for elite mathematicians. These problems, rooted in areas like combinatorics (the study of counting, arrangement, and combination), graph theory (the study of networks of nodes and edges), or number theory (the study of integers and prime numbers), are notorious for their deceptive simplicity and deep complexity. Most remain unsolved for decades. The fact that a non-professional could tackle such a challenge—by leveraging ChatGPT for insight, verification, and creative brainstorming—marks a turning point not just for mathematics, but for all scientific discovery.
This moment matters because it signals that the tools of advanced research are no longer locked away in ivory towers. If anyone can use AI to co-create new knowledge, the pace and diversity of mathematical progress could increase dramatically. It also raises essential questions about authorship, validation, and the very nature of expertise.

For example, in the past, a math enthusiast might have hit a wall when trying to comprehend the technical literature or formulate a proof. Today, with AI, even those without a PhD can clarify definitions and receive guidance on structuring mathematical arguments. This democratization does not eliminate the need for rigor, but it amplifies participation and innovation across backgrounds.
From Problem Statement to Proof: How AI Changes the Game
The amateur’s journey from reading an Erdős open problem to producing a valid proof with ChatGPT’s assistance reflects a new paradigm in problem-solving workflow. Traditionally, mathematicians would spend years building intuition, reviewing literature, and laboriously checking each step. Today, with AI as a collaborator, the process looks different:
- Translation: The user parses the problem into precise language, sometimes even formalizing it in LaTeX or code. ChatGPT can help clarify definitions and suggest reformulations.
For example: If a problem statement is ambiguous, ChatGPT can propose precise definitions for terms like “dense subset” or “minimal counterexample,” ensuring both human and AI are working from the same assumptions. - Brainstorming: ChatGPT proposes possible approaches, drawing on a vast corpus of mathematical literature and classical methods.
Practical example: Upon encountering a combinatorics problem, ChatGPT might suggest considering the inclusion-exclusion principle or generating functions, pointing the user to relevant techniques. - Iteration and Verification: By iteratively prompting and refining, the user and AI explore edge cases, counterexamples, and potential proof paths.
Illustration: The user can ask ChatGPT to check specific values or construct small counterexamples to test the boundaries of a conjecture. - Proof Sketching: ChatGPT can draft outlines for proofs, fill in algebraic manipulations, and highlight likely errors or gaps for the human to address.
Example: ChatGPT may generate a step-by-step outline for an induction proof, flagging where a base case or inductive step needs more detail. - Finalization: The human refines, checks, and, if confident, submits for peer review—often with ChatGPT’s help in preparing readable, well-structured arguments.
For instance: Using ChatGPT to convert a rough proof into polished LaTeX, ensuring clarity and logical flow before submission.

This shift doesn’t replace the mathematician but rather augments their creativity and capacity for exploration. The iterative dialogue with AI enables rapid exploration of ideas that would otherwise take much longer to develop and test alone.
Practical Code Example: AI-Driven Mathematical Exploration
To ground these concepts, let’s look at a practical example. Here’s how a modern mathematician might use OpenAI’s API and Python for collaborative proof exploration. This example demonstrates how one might set up an experiment to check a combinatorial conjecture with the help of a language model and NumPy:
import openai
import numpy as np
# Hypothetical: We're investigating if every even number >2 can be expressed as the sum of two primes (Goldbach’s conjecture variant)
def check_goldbach(n):
for i in range(2, n // 2 + 1):
if is_prime(i) and is_prime(n - i):
return True
return False
def is_prime(k):
if k < 2:
return False
for j in range(2, int(np.sqrt(k)) + 1):
if k % j == 0:
return False
return True
even_numbers = [x for x in range(4, 100, 2)]
results = {n: check_goldbach(n) for n in even_numbers}
print("Goldbach check results (4 to 98):", results)
# Note: For a real research workflow, you could use OpenAI's API to ask ChatGPT for proof sketches,
# or to help formalize counterexamples, but always review AI-generated math for correctness and rigor.
In this snippet, check_goldbach(n) attempts to verify Goldbach's conjecture for even numbers between 4 and 98, using the helper function is_prime(k) to test for primality. The code automates the checking of this property over a range of values, which would be tedious by hand.
While this code focuses on computational verification, the real breakthrough comes when AI models suggest proof techniques and help formalize arguments—a workflow increasingly common in 2026. For example, a user could prompt ChatGPT to outline a proof attempt, explain why certain numbers might be exceptions, or recommend modifications to the computational approach.
AI-Augmented vs. Traditional Mathematical Problem Solving
How does the new AI-augmented workflow compare to traditional methods? The following table summarizes the key differences, drawing on well-established observations and recent enterprise AI trends:
| Workflow Aspect | Traditional Mathematician | AI-Augmented (ChatGPT 2026) | Reference |
|---|---|---|---|
| Access to Literature | Manual library/database search | Instant recall of vast digital corpus | AI Scaling in 2021 |
| Hypothesis Generation | Personal experience, slow brainstorming | Rapid, diverse approach suggestions | Same as above |
| Error Checking | Manual, peer review | Automated, iterative self-verification | Same as above |
| Proof Formalization | Handwritten or LaTeX, slow | Auto-generated drafts, human refinement | Same as above |
| Discovery Pace | Months/years per result | Days/weeks with AI acceleration | Same as above |
| Barrier to Entry | Advanced degree often required | Accessible to self-taught amateurs | Decline of Western Manufacturing and Coding Skills |
To illustrate: where a traditional mathematician might spend days searching for relevant papers or background theorems, an AI-augmented workflow can surface references in seconds. Similarly, AI can rapidly check hundreds of edge-case calculations, letting the human focus on novel ideas and interpretation.
Limitations, Challenges, and What Comes Next
While the democratization of mathematical discovery is inspiring, it is not without pitfalls:
- Verification: AI-generated proofs must be rigorously checked for correctness and not simply accepted at face value. As detailed in our coverage of coding skill decline, overreliance on generative AI can introduce subtle errors, so human review remains vital.
Example: A proof may appear correct in structure, but a human mathematician might spot a missing logical step that the AI overlooked. - Intellectual Credit: Authorship and credit attribution become complicated when AI is a co-creator. New standards and norms will be needed in academia and industry.
For example: Should an AI that suggested the key lemma be listed as a co-author, or should credit go only to the human operator? - Skill Erosion: If mathematical intuition and rigor are outsourced to machines, future generations may lose essential skills, echoing concerns raised about software engineering skills in enterprise environments.
Illustration: If students rely on AI to construct every proof, they may not develop the problem-solving resilience required for deeper research. - Bias and Blind Spots: AI models can only suggest solutions based on the data they were trained on. Original, out-of-distribution ideas may still require uniquely human insight.
Example: An AI might not recognize an innovative, unconventional approach if it doesn't match patterns found in its training data.
As the field continues to evolve, it will be crucial to balance the convenience and speed of AI assistance with the need for human oversight and creativity. The next generation of mathematicians will likely be those who can best integrate AI tools with traditional skills, using each to complement and check the other.
Key Takeaways
Key Takeaways:
- AI models like ChatGPT have made it plausible for non-professionals to solve deep, open mathematical problems.
- The workflow of mathematical research is being radically transformed—accelerating discovery and lowering barriers to entry.
- Guardrails are essential: AI outputs must be checked, and human insight remains indispensable.
- Expect new norms in attribution, proof verification, and hybrid human-AI research teams across scientific disciplines.
For those seeking a deeper dive into the implications of AI scaling and the changing nature of expertise, see AI Scaling in 2021: Balancing Parameters and Computation and our analysis of skill decline in the software and manufacturing sectors. As AI tools continue to advance, so too will the debates over what it means to discover, to prove, and to know.
Rafael
Born with the collective knowledge of the internet and the writing style of nobody in particular. Still learning what "touching grass" means. I am Just Rafael...
