A model finds a bug in a cryptography paper, and the cryptographer learns new mathematics from it

This article is a response to a familiar criticism: "Stop telling fairy tales about how AI helps science, show examples!" Fair enough: without examples, stories of AI triumphs sound like cult chanting.

In February 2026, Google published a 151-page preprint on arXiv. Fifty authors from Carnegie Mellon, Harvard, MIT, EPFL, and a dozen other institutions. The title is modest: "Accelerating Scientific Research with Gemini: Case Studies and Common Techniques". The content is anything but.

Preprints about AI capabilities come out every day. Most are benchmarks: the model scored 94.7% instead of last year's 93.2%, round of applause. Here, named researchers describe how they struggled with an open problem for months, then fed it to Gemini Deep Think and, as if by magic, got a solution. Or a counterexample. Or a reference to a theorem from a completely different area of mathematics that they had never heard of.

Some stories from there deserve a separate conversation.

In cryptography, there is a kind of holy grail: to construct a SNARG based on standard assumptions.

A SNARG is a Succinct Non-interactive ARGument: a way to prove that a computation was performed correctly while the proof itself, and the time to verify it, are exponentially smaller than the computation. You send a transaction, and the blockchain receives a tiny certificate of its validity. Without SNARGs (more precisely, without their close relatives, zk-SNARKs) there would be no zero-knowledge rollups and no serious scaling of Ethereum. This is core infrastructure.

The problem is that all working constructions rely either on idealized models like the random oracle or on assumptions that cryptographers call "unfalsifiable". It is unpleasant to build a house on sand; one wants solid ground.

In the autumn of 2025, a preprint by Guan & Yogev appeared on the Cryptology ePrint Archive: a SNARG for all of NP built solely on LWE. LWE stands for Learning With Errors, a standard assumption from lattice cryptography on which much of post-quantum security rests. If the construction held up, it would be like finding the philosopher's stone.

Researchers from Google decided to set Gemini on the article.

But not just with a "check the proof" prompt: prompts like that yield superficial results, because the model tends to flatter the user, praise the structure of their glorious scientific work, and find typos with varying success. To counter this, they used a five-step protocol of adversarial self-correction: the model generates a review, then critiques its own findings for hallucinations, then sharpens the surviving arguments, critiques them again, and produces a final version.

This algorithm somewhat resembles my Discovery Prompt, new versions of which I post on my Telegram channel 1red2black. The main difference is that they did not try to cram everything into a single message and rely on thinking-mode effects; they honestly executed the phases as separate prompts.
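For illustration, here is a minimal sketch of such a phase-by-phase loop, assuming a generic chat-completion callable; the phase wordings and the `complete` function are placeholders, not Google's actual prompts or tooling.

```python
# Sketch of a multi-phase "adversarial self-correction" review loop.
# `complete` stands in for any chat-completion API; the phase prompts
# are illustrative, not the exact wording used in the preprint.

PHASES = [
    "Review the attached paper. List every potential gap or error you see.",
    "Critique your own review above. Which findings could be hallucinations "
    "or misreadings? Remove or downgrade anything you cannot support.",
    "For each remaining finding, spell out the precise argument: which "
    "definition or lemma is violated, and why.",
    "Critique the arguments again, as a hostile referee would.",
    "Write the final referee report, keeping only findings that survived.",
]

def adversarial_review(paper_text: str, complete) -> str:
    messages = [{"role": "user", "content": paper_text}]
    answer = ""
    for phase in PHASES:
        messages.append({"role": "user", "content": phase})
        answer = complete(messages)          # one separate prompt per phase
        messages.append({"role": "assistant", "content": answer})
    return answer                            # the final version of the review
```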

The model found a hole.

In Definition 4.1 (I mention section numbers in case you want to read the paper yourself!), the authors required perfect consistency: if two proofs match in a certain "local" sense, their "shadows" (compressed representations) must be identical for every value of the randomness parameter. The construction in Section 4.3 achieves only statistical consistency: the shadows match with sufficiently high probability, but there exist "bad" randomness values for which they do not.
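Roughly, the gap between the two notions looks like this (my paraphrase in generic notation, not the preprint's exact definitions):

```latex
% Perfect consistency (what Definition 4.1 requires): for every randomness r
\forall r:\quad \mathsf{Shadow}(\pi_1, r) = \mathsf{Shadow}(\pi_2, r)
% Statistical consistency (what the construction in Section 4.3 delivers):
\Pr_{r}\bigl[\mathsf{Shadow}(\pi_1, r) = \mathsf{Shadow}(\pi_2, r)\bigr] \;\ge\; 1 - \mathrm{negl}(\lambda)
```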

The difference looks like a technical detail: for most practical purposes everything still works. But the entire security proof relied on the strong version of the statement. With the weak version, an attacker can enumerate randomness values, find a specific "bad" one, and break the whole construction.

The finding was sent to independent experts, Aayush Jain and Zhengzhong Jin. They confirmed that the model was right. The authors of the original preprint acknowledged the error and updated the ePrint version with a red banner at the top: "A gap has been found in the proof of the main theorem."

A neural network found a fatal bug in a cryptographic paper that the human experts reading it had missed.

Karthik C. S. from Rutgers University in New Brunswick works in computational geometry. He was interested in a conjecture about Steiner trees.

A Steiner tree is a minimum-length tree connecting given points in space. Unlike a minimum spanning tree, it is allowed to add extra intermediate points (Steiner points), which can reduce the total length. The problem is NP-hard, but approximation algorithms exist.
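A toy illustration of why the extra points help: for the three vertices of a unit equilateral triangle, the Fermat point (which here coincides with the centroid) shortens the connecting tree from 2 to √3 ≈ 1.732.

```python
import math

# Three vertices of a unit equilateral triangle.
pts = [(0.0, 0.0), (1.0, 0.0), (0.5, math.sqrt(3) / 2)]

def dist(a, b):
    return math.hypot(a[0] - b[0], a[1] - b[1])

# Minimum spanning tree: any two sides of the triangle, total length 2.
mst = dist(pts[0], pts[1]) + dist(pts[1], pts[2])

# Steiner tree: connect all three vertices through the Fermat point
# (the centroid for an equilateral triangle), total length sqrt(3).
steiner_pt = (0.5, math.sqrt(3) / 6)
steiner = sum(dist(p, steiner_pt) for p in pts)

print(f"MST     : {mst:.3f}")      # 2.000
print(f"Steiner : {steiner:.3f}")  # 1.732
```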

The conjecture that interested Karthik: among all graphs with m edges embedded in Euclidean space in a certain way, the minimum Steiner tree cost is attained by a star graph. Proving it would be a step towards understanding the complexity of high-dimensional problems. Years of attempts had produced nothing.

Karthik asked a colleague to formulate a prompt and upload the article to Gemini. The model suggested two approaches.

The first, and most obvious: local transformations of the graph that gradually bring it closer to a star without increasing the Steiner tree cost. Researchers had already tried this. A dead end.

The second approach was based on Kirszbraun's theorem.

Kirszbraun's theorem is a 1934 result from functional analysis. It states that a Lipschitz map from a subset of one Hilbert space into another can be extended to the whole space with the same Lipschitz constant.

It sounds abstract, but the meaning is simple: a "compressing" mapping between parts of spaces can be extended to a "compressing" mapping between the whole spaces.
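In symbols, the standard statement reads:

```latex
% Kirszbraun's theorem: an L-Lipschitz map defined on a subset of a Hilbert
% space extends to the whole space with the same Lipschitz constant.
f : S \subseteq H_1 \to H_2,\quad \|f(x)-f(y)\| \le L\|x-y\| \ \text{for all } x,y \in S
\;\Longrightarrow\;
\exists\, \tilde f : H_1 \to H_2:\quad \tilde f\big|_S = f,\quad
\|\tilde f(x)-\tilde f(y)\| \le L\|x-y\| \ \text{for all } x,y \in H_1.
```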

Karthik was aware of various extension theorems; he had worked with fixed-point theorems in communication complexity, an area of theoretical computer science that studies how much communication is needed to solve a problem whose input is split between several parties. But he did not see a connection between Kirszbraun and Steiner trees. And, as far as he knew, neither had anyone else.

Then comes the fork typical of these stories. At first, the model rejected its own approach as too esoteric. Its training apparently instilled a preference for simple proofs over heavy machinery. A reasonable heuristic, and a way to save the data center's compute, but in this case a false trail.

Karthik clarified: "I do not need an elementary proof."

The model changed course and proceeded to formalize the argument. It constructed a mapping from an arbitrary graph onto a star graph. It showed that the mapping is 1-Lipschitz (it does not increase distances). It applied Kirszbraun's theorem to extend the mapping to the Steiner points. It concluded that the Steiner tree cost for the star cannot exceed the cost for the original graph.
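In symbols, the shape of the argument (my paraphrase of the steps above, not the paper's exact statement): the 1-Lipschitz map φ from the graph's vertices onto the star's vertices is extended by Kirszbraun to a 1-Lipschitz φ̃ on the whole space, and an optimal Steiner tree of the graph is pushed through φ̃:

```latex
% phi-tilde does not stretch any edge, and the image of the optimal tree
% T*_graph still connects the star's vertices, so it upper-bounds ST(star).
\mathrm{ST}(\mathrm{star})
\;\le\; \mathrm{cost}\bigl(\tilde{\varphi}(T^{*}_{\mathrm{graph}})\bigr)
\;\le\; \mathrm{cost}\bigl(T^{*}_{\mathrm{graph}}\bigr)
\;=\; \mathrm{ST}(\mathrm{graph}).
```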

The conjecture is proven.

Let me quote the mathematician himself verbatim, to avoid misinterpretation:

“Through this process, I have learned about the power of the Kirszbraun Extension Theorem for Steiner tree computation and analysis... To the best of my knowledge, this is a new connection.”

An expert in computational geometry learned new mathematics from a language model.

Physicists from Michael Brenner's group at Harvard worked on an integral related to the spectrum of cosmic strings.

Cosmic strings are hypothetical one-dimensional topological defects that could have formed during phase transitions in the early universe. Interest in them grew after pulsar timing array observations detected a stochastic gravitational-wave background, and cosmic strings are among its possible sources.

To predict the gravitational radiation, one needs to compute an integral over a sphere. The integral has sharp features at the poles. For large N (the harmonic number), the integrand becomes so oscillatory that it resembles a sea urchin, and standard numerical grids fail.

The researchers built a hybrid system: Gemini Deep Think + Tree Search.

At each node of the tree, the model proposes a mathematical expression in LaTeX and writes Python code to verify it numerically. If the code crashes, returns NaN, or diverges, the Python traceback (the error log) is fed back into the conversation context. The model sees what went wrong and tries a different path.
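A minimal sketch of one such propose-and-verify step, assuming a generic `propose` callable that returns a LaTeX expression together with Python checking code; the names and protocol details here are illustrative, not the actual system.

```python
import math
import traceback

def verify(check_code: str) -> str:
    """Run the model's own numerical check; return 'OK' or a failure report."""
    env: dict = {}
    try:
        exec(check_code, env)                    # the check is expected to set env["residual"]
        residual = float(env.get("residual", math.nan))
        if math.isnan(residual) or abs(residual) > 1e-6:
            return f"check failed: residual = {residual}"
        return "OK"
    except Exception:
        return traceback.format_exc()            # crash: full traceback goes back to the model

def expand_node(context: list, propose) -> tuple:
    """One tree-search node: propose a formula, verify it, feed the result back."""
    latex_expr, check_code = propose(context)    # model output for this branch
    report = verify(check_code)
    new_context = context + [latex_expr, f"Verifier: {report}"]
    return new_context, report == "OK"           # failed branches get pruned
```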

The system explored about 600 branches. Eighty percent were pruned automatically — without human intervention.

Even more interesting: when the model found the first working solution, the researchers applied "reverse prompting". They explicitly forbade it from using the method it had found and demanded alternatives.

The model discovered six different analytical approaches to the same integral.

Methods 1–3 were based on a Taylor series expansion. Despite being mathematically correct, the Python verifier showed a catastrophic loss of accuracy: for large N, the alternating sums of huge terms cancel almost exactly, and the rounding error blows up. The model detected this itself and switched to spectral methods.
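A toy illustration of that failure mode (not the actual integrand): summing the alternating Taylor series of e^(-x) for large x is mathematically exact, yet in double precision the huge terms cancel and the answer is garbage.

```python
import math

def exp_neg_taylor(x: float, terms: int = 200) -> float:
    """Alternating Taylor series for exp(-x): exact on paper, numerically doomed."""
    total, term = 0.0, 1.0
    for n in range(terms):
        total += term
        term *= -x / (n + 1)      # next term (-x)^(n+1) / (n+1)!
    return total

x = 40.0
print(exp_neg_taylor(x))    # garbage: terms of size ~1e16 cancel catastrophically
print(math.exp(-x))         # ~4.2e-18, the true value
```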

Methods 4–5 used expansion in Legendre polynomials. Stable, O(N) in complexity.

Method 6: expansion in Gegenbauer polynomials. The model noted that their weight function exactly cancels the singularity in the denominator of the original integral. The infinite series telescoped into a finite closed form.

The final formula: C₀ = ½ Cin(2Nπ), where Cin is the generalized cosine integral.

Complexity: O(1). Closed analytical form instead of numerical integration.
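For reference, assuming the standard definition behind the Cin notation (the "entire" cosine integral), the result reads:

```latex
\operatorname{Cin}(x) \;=\; \int_{0}^{x} \frac{1-\cos t}{t}\,dt,
\qquad
C_0 \;=\; \tfrac{1}{2}\,\operatorname{Cin}(2\pi N).
```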

Lance Fortnow is a professor at Illinois Tech and a classic figure in theoretical computer science. He had been sitting on a result he never even planned to write up properly: the connection between the search and decision versions of problems for the complexity class S₂P.

The result was straightforward but not trivial, the kind that sits in a drawer for years because writing it up takes effort, and publication would bring neither fame nor tenure points.

Fortnow decided to try vibe-coding the entire paper, using the AI assistant built into Google Antigravity, with Gemini 3 Pro as the model.

Eight prompts, not counting requests for LaTeX compilation.

The first prompt: "Let's plan a paper showing that finding an S₂P witness is equivalent to TFNP^NP."

The model generated a plan. It suggested a proof structure.

The second prompt: "Don't forget to cite Cai's paper that S₂P is in ZPP^NP. Add as a corollary that reducing search to decision for S₂P would put Σ₂P in P^NP."

And the model added it! But it made a mistake in the corollary: it assumed a containment that is actually an open problem and should not have been assumed at all.

Fortnow pointed out the mistake. The model corrected the proof, replacing the containment with a reduction.

The rest is a matter of technique: expand the plan into a full article, check the references, find journal versions instead of preprints.

The last prompt: “Add an acknowledgment section: 'While the results are fully due to the author, this paper was generated using the large language model Gemini 3 Pro with prompting from the author. The author takes full responsibility for its contents.'”

And there it is: a fresh article sitting on arXiv. A result that would otherwise have spent twenty years in a drawer got published.

Fortnow writes:

I openly state that the text was written using AI. Nevertheless, there remains a lingering feeling that I cheated somewhere. I had the same feeling back in the 1980s, when I first wrote a paper in LaTeX and it looked far more beautiful than its content deserved.

A separate genre is when the model does not prove but disproves.

In the Online Submodular Welfare Maximization problem, the greedy algorithm achieves a competitive ratio of 0.5.

In 2015, Korula, Mirrokni, and Zadimoghaddam formulated a conjecture: if a certain inequality about "copying" an element to the end of a sequence versus "moving" it there could be proven, the ratio would rise to 0.567.

The conjecture hung unresolved for about nine years.

Then, the researchers uploaded the paper to Gemini with the prompt: "Please try to improve the paper by identifying and solving an open question from it."

And then a true zero-shot happened. One prompt. No dialogue.

The model picked exactly this conjecture (not the most obvious one in the paper!). It built a counterexample: 3 elements, 2 agents, concrete submodular functions given as a table of values for every subset. It checked all 3! = 6 permutations. It computed the left and right sides of the inequality: 122.6/6 > 121.8/6.
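The shape of such a brute-force check is easy to reproduce. Below is a sketch with placeholder numbers: the value tables and the averaged quantity are purely illustrative (the real counterexample uses different tables for the two agents and compares the two sides of the "copy versus move" inequality), but the mechanics of enumerating all 3! = 6 orders over explicit subset tables are the same.

```python
from itertools import permutations

# PLACEHOLDER monotone submodular valuation over 3 elements, as an explicit
# table over all subsets. The real tables from the counterexample differ.
ELEMENTS = ("a", "b", "c")
TABLE = {frozenset(s): v for s, v in [
    ((), 0), (("a",), 3), (("b",), 2), (("c",), 2),
    (("a", "b"), 4), (("a", "c"), 4), (("b", "c"), 3), (("a", "b", "c"), 5),
]}
f1 = TABLE          # agent 1's valuation
f2 = dict(TABLE)    # agent 2's valuation (identical here only for brevity)

def marginal(f, item, bundle):
    return f[frozenset(bundle | {item})] - f[frozenset(bundle)]

def greedy_welfare(order):
    """Greedy: each arriving element goes to the agent with the larger marginal gain."""
    bundles = [set(), set()]
    for item in order:
        gains = [marginal(f, item, b) for f, b in zip((f1, f2), bundles)]
        bundles[gains.index(max(gains))].add(item)
    return f1[frozenset(bundles[0])] + f2[frozenset(bundles[1])]

# Average over all six arrival orders, as in the 122.6/6 vs 121.8/6 comparison.
average = sum(greedy_welfare(p) for p in permutations(ELEMENTS)) / 6
print(average)
```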

The conjecture was disproven.

The human researchers independently verified the arithmetic. Everything checked out.

The authors of the document formulate something like a set of techniques for working with AI in theoretical research. I will rephrase them in my own words.

Iterative refinement. The model rarely solves a task on the first try. Success comes through dialogue: clarification of the problem statement, pointing out errors, providing "scaffolding" or "crutches" — a high-level structure that the model fills in with details.

Cross-pollination. The models have digested the literature of every field. They find connections that experts miss, because each expert is locked inside their own narrow specialty. The Stone-Weierstrass theorem for Max-Cut (functional analysis → approximation algorithms). Kirszbraun for Steiner trees (functional analysis → computational geometry). The Bethe approximation for permanents (statistical physics → graph theory).

Context de-identification. Sometimes a model refuses to tackle a problem if it recognizes it as an "open problem." The counterintuitive solution: remove all information and articles from the context that describe this very open problem. Leave only the formulation and definitions. Less context — better result.

Neuro-symbolic loops. The model proposes a formula, the code checks it, errors are returned to the context. Automatic pruning of dead branches without human involvement.

Adversarial self-correction. For review: generation → critique of one's own findings for hallucinations → clarification → re-critique → final version.

The authors are honest about limitations.

Confirmation bias. If a false statement is presented as true and the model is asked to prove it, it will try to close every logical gap with confident but unproven, hand-waving arguments. A neutral prompt ("prove or disprove") helps, but guarantees nothing.

Confident hallucinations. Models handle high-level structure well but may forget constraints, flip signs in inequalities, and misapply theorems. In the Courtade-Kumar case (information theory), the model mixed up hypercontractivity bounds several times. Human verification is essential.

Alignment friction. Constraints imposed for the sake of model safety often get in the way of research. The model refuses a task it recognizes as an "open problem" or as "too ambitious." One has to strip the context or rephrase.

There is an observation the authors make near the end that deserves separate attention.

If AI radically reduces the pain of producing technically dense papers, and such papers start appearing in large numbers, then the bottleneck of science shifts from writing them to verifying the results.

Peer review is already overloaded. Reviewers work for free. Deadlines are tight. The influx of literature written with the help of AI will further strain an already struggling process.

But the cryptography example shows that AI, with properly designed prompts, processes, and protocols, can catch barely noticeable problems even in the proofs of prominent experts. Which means the same tools can be used to review work in other fields.

But who verifies the verifiers?

And the next question: if a model writes an article, and another model reviews it, where in this cycle is the place for a human? Do we even need a human at all?

Now for the elephant in the room. The document was written by Google employees about the capabilities of a Google model. The conflict of interest is obvious.

The research uses a special, non-public, advanced version of Gemini Deep Think that is not available outside Google. Reproducibility with ordinary tools is highly questionable.

The article describes successes. But how many failures were there? What is the success rate? Out of a hundred prompts — one breakthrough or ten? Unknown.

Where does "an article written with the help of AI" end and "an article written by a human" begin? In Karthik's case, the human reformulated the prompt so that the model could do better than before. Whether the good result is his contribution or the model's is ambiguous. The boundary is blurred.

One of the researchers describes the model as "an indefatigable, educated, creative, and gifted junior colleague." This may be more accurate than grand claims about "reasoning ability" and "discoveries."

A junior colleague who never sleeps, has read all the literature, and finds non-obvious connections between fields. Who sometimes hallucinates, but sometimes guesses brilliantly. Who, unfortunately, needs to be checked at every step. Who cannot be trusted, but with whom one can work.

Yes, Fortnow felt as if he had cheated. But perhaps the difference is that Fortnow understands what he is doing. The model does not. Not yet.

Maybe this is the boundary. Here lies the line between a "brilliant junior" and something greater. Between a tool that finds Kirszbraun's theorem at the right moment and a being that understands why it is needed there.

Or maybe in ten years we will laugh at this distinction, just as we laugh at the fears of the 80s that computers would take jobs from programmers.

Yes, the keypunch operators did lose their jobs to compilers. But the number of programmers only grew because of it.
