@[email protected] You make several great points.
The non-surveyability issue is a big one:
https://en.wikipedia.org/wiki/Non-surveyable_proof . A bunch of people rejected the computer-assisted proof of the four-color theorem until it was significantly simplified. Imagine a math LLM spitting out considerably more complicated proofs at a breakneck pace. I argue that eventually such a thing would be indistinguishable from a random string generator. It'd also waste the time and energy of a whole lot of mathematicians in the process, as you pointed out.
We are already seeing code review---human beings checking pull requests etc---being overwhelmed by LLM code generators. Some organizations are abandoning this step as a result. What purpose is served by introducing this kind of dynamics into mathematics, of all things? It's quite strange to me, this bias towards always accelerating everything whenever that's possible to do, regardless of systemic or other risks.
Proofs written in Lean and similar systems have the very big benefit of surveyability, and there's probably a world in which ethically made and constituted LLMs could add beneficial features to such tools.
The analogy to cranks is interesting. I guess in my head it's similar to why we don't throw a handful of leaves up in the air and try to read a proof out of the pattern they make when they fall to the ground (usually!). It's the folly of approaching the problem of finding a needle in a haystack by making the haystack bigger. People love making the haystack bigger for some reason.
@[email protected] @[email protected] @[email protected]