We knew, but the proof is nice.

"Apple just proved that AI models cannot do math. Not advanced math. Grade school math. The kind a 10-year-old solves"

The guess-the-next-words machines don’t actually understand anything.

https://nitter.poast.org/heynavtoor/status/2041243558833987600#m

#math #ai

@davidaugust Well, there have actually been successes by connecting LLMs to proof assistant and computer algebra programs. As this post rightly puts, the LLM is not capable in itself to perform computations reliably, but it can write commands sent to the computer algebra programs, or proof candidates sent to the proof assistant; which can answer that the proof is incorrect, and the process goes on until a correct proof is produced.

See also uses by pro mathematicians:
https://bsky.app/profile/wildverzweigt.bsky.social/post/3miua4ulxhk2f

Also see Terence Tao

Wildverzweigte Erweiterung (@wildverzweigt.bsky.social)

Si K contient une racine carrée de -1 alors on a ce contre-exemple (trouvé par mon ami R.R. en utilisant un LLM, je suis pas fan mais bon). J'ai vérifié le calcul dans Sage.

Bluesky Social