We knew, but the proof is nice.

"Apple just proved that AI models cannot do math. Not advanced math. Grade school math. The kind a 10-year-old solves"

The guess-the-next-words machines don’t actually understand anything.

https://nitter.poast.org/heynavtoor/status/2041243558833987600#m

#math #ai

@davidaugust Ecosia AI gets it right. It looks like the paper referenced was published in 2025, so the research was conducted before then. The models are all much better now. I’m no AI apologist, but I think any argument of “AI sucks because it’s not good at _____” is on tenuous ground and will be proven wrong as the models continue to improve. @Ecosia

@audioflyer79 @davidaugust Most (all?) big "AI" chatbots have been patched to intercept math questions and hand them over to an actual program (often Python).

That doesn't change the fact that the LLM part itself cannot do math, and there are still risks that it will misinterpret your question and produce the wrong program to calculate the answer.
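
A rough sketch of that hand-off, to make the risk concrete. Everything here is an assumption for illustration: `evaluate` is a hypothetical helper, the word problem is made up, and the two expressions are hand-written stand-ins for what a model might emit, not output from any real chatbot.

```python
import ast
import operator

# Hypothetical calculator tool: evaluates plain arithmetic without eval().
_OPS = {
    ast.Add: operator.add,
    ast.Sub: operator.sub,
    ast.Mult: operator.mul,
    ast.Div: operator.truediv,
}

def evaluate(expression):
    """Compute +, -, *, / over numeric literals; reject anything else."""
    def walk(node):
        if isinstance(node, ast.Expression):
            return walk(node.body)
        if (isinstance(node, ast.Constant)
                and isinstance(node.value, (int, float))
                and not isinstance(node.value, bool)):
            return node.value
        if isinstance(node, ast.BinOp) and type(node.op) in _OPS:
            return _OPS[type(node.op)](walk(node.left), walk(node.right))
        if isinstance(node, ast.UnaryOp) and isinstance(node.op, ast.USub):
            return -walk(node.operand)
        raise ValueError("unsupported expression")
    return walk(ast.parse(expression, mode="eval"))

# Made-up question: "3 boxes of 12 apples, 5 of them slightly bruised.
# How many apples are there?" The bruised detail is irrelevant to the count.
correct_translation = "3 * 12"       # what the model should hand to the tool
misread_translation = "3 * 12 - 5"   # what it might hand over if it misreads

print(evaluate(correct_translation))
print(evaluate(misread_translation))
```

The calculator is exact both times. If the wrong expression goes in, the wrong answer comes out, and that translation step is still done by the LLM.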