Mastodawn

David August ❌👑1d ago

We knew, but the proof is nice.

"Apple just proved that AI models cannot do math. Not advanced math. Grade school math. The kind a 10-year-old solves"

The guess-the-next-words machines don’t actually understand anything.

https://nitter.poast.org/heynavtoor/status/2041243558833987600#m

#math #ai

Show thread

audioflyer79 1d ago

@davidaugust Ecosia AI gets it right. It looks like the paper referenced was published in 2025, so the research conducted prior. The models are all much better now. I’m no AI apologist, but I think any argument of “AI sucks because it’s not good at _____” is on tenuous ground and will be proven wrong as the models continue to improve. @Ecosia

Show thread

Alison Wilder 1d ago

@audioflyer79 @davidaugust I mean, it's worth noting that the LLMs have ingested that paper by now. : /

Show thread

audioflyer79

@alisynthesis @davidaugust fair enough. I changed up the problem completely and added some reasoning and it did pretty well. It appears to be generating code to solve the math. The only thing it missed is that very unripe bananas are green, not yellow.

James picks 40 apples on Monday. Then he picks 35 lemons on Tuesday. On Wednesday, he picks half as many bananas as he did apples, but five of them were very unripe. How many yellow fruits does James have?

Show thread

Morten Hilker-Skaaning 19h ago

@audioflyer79 @alisynthesis @davidaugust how does it do if you swap the colors of the fruit?