"phd-level reasoning"
"phd-level reasoning"
The web search poisoned it. ChatGPT with web search off gets it. As well as basically every other LLM.
Its response:
This is a trick riddle built on the classic wolf–goat–cabbage problem.
Key twists:
So there are no constraints left to manage.
✅ Minimum number of trips needed: 1
The goat simply crosses the river once.
I mean, this is just one of half a dozen experiments I conducted (replicating just a few of the thousands that actual scientists do), but the point stands: what PhD (again, that was Sam Qltman’sclaim, not mine) would be thrown off by a web search?
Unless the creators of LLMs admit that their systems won’t achieve AGI by just throwing more money at it, shitty claims will prevent the field from actual progress.
Do you know many PhDs? Being thrown off by a web search isn’t that unbelievable.
Half the ones I know can barely operate their email
agreed. he did.
my comment was mostly about PhD level being a nonsense term when speaking about general intelligence rather than depth of knowledge in a specific field