Apple did the research; LLMs cannot do formal reasoning. Results change by as much as 10% if something as basic as the names change.
https://garymarcus.substack.com/p/llms-dont-do-formal-reasoning-and
Apple did the research; LLMs cannot do formal reasoning. Results change by as much as 10% if something as basic as the names change.
https://garymarcus.substack.com/p/llms-dont-do-formal-reasoning-and
Companies like OpenAI and their defenders claim generative AI can reason, learn, etc. We know itās nonsense, but itās still extremely important it gets called out.
@nf3xn @rubenerd @graue @halva @ShadowJonathan I doubt Hinton is lying although heās probably wrong. Thereās a problem in philosophy: is the mind separate from the body? If itās not, then it should be possible to model the brain well enough to simulate thought processes (at least in principle.)
Computational physics tells us that there is a function that could perform the simulation and Hintonās career is looking for it.
@dalias @graue @halva @ShadowJonathan Youād think that people who own a bicycle can just checkā¦
On a tangentially related note, flying bicycles are invented by future humanity āThe Dark Forestā personal flying vehicles in the form of helicopter backpacks. Theyāre ābicyclesā in the sense that theyāre two counter-rotating coaxially-mounted propellers. Thatās actually not a bad idea. If only we poured billions of dollars into making that work.
@enoch_exe_inc @dalias @graue @halva
> Youād think that people who own a bicycle can just checkā¦
does the emperor have no clothes? would people call him out on it?
@ShadowJonathan @enoch_exe_inc @dalias @graue @halva
I'm an avid cyclist. I'll be out riding most of today and tomorrow.
Technically ...
Some bicycles do fly:
https://www.youtube.com/watch?app=desktop&v=v9KJwOZ3frk
And many "mountain bikers" are known for getting a lot of "air"
https://www.youtube.com/watch?v=6s6-O054SXg&ab_channel=BrendanFairclough
But yes, generally, bicycles do *not* fly.
@graue @halva @ShadowJonathan You started with ācan flyā. But sure move the goalposts to ācan carry commercial passenger trafficā to avoid the point of the analogy extension. š
Have a safe flight and be sure to tip the pilot.
@MartyFouts @graue @halva @ShadowJonathan
You got the story wrong, exactly like LLM wrong.
They had wing to fly, but needed speed so they added the bicycle.
With LLM we have text generation, when we will have a reasoning IA, we will add LLM to talk to us.
Like the bicycle that can't fly but can produce speed, LLM can't reason but can talk.
@Aedius @graue @halva @ShadowJonathan I didnāt say anything about how the device evolved, only describing its eventual state. So no, I didnāt get the story wrong.
But I see you do understand the underlying point: there are researchers who are taking the bicycle with wings approach, making the assumption that multi-agent methods will work around LLM limitations.
This is in the context of massive companies spending billions hyping bicycles as viable replacements for aircraft.
It's blindingly obvious it's all a lie, but the hype keeps making it onto the front page and people keep investing in it as if it was true. Airlines are talking about replacing their planes with bikes etc etc.
There are serious discussions (by people who should really know better) about how plane makers are no longer needed because bicycles exist. It makes no sense but there's so much money invested that no one wants to be the one to admit it.
The question is, what happens when such research conflicts with share price-juicing hype?
Do companies try to damp down the hype for the sake of long term sanity? Or do they go with the hype to get maximum juicing and bury any sceptical voices?