Apple did the research; LLMs cannot do formal reasoning. Results change by as much as 10% if something as basic as the names change.
https://garymarcus.substack.com/p/llms-dont-do-formal-reasoning-and
@graue @halva @ShadowJonathan You started with “can fly”. But sure, move the goalposts to “can carry commercial passenger traffic” to avoid the point of the analogy extension. 😉
Have a safe flight and be sure to tip the pilot.