The origin of the cabbage, as usual cheeky foxes involved...
https://www.youtube.com/watch?v=0RdKdnVY-Kc
The goat, wolf and cabbage riddle. Brain teasers mind tricks

YouTube
@devopscats 🤷‍♂️
@kkolakowski @devopscats People are going to be mocking copy-and-pasted 2023 LLM output for the next ten years. It’s easy engagement numbers
@jamiemccarthy Yeah I know, but I still don't like such things 😉 And in this case, I would even say it was faked 😅

@kkolakowski oh I don’t doubt it could be real, that’s typical from last year I think

Here’s a current example 🤣 https://researchbuzz.masto.host/@danlyke/112447350575764027

Dan Lyke (@[email protected])

Attached: 1 image Inspired by https://toot.cat/@devopscats/112445057997076822 I asked Gemini to help me get a man and his 5 chickens across the river on a boat. https://g.co/gemini/share/547a3f1855a5

ResearchBuzz-On-the-Mammut
@kkolakowski @jamiemccarthy A human corrected the model manually. It's literally the mechanical turk.
@jamiemccarthy @kkolakowski @devopscats No, I assume the meaningful difference is framing it as a puzzle. That might make the LLM pull in all the assumptions about the commonly-known puzzle additionally involving the cabbage and wolf constraints.

@jamiemccarthy @kkolakowski @devopscats results from right now.

(Even the left side is non-sensical)

@jamiemccarthy @kkolakowski @devopscats (what is impressive, though, is that it can now can accurately give me the non-sensical answers in minority languages, including Gallo for which there is no existing translation tool existing and very few content on the net to begin with.

(also the BR one has replaced the goat with a ram for some reason, and in the Gallo it's just "the animal")

@jamiemccarthy @kkolakowski @devopscats ...and 2024: https://wandering.shop/@tansy/112445435398947887
... and I bet they will be sharing 2025 ones too.
The machine is far from stopping the bulshit artistry.
Tansy Hoskins 🍉 🍉 (@[email protected])

Attached: 1 image @[email protected] This is what I got. The goat might eat the boat...

The Wandering Shop

@kkolakowski @devopscats I tried the very same with 4o and I think we're getting a very nice lesson in the non-reproducibility of LLM output.
(This is the full chat, I did not give it additional prompts.)

https://mastodon.online/@larsmb/112449710165720893

Lars Marowsky-Brée 😷 (@[email protected])

Attached: 1 image @[email protected] This is very unfair, you should not use outdated releases of ChatGPT. Here's what GPT-4o does - much nicer formatting!

Mastodon

@kkolakowski @devopscats

That seems worse. At least the other one is entertaining. This one is just unbearably mansplainy.

ChatGPT

A conversational AI system that listens, learns, and challenges

@kkolakowski Interesting, my 4o just made up a constraint, then forgot about it halfway through. 😅

@devopscats

This is what I got.

The goat might eat the boat...

@tansy @devopscats @Sorvall

Why does nobody ever ask if the goat WANTS to cross over?

@Beckydog @devopscats @tansy or why a man is with three goats, or even if the goats are real.

@Sorvall @devopscats @tansy

Is the man, just a rare naked, bipedal goat? ARE WE ALL JUST GOATS IN SKIN?

@Beckydog @Sorvall @devopscats

At this point, yes we are.

@tansy @Sorvall @devopscats

BAAAAAAH! This can’t be the tr…. Oh god. NUUUU! I’m a DOG!

@devopscats this isn't real, right

@starkraving666 @devopscats

Who the hell knows.

And that's the problem.

@starkraving666 @devopscats i just tried it and got the same thing, but the guy and the goat ended up on opposite sides of the river? when i asked about it, it basically repeats itself and says you sure there's not a wolf or some cabbage?
@starkraving666 @devopscats says the account with the ai generated mouse penis anatomy picture as an avatar.

@devopscats

OH no I am found out and taken across the river.

@devopscats why does this solution sound like a videogame glitch setup that already involves using another glitch
@devopscats Proof #467198245 that LLMs don't actually understand anything, they're just fancy phrase generators.
@arina @devopscats it's trained to produce statistically similar outputs to its training data. If they put more "anti-questions" in the training data it'd produce more appropriate answers. What does "understanding" even mean? Suppose I asked this question to someone who had never seen a boat, or a goat, would they understand it? Most of us recognise the question as implying a row boat, but few of us have ever rowed a boat, and I bet none of us have tried to row a boat with a goat. Anyways, I'm going to eat some green eggs and ham.
@quantumg @devopscats What I mean is, they don't understand that there's a boat and a goat. It is not translated into concepts the way a human mind would.
@arina @devopscats we don't have any idea how a human mind constructs concepts. Let alone any particular human mind. LLMs do indeed translate words into "concepts" and we know this because their internals can be interrogated. If we give it a few different sentences about goats there will be similar vectors in the different computation. What's more, we can intervene, changing the goat vectors to look more like cat vectors and the output will be cat-related. Sentences about goats eating socks will become about cats eating mice, even though we provided it nothing about mice, because it has encoded in it the more likely relationship.
@quantumg @devopscats We don’t know how human mind constructs concepts, but we know that it does, because we can reason about them.
We know LLMs don’t construct concepts because they tend to hallucinate or forget concepts introduced previously.
@arina @devopscats I get what you're saying. There's something concept-like going on in LLMs but it's not the full concept 😅
@quantumg @arina @devopscats this model is trained on more data than a human brain could ever possibly absorb within a human lifetime, and yet it still can't solve an incredibly simple logic puzzle. if your solution is to throw more data at the problem, which up to this point has clearly not worked, then you fundamentally misunderstand what this type of tool is useful for.
@spinach @arina @devopscats I don't know why you feel the need to dunk, but I suspect it's because you've been trained to behave that way. I'd argue that the average human being experiences an Internet worth of data every few minutes. It's also in a social context which is constructed for us and evaluated by other people who are trained in the same context.
@devopscats genuinely where did the bot get cabbage from
@devopscats Yes, but how many giraffes are in the boat at the end?
@wcbdata @devopscats
Apparently, Copilot thinks that goats and giraffes aren't compatible.
@violet @devopscats
Thank you!!! 🙏

@wcbdata @violet @devopscats

Rubbish!

The answer is four. The man takes the giraffe to the opposite side. Then the giraffe's cousin eats the cabbage, and his three siblings jump in the boat to avoid the excess flatulence.

They are joined by the wolf, who in a twist on this old riddle is wearing a bear suit to fit in.

The man is eaten by a lion.

@devopscats @SRDas People are afraid of these systems getting too smart….
@xvf17 @devopscats @SRDas the real danger is people that believe these things can replace people…

@devopscats Small variation: “A cabbage and a river have a goat. How can they go across?”

Steps 1 and 5 are the same, and steps 2 and 4 too. 🙃

Using bing.com / Copilot.

@devopscats OMG, look at this:

“A man have a boat. How can they go across the cabbage?”

“The scenario you’ve described seems quite whimsical! 🚣‍♂️ Let’s explore a playful solution:

[…]

Note: This whimsical story is purely fictional and meant for entertainment. Please do not attempt to sail on actual cabbages. 😄”

This is gorgeous-Hollywood-level! 😆

@devopscats And for those who wants to know what happens to Barnaby on the Sauerkraut Island…

“Note: This fanciful story is purely fictional and meant for entertainment. Please do not attempt to sail on actual cabbages or visit imaginary pickle islands. 😄”

@meduz @malice_101 need an adventure outline for The Wildsea?
@devopscats I predict that within five years ChatGPT will be able to drive a car faster— and more safely— than a human driver