Mastodawn

devopscats

Show thread

devopscats May 15, 2024

The origin of the cabbage, as usual cheeky foxes involved...
https://www.youtube.com/watch?v=0RdKdnVY-Kc

The goat, wolf and cabbage riddle. Brain teasers mind tricks

YouTube

Show thread

Cycling Stu May 15, 2024

@devopscats that’s beautiful

Show thread

Woozle Hypertwin May 15, 2024

@devopscats LLM self-parody ftw :D

Show thread

Why Not Zoidberg? 🦑May 15, 2024

@woozle @devopscats "But LLMs makes everything much easier"

Show thread

Konrad Kołakowski May 15, 2024

@devopscats 🤷‍♂️

Show thread

Jamie McCarthy May 15, 2024

@kkolakowski @devopscats People are going to be mocking copy-and-pasted 2023 LLM output for the next ten years. It’s easy engagement numbers

Show thread

Konrad Kołakowski May 15, 2024

@jamiemccarthy Yeah I know, but I still don't like such things 😉 And in this case, I would even say it was faked 😅

Show thread

Jamie McCarthy May 15, 2024

@kkolakowski oh I don’t doubt it could be real, that’s typical from last year I think

Here’s a current example 🤣 https://researchbuzz.masto.host/@danlyke/112447350575764027

Dan Lyke (@[email protected])

Attached: 1 image Inspired by https://toot.cat/@devopscats/112445057997076822 I asked Gemini to help me get a man and his 5 chickens across the river on a boat. https://g.co/gemini/share/547a3f1855a5

ResearchBuzz-On-the-Mammut

Show thread

Tom Bellin

May 15, 2024

@kkolakowski @jamiemccarthy A human corrected the model manually. It's literally the mechanical turk.

Show thread

Trolli Schmittlauch 🦥May 16, 2024

@jamiemccarthy @kkolakowski @devopscats No, I assume the meaningful difference is framing it as a puzzle. That might make the LLM pull in all the assumptions about the commonly-known puzzle additionally involving the cabbage and wolf constraints.

Show thread

Ash_Crow Jun 9, 2024

@jamiemccarthy @kkolakowski @devopscats results from right now.

(Even the left side is non-sensical)

Show thread

Ash_Crow Jun 9, 2024

@jamiemccarthy @kkolakowski @devopscats (what is impressive, though, is that it can now can accurately give me the non-sensical answers in minority languages, including Gallo for which there is no existing translation tool existing and very few content on the net to begin with.

(also the BR one has replaced the goat with a ram for some reason, and in the Gallo it's just "the animal")

Show thread

Polly Kraisus Jun 9, 2024

@jamiemccarthy @kkolakowski @devopscats ...and 2024: https://wandering.shop/@tansy/112445435398947887
... and I bet they will be sharing 2025 ones too.
The machine is far from stopping the bulshit artistry.

Tansy Hoskins 🍉 🍉 (@[email protected])

Attached: 1 image @[email protected] This is what I got. The goat might eat the boat...

The Wandering Shop

Show thread

Lars Marowsky-Brée 😷May 16, 2024

@kkolakowski @devopscats I tried the very same with 4o and I think we're getting a very nice lesson in the non-reproducibility of LLM output.
(This is the full chat, I did not give it additional prompts.)

https://mastodon.online/@larsmb/112449710165720893

Lars Marowsky-Brée 😷 (@[email protected])

Attached: 1 image @[email protected] This is very unfair, you should not use outdated releases of ChatGPT. Here's what GPT-4o does - much nicer formatting!

Mastodon

Show thread

Donald Hobern May 16, 2024

@kkolakowski @devopscats

That seems worse. At least the other one is entertaining. This one is just unbearably mansplainy.

Show thread

Dieu May 16, 2024

@kkolakowski @devopscats too easy (this one was also framed as a puzzle https://chat.openai.com/share/7b7a92ef-f1aa-4435-a1bf-19976d89ed8a /cc @schmittlauch)

ChatGPT

A conversational AI system that listens, learns, and challenges

Show thread

Sven (retired)May 17, 2024

@kkolakowski Interesting, my 4o just made up a constraint, then forgot about it halfway through. 😅

Show thread

Tansy Hoskins 🍉 🍉May 15, 2024

@devopscats

This is what I got.

The goat might eat the boat...

Show thread

devopscats May 15, 2024

@tansy sounds like a goat

Show thread

Tansy Hoskins 🍉 🍉May 15, 2024

@devopscats

🐐 🐐

Show thread

Paul Wilde

May 15, 2024

@tansy @devopscats this is all gold!

Show thread

miketcope May 16, 2024

@tansy @devopscats the rowing goat is the GOAT goat

Show thread

Tansy Hoskins 🍉 🍉May 16, 2024

@copito @devopscats

😂 so many questions...

Show thread

Becky May 16, 2024

@tansy @devopscats @Sorvall

Why does nobody ever ask if the goat WANTS to cross over?

Show thread

Sorvall May 16, 2024

@Beckydog @devopscats @tansy or why a man is with three goats, or even if the goats are real.

Show thread

Becky May 16, 2024

@Sorvall @devopscats @tansy

Is the man, just a rare naked, bipedal goat? ARE WE ALL JUST GOATS IN SKIN?

Show thread

Tansy Hoskins 🍉 🍉May 16, 2024

@Beckydog @Sorvall @devopscats

At this point, yes we are.

Show thread

Becky May 16, 2024

@tansy @Sorvall @devopscats

BAAAAAAH! This can’t be the tr…. Oh god. NUUUU! I’m a DOG!

Show thread

Mia Holte May 16, 2024

@tansy @devopscats haha 😂

Show thread

RedGlow May 15, 2024

@devopscats *nom nom nom*

Show thread

starkraving666 May 15, 2024

@devopscats this isn't real, right

Show thread

Violet Madder May 15, 2024

@starkraving666 @devopscats

Who the hell knows.

And that's the problem.

Show thread

Wilson "Beans Clock" Scraddock May 15, 2024

@starkraving666 @devopscats i just tried it and got the same thing, but the guy and the goat ended up on opposite sides of the river? when i asked about it, it basically repeats itself and says you sure there's not a wolf or some cabbage?

Show thread

starkraving666 May 15, 2024

@beanclock @devopscats ahahahaha holy shit

Show thread

Tom May 16, 2024

@starkraving666 @devopscats says the account with the ai generated mouse penis anatomy picture as an avatar.

Show thread

Maverynthia🌱May 15, 2024

@devopscats

OH no I am found out and taken across the river.

Show thread

chtruchet May 15, 2024

@devopscats makes sense... 😂

Show thread

[Yaseenist] CauseOfBSOD

May 15, 2024

@devopscats why does this solution sound like a videogame glitch setup that already involves using another glitch

Show thread

Luna Ruby Artemis 💛🤍💜🖤May 15, 2024

@devopscats Proof #467198245 that LLMs don't actually understand anything, they're just fancy phrase generators.

Show thread

Trent Waddington May 15, 2024

@arina @devopscats it's trained to produce statistically similar outputs to its training data. If they put more "anti-questions" in the training data it'd produce more appropriate answers. What does "understanding" even mean? Suppose I asked this question to someone who had never seen a boat, or a goat, would they understand it? Most of us recognise the question as implying a row boat, but few of us have ever rowed a boat, and I bet none of us have tried to row a boat with a goat. Anyways, I'm going to eat some green eggs and ham.

Show thread

Luna Ruby Artemis 💛🤍💜🖤May 15, 2024

@quantumg @devopscats What I mean is, they don't understand that there's a boat and a goat. It is not translated into concepts the way a human mind would.

Show thread

Trent Waddington May 15, 2024

@arina @devopscats we don't have any idea how a human mind constructs concepts. Let alone any particular human mind. LLMs do indeed translate words into "concepts" and we know this because their internals can be interrogated. If we give it a few different sentences about goats there will be similar vectors in the different computation. What's more, we can intervene, changing the goat vectors to look more like cat vectors and the output will be cat-related. Sentences about goats eating socks will become about cats eating mice, even though we provided it nothing about mice, because it has encoded in it the more likely relationship.

Show thread

Luna Ruby Artemis 💛🤍💜🖤May 16, 2024

@quantumg @devopscats We don’t know how human mind constructs concepts, but we know that it does, because we can reason about them.
We know LLMs don’t construct concepts because they tend to hallucinate or forget concepts introduced previously.

Show thread

Trent Waddington May 16, 2024

@arina @devopscats I get what you're saying. There's something concept-like going on in LLMs but it's not the full concept 😅

Show thread

Saffron🏳️‍⚧️May 15, 2024

@quantumg @arina @devopscats this model is trained on more data than a human brain could ever possibly absorb within a human lifetime, and yet it still can't solve an incredibly simple logic puzzle. if your solution is to throw more data at the problem, which up to this point has clearly not worked, then you fundamentally misunderstand what this type of tool is useful for.

Show thread

Trent Waddington May 15, 2024

@spinach @arina @devopscats I don't know why you feel the need to dunk, but I suspect it's because you've been trained to behave that way. I'd argue that the average human being experiences an Internet worth of data every few minutes. It's also in a social context which is constructed for us and evaluated by other people who are trained in the same context.

Show thread

:spinny_cat_inderix:May 15, 2024

@devopscats genuinely where did the bot get cabbage from

Show thread

Bill, organizer of stuff May 15, 2024

@devopscats Yes, but how many giraffes are in the boat at the end?

Show thread

Violet May 15, 2024

@wcbdata @devopscats
Apparently, Copilot thinks that goats and giraffes aren't compatible.

Show thread

Bill, organizer of stuff May 15, 2024

@violet @devopscats
Thank you!!! 🙏

Show thread

IceNine May 16, 2024

@wcbdata @violet @devopscats

Rubbish!

The answer is four. The man takes the giraffe to the opposite side. Then the giraffe's cousin eats the cabbage, and his three siblings jump in the boat to avoid the excess flatulence.

They are joined by the wolf, who in a twist on this old riddle is wearing a bear suit to fit in.

The man is eaten by a lion.

Show thread

Zen Zero ☯️ ◯May 15, 2024

@devopscats @SRDas People are afraid of these systems getting too smart….

Show thread

Dr. Juande Santander-Vela May 15, 2024

@xvf17 @devopscats @SRDas the real danger is people that believe these things can replace people…

Show thread

meduz'May 15, 2024

@devopscats Small variation: “A cabbage and a river have a goat. How can they go across?”

Steps 1 and 5 are the same, and steps 2 and 4 too. 🙃

Using bing.com / Copilot.

Show thread

meduz'May 15, 2024

@devopscats OMG, look at this:

“A man have a boat. How can they go across the cabbage?”

“The scenario you’ve described seems quite whimsical! 🚣‍♂️ Let’s explore a playful solution:

[…]

Note: This whimsical story is purely fictional and meant for entertainment. Please do not attempt to sail on actual cabbages. 😄”

This is gorgeous-Hollywood-level! 😆

Show thread

meduz'May 15, 2024

@devopscats And for those who wants to know what happens to Barnaby on the Sauerkraut Island…

“Note: This fanciful story is purely fictional and meant for entertainment. Please do not attempt to sail on actual cabbages or visit imaginary pickle islands. 😄”

Show thread

Kaspi May 16, 2024

@meduz @malice_101 need an adventure outline for The Wildsea?

Show thread

Traffic Cone Carlie May 15, 2024

@devopscats
Dennis, stop confusing the AI!

Show thread

mcc May 15, 2024

@devopscats I predict that within five years ChatGPT will be able to drive a car faster— and more safely— than a human driver

Show thread

Schafstelze 🎗🖤May 15, 2024

@mcc @devopscats you set the bar low...