Mastodawn

Paul Barnfather Jun 3

Hailey

shot, chaser

Show thread

Graham Sutherland / Polynomial Jun 3

@hailey so he did not, in fact, know what he was doing.

Show thread

s0 Traingirl Era Jun 3

@gsuberland @hailey “the author of this library may possibly have been coding based on vibes as it turns out”

Show thread

SorceryForEva 🏳️‍🌈 🏳️‍⚧️Jun 3

@s0 @gsuberland @hailey

He didn't know what he wasn't doing because knowing what he should have been doing made him think he knew what he was doing.

Why you would start a project with possibly error-riddled initial state and then iterate with possibly error-riddled improvements I don't know.

Show thread

Jim Salter 4d ago

@SorceryForEva @s0 @gsuberland @hailey that's been my biggest objection to using AI as a coding agent for quite a while now.

Personally, I find writing my own code easier and less stressful than debugging others' code.

Similarly, I'd MUCH rather drive my own (sigh) Tesla than "supervise" the idiot Autopilot driving my (sigh) Tesla.

Show thread

Chris Adams 4d ago

@jimsalter @SorceryForEva @s0 @gsuberland @hailey It hits the deeper problem that humans are much better at recognizing wrong things in front of us than we are at recognizing that something is missing. LLMs don't create that problem but they're a catalyst for hitting it more since they often exude confidence and don't leave the same tells as a human in over their head might leave.

Show thread

gaytabase Jun 3

@gsuberland @hailey i recognise the name and i guess we could say "he knew what he wasn't doing"

Show thread

sleepy silhouette Jun 3

@hailey i don't mind the job security but there probably should be a company policy to not utilize LLM-generated code in the first place

Show thread

ity [unit X-69]Jun 3

@hailey at least the author admitted that he fucked up lmao

Show thread

dirty badwrong person Jun 3

@ity @hailey so best, best, unrealistically best case scenario, the AI more or less mostly works when under the micromanagement of the kind of person who is willing to publicly admit their own mistakes...

Show thread

Mik3y Jun 3

@hailey lol this is what you get when you use ai

Show thread

FandaSin Jun 3

@hailey

I don't know, I am trying to avoid "AI" as much as i can...
But if I have to check EACH line, why not to write EACH line - then I'll know what each line does instead of guessing it.🤔

Show thread

Franchesca Jun 3

@FandaSin @hailey This. There is something very different about being inside the thought process of writing something, as opposed to just reading the code. PRs suck for spotting things that someone (or an AI) missed out, it’s like you are just coming at it from the wrong angle.

Show thread

FandaSin Jun 4

@Frantasaur

As someone who reviews code from time to time I must say...Yes, it's completely different. (to think as someone else is hard)

I think, that the angle they are comming at it is "I would love to fire all expensive programmers/writters/people who can create stuff" and pay few dollars for "AI".

@hailey

Show thread

Franchesca Jun 4

@FandaSin @hailey They clearly didn’t listen to Fred Brooks when he said there is no silver bullet.
It’s funny how many Fred Brooks quotes apply to AI, actually.

Show thread

Reed Mideke Jun 3

@FandaSin @hailey 💯 Never understood the attraction of "This will get you something that superficially looks mostly OK, then you just need a subject matter expert to analyze literally every single thing it did"

Show thread

stibbons Jun 3

@hailey Ask an AI to play stupid games. And strangely it's not the AI that wins stupid prizes.

Show thread

s0 Traingirl Era Jun 3

@stibbons @hailey

Show thread

kel 6d ago

@s0 @stibbons @hailey

Who knew this would become deeply profound 1200% accurate primary foundational advice for the whole spectrum of human experience and life in 2025 🤷

Show thread

Paul Cantrell Jun 3

@hailey @dalias

“But all the tests pass!!”

Show thread

Luna Lactea Jun 3

@inthehands @hailey @dalias Code reads "if test is happening, then pass, else return".

Show thread

cyplo Jun 3

@jackemled @inthehands @hailey @dalias you might be familiar with https://github.com/auchenberg/volkswagen

Show thread

Luna Lactea Jun 3

@cyplo @inthehands @hailey @dalias LMAO

Show thread

Erik Ableson Jun 3

@inthehands @hailey @dalias Ah, but who wrote the tests? 🤔

Show thread

Paul Cantrell Jun 3

@erik @hailey @dalias

Doesn’t matter, you just have to write good tests to test the test-writer, •then• everything is guaranteed correct

Show thread

embix Jun 4

@inthehands @erik @hailey @dalias "you just" could solve a lot of problems, unfortunately no-one "just"

Show thread

Paul Cantrell 6d ago

@brezelradar
Indeed, though in this case, “you just” is a joke; it very much cannot solve the problem. The post is satire of people who think that tests guarantee code correctness.

It’s analogous to saying you can make a turtle fly by putting it on the back of another turtle that can already fly. “And how does •that• turtle fly?” “It’s turtles all the way down!”

Show thread

embix 6d ago

@inthehands I see. 😀👍

My irony/satire/sarcasm detector is kinda on the fritz since around 2019 when all those lunatics went totally bonkers.

> It’s analogous to saying you can make a turtle fly by putting it on the back of another turtle that can already fly. “And how does •that• turtle fly?” “It’s turtles all the way down!”

That's how recursion works, right? Or was it induction 🤔

Show thread

Paul Cantrell 6d ago

@brezelradar
Induction! If you decide you don’t need a base case, then with induction you can prove anything!

Show thread

mkj Jun 3

@hailey This sounds like a case of "just because everything in the code matches the specification, that does not imply that the code correctly implements the specification".

Can't merely cross-reference the actual code with the spec. Also needs to make sure that every requirement in the spec has code!

Which of course is a mistake you can make also *without* generative AI. But I'm sure the generative AI helped out here... /s

🤦‍♂️

Show thread

fuzzyfuzzyfungus Jun 3

@mkj @hailey Some specs could really use a good dose of "everything should be made as simple as possible"

But some implementations really need to remember why the second half of that line is "but not simpler".

Show thread

mkj Jun 3

@fuzzyfuzzyfungus Oh, certainly! But if your claim to fame, as the case might be, is that everything in the code maps to something in the relevant standard, you've still got some distance to go. 🙂

@hailey

Show thread

da_667 Jun 3

@hailey astounding. I can in fact keep hating on cloudflare and be totally justified.

Show thread

Riley S. Faelan Jun 3

@hailey #infosec

Show thread

Mark Pauley Jun 3

@hailey let’s be clear: this is 100% the kind of thing that happens when we do a full rewrite. It’s just that LLM’s make doing a full rewrite much less expensive, so people are going to do it more often.

Show thread

fuzzyfuzzyfungus Jun 3

@unsaturated @hailey The part that is slightly surprising is that this happened despite the claim that "I was trying to validate my skepticism".

It's not like the red team gets a magic bonus to competence; but wanting to find fault should, at least, make you less likely to be misled by something looking pretty competent overall; where someone who is basically skimming to see if the new guy is a loser or not would be more likely to see that it's basically workmanlike and relax.

@hailey @da_667 Lmao

@hailey “the code actually looked pretty good;” therefore, it must be correct—I trust it. Nice…

Show thread

crispy branzino ☭ (skin)Jun 3

@hailey Claude pls.

Show thread

DNA schedule Jun 3

@hailey smh I'd thought Kenton was smarter than that. Sad

Show thread

Chris Hartgerink (they/them)Jun 3

@hailey the vibe is vulnerable

Show thread

Mike Taylor 🦕Jun 3

@hailey https://www.youtube.com/watch?v=cYMfJUMsLj4

Wait a minute. How did this happen? We’re smarter than this. Apparently Not

YouTube

Show thread

aburka 🫣Jun 3

@hailey code review is harder than code authoring

Show thread

Mark T. Tomczak Jun 3

@hailey In general I concur that LLMs are not how we're going to be writing code in the next N years, but in defense of the library and its author, OAuth is a hot mess of a spec and it's an easy mistake to make even if you're hand-coding the thing.

Show thread

Jaime Buelta Jun 3

@hailey @mike Malicious URIs go brrrr

Show thread

junkyard dogboy

Jun 3

@hailey this is delightful i love watching ai people eat shit

Show thread

Neville Park Jun 3

@hailey sauces: https://github.com/cloudflare/workers-oauth-provider?tab=readme-ov-file#written-using-claude
https://nvd.nist.gov/vuln/detail/cve-2025-4143

alternatively, here's an even less accessible version

Show thread

Seph Jun 3

@hailey sadtrombone.xlsx

Show thread

feld Jun 3

@hailey I hope we dunk on humans who make this same type of mistake with equal fervor

@hailey I am off coding since many years. How do you code securely wrt to the RFC? Do usually RFC nowadays have a section on how securely implement things that are known to be exploitable like in this case? I can't recall seeing anything of the sort in most Protocols RFCs (only because I never had to implement them).

Show thread

a kilo of saucepans (rakslice)Jun 3

@splinux @hailey I mean something, that very straightforwardly has security implications like an authorization framework, absolutely https://datatracker.ietf.org/doc/html/rfc6749#section-10.15

RFC 6749: The OAuth 2.0 Authorization Framework

The OAuth 2.0 authorization framework enables a third-party application to obtain limited access to an HTTP service, either on behalf of a resource owner by orchestrating an approval interaction between the resource owner and the HTTP service, or by allowing the third-party application to obtain access on its own behalf. This specification replaces and obsoletes the OAuth 1.0 protocol described in RFC 5849. [STANDARDS-TRACK]

IETF Datatracker

Show thread

a kilo of saucepans (rakslice)Jun 3

@splinux @hailey but also, like it's no sweat to also go see what OWASP has to say about it https://cheatsheetseries.owasp.org/cheatsheets/OAuth2_Cheat_Sheet.html

OAuth2 - OWASP Cheat Sheet Series

Website with the collection of all the cheat sheets of the project.

Show thread

Kevin Granade Jun 3

@splinux @hailey most protocols are specified in an amorphous bundle of different RFCs, some of which are mandatory, some are options that must be implemented as written if adopted, and others are entirely advisory.

I haven't looked at any oauth RFCs in a long time, but I recall requirements along the lines of, "do this specific thing this specific way or else" being relatively common.

Show thread

a kilo of saucepans (rakslice)Jun 3

@hailey is it just me or is the highlighted sentence a really weird thing to write, even under the circumstances?

Show thread

a kilo of saucepans (rakslice)Jun 3

@hailey like it kind of goes without saying that somebody could miss something for reasons other than that they've never heard of the concept before; why does somebody decide this belongs in the literal description of a CVE

Show thread

WillTootMaybe Jun 4

@rakslice @hailey

Perhaps simply as a warning to the others who may follow in his footsteps.

Show thread

Petr Tesařík Jun 4

@rakslice @hailey It's psychology. If you feel shame for something you did, you try to hide it. One common way is to claim exactly the opposite of what you did. But this often just attracts attention to the fact you are trying to hide. Been there, done that.
🤷

Show thread

Noé Lopez Jun 3

@hailey Can't review code that is missing lol

Show thread

Sharlatan Jun 3

@hailey with such a vibe in the readme the vibed code midnight not resist the real life vibrations