Every week there’s an article about how vulnerable package managers are to supply chain attacks, and I’m just amazed it’s taken this long for people to figure out that routinely auto-pulling 500 disparate third-party libraries, unseen, into your project is a terrible idea.
I remember back in my macOS dev days being told that I should be using CocoaPods, and when I told them that was a stupid idea (I had like 3 dependencies and regularly poked around in the source for all of them) I was the old-fashioned old man. “But it automates all the updates!” So what? There are 3. a) I don’t need it, it’s super easy to pull changes from source, and b) when I do it manually I actually *look* at the updates like a sane person would. https://arstechnica.com/?p=2034866
3 million iOS and macOS apps were exposed to potent supply-chain attacks

Apps that used code libraries hosted on CocoaPods were vulnerable for about 10 years.

Ars Technica
Of course, there’s no reason you can’t use automated package managers *and* do the kind of due diligence a responsible developer would do when pulling third-party code into their project, but I don’t think I’ve ever seen anyone do it. Instead it seems normal to implicitly trust anything that comes out of a package management system, no matter who controls it, and that’s always been wild to me.
And the thing is, the number of external dependencies (and their update volume) that you can realistically and properly vet for inclusion in your project is inherently small enough that you don’t need a package manager. And if you need a package manager to handle it all, you can’t be checking everything you’re pulling in, so you’re definitely vulnerable.
“Vetting” can mean delegating due diligence to the publisher (or repackager) rather than personally reading the source, but that just means vetting the publisher instead, and there is only a finite number of publishers you can maintain vetted trust in at any one time. You can’t just assume that the “community” somehow automatically protects you against bad actors. It might, but it’s been shown many times that it might not; sometimes everyone thinks someone else would have spotted a problem, and no one does.
It makes me laugh when I see programmers harping on about their memory-safe languages and how they’re not subject to buffer overruns like the old-man languages, while auto-pulling 500 dependencies from randos on the Internet into their projects without even looking at them.
I've muted this thread because it turned out to be popular and my notifications are now unusable.
@sinbad I think about how the design decisions of a language directly determine the kinds of problems you frequently encounter working in it, and how the strength of a language is partly a matter of how well people can accept the problems it creates. From everything I've heard, Rust's main one is a significantly elevated cognitive-load hurdle for ordinary tasks. That makes me wonder if over-dependence on tiny 3rd-party libraries is pretty much required for most nontrivial Rust projects.
@sinbad I used to assume that the main problem created by high-cognitive-load languages like Rust and Haskell is that algorithmic bugs would simply occur more often due to programmer exhaustion, but from everything I've heard it's not so much that; rather, front-loading more of your debugging means you need a very comprehensive understanding of what you want up front, or you're in for a slog. Easy shortcuts must start to look very appealing very quickly.
@sinbad Personally, if I had to solve every sketch of an idea for Safety, Purity, and Memory Correctness before I could see whether or not it was a bad idea, I'd probably have given up on my dreams years ago.
@aeva @sinbad this is why i've come to view C and C++ as languages for rapid prototyping/experimentation (and if the software to be shipped is a game, then they're also languages for production, because in this use case, memory unsafety makes for good times at GDQ)
@JamesWidman @aeva @sinbad I tend to find that what people rapidly prototype is unique ways to crash. I think it's slightly too simplistic to see it as a strict loss when you're often trading time spent figuring out where your memory-safety crash is coming from against reading an error message from the compiler. (This is not to say Rust is fantastic for prototyping, but neither is C++, so ymmv.)
@JamesWidman @aeva @sinbad effective prototyping is really just about having a bunch of good strategies for simplifying your approach to problems and banging out things without running afoul of the known pitfalls of your language / environment. just like how in C++ and C i have a variety of approaches chosen to avoid creating difficult to deal with memory related sadness (usually: just put it in an array), the same exists for Rust. (also usually: just put it in an array)
@JamesWidman btw you make me wonder now how many speedrun strats are actually memory safety bugs. obviously there's the classic mario ones, but I wonder if there's any notable modern ones. the plot twist of course being that many modern (and not so modern) games are written in memory safe languages already. So I wonder how that shakes out.

@dotstdy some of my favorite speedruns are ones that depend on memory layout; e.g. for _Link to the Past_ and _Ocarina of Time_.

i feel like it might be interesting if game designers reintroduced this kind of thing deliberately!

@JamesWidman ironically, it would be a lot easier to do that sort of thing with a design oriented around simply avoiding memory safety problems, but then just turning all the bounds checks off. e.g. if you build all your game state out of a big mega struct containing a bunch of fixed size arrays. the problem with doing it with "real" memory safety violations in the modern day is that you tend to need sophisticated techniques to prime the heap state / deal with aslr and friends.
@dotstdy yeah, i mean, you would have to design your allocator(s) in such a way that objects are located deterministically, at least relative to each other (so, you can't predict absolute addresses, but you can predict relative addresses)
@JamesWidman Yea, or don't do any dynamic allocation. Code like it's 1999 :)
@aeva @sinbad It's a good example of what I consider the strength of R and its tidyverse: it's opinionated (so there are common ways to do things), and it lets statisticians focus on the data itself instead of on the structure of the language.
It's not always efficient for processing, but that's why they keep rewriting major functions in C++ on the backend.
@sinbad @aeva when you depend on 3rd party Rust code, at least you know the injected ransomware will be memory safe.

@sinbad I'm confused by the implication that people using non-memory-safe languages aren't also relying on dozens upon dozens of libraries (static and dynamic) that they haven't personally vetted.

What, you just trust dpkg ?

ETA Wait, my mistake. You actually do read all the source? How on Earth do you make time to actually write code?

@mark oh sure, non-memory-safe languages can have the same problem, although many of them lack package managers because they’re old.

I don’t use many dependencies (that I haven’t written). And the ones I do use are either from a source I know and trust, or I read the source. Sometimes both.

@sinbad *glances at your Mastodon bio*

... ah, yes, I see. Having some experience rolling my own game engine back in the day, I see how you get there. For the longest time, our only three dependencies were Lua, OpenSSL, and Open Dynamics Engine (and we did have an engineer who'd read all the OpenSSL source and we could have, theoretically, read all of ODE and Lua, though I don't think we ever found occasion to lay eyes on every single byte).

I don't miss the weird corner-case errors, but I do miss being that intimately familiar with almost the whole stack. We had to add some hilarious hacks to both ODE and Lua to get them to work in our context: we made ODE work inside an Internet Explorer plugin by patching over its use of alloca to do a slab alloc from a big chunk of heap, because it wanted preposterous sizes of stack in one go that slammed into IE's hard-coded thread redzone. And we made Lua able to handle "stack unwinding" in a crash by adding a flag to its interpreter that converted every opcode to "UNCONDITIONAL RETURN", so it would unwind its stack if the plugin was trying to close while a script was still running.

@sinbad my hot programming take is that python having awful package management for most of its history was somewhat of a good thing, in that it resulted in most python packages nowadays having very few dependencies
@david @sinbad
Bad package management makes you less likely to bring in a package dependency for something you can write yourself in 10-100 lines of readable code, and you're also less likely to care if the package management is a bit janky if the language is good for doing quite a lot in 10-100 lines of readable code, so the causality works in both directions I think.
@sinbad Writing all the code yourself (I’ve seen it done) is definitely great for your job security and increasing the power of labor.
@sinbad when I was working on a payment provider where we processed credit card numbers and CVVs (highly sensitive data), we had several external dependencies that we had to vet. So eventually I just said "nah, fuck that shit" and vendored them all, source code and everything: we mirrored their repos in our local version control, built them on our local CI, signed the results with our own key, and published them to our local NuGet feed, with none of these systems connected to the internet.
Thanks, .NET, for having most of the stuff we needed in the standard library. This wouldn't have been possible with Node.js.