AI Agents Could Make Free Software Matter Again, George London

I've been vibe-coding a lot lately. Like, a lot a lot.

With over a decade of open source software that I've written freely available online, I actually really appreciate the value that AI && LLMs have provided me.

The thing that leaves a bad taste in my mouth is that my works were likely included in the training data and, even if that doesn't violate my licenses (GNU GPL 2/3), it certainly feels against the spirit of what I intended when distributing them.

I was made redundant recently "due to AI" (questionable), and it feels like my works in some way contributed to my own redundancy: they added to the profits made by these AI megacorps while I am left a victim.

I wish I could be paid a dividend or royalty, however small, for my contribution to these LLMs, but that will never happen.

I've been looking for a copyleft "source available" license that lets me distribute code openly but has a clause that says "if you would like to use these sources to train an LLM, please contact me and we'll work something out". I haven't found one yet.

I'm guessing that such a license would not be enforceable because I am not in the US, but at least it would be nice to declare my intent, and who knows what the future looks like.

I feel kind of good knowing that my code, design decisions, and styles are now part of the data shaping all software.

Reading this I hear The Roots playing The Seed 2.0[1] in my mind.

It’s wild to think that, of all the things that will remain on this earth after you’re gone, it’ll be your GPL contributions reconstituting themselves as an LLM’s hallucinations.

[1]: https://youtu.be/ojC0mg2hJCc

If we're being clear, it's going to be a lot more than that.

Our comments here on HN are almost certainly going to live in fame/infamy forever. The Twitter firehose is essentially a pathway to 140-character immortality.

You can already summon an agent to ingest essentially a commenter's entire history, correlate it across different sites based on writing style or similar nicknames, and then chat with you as that persona, even more so with a finetune or LoRA. I can do that with my Gmail and text message history, and it becomes eerily similar to me.

History is going to be much more direct and personal in the future. We can also do this with historical figures who left voluminous personal correspondence; that's possible now.

It's very interesting, because I think the era after digitization but before mass LLM usage is going to be the most intensely studied. We've lived through something that is going to be on the cusp of history, for better or worse.