• Claude code source "leaks" in a mapfile
  • people immediately use the code laundering machines to code launder the code laundering frontend
  • now many dubious open source-ish knockoffs in python and rust being derived directly from the source

What's anthropic going to do, sue them? Insist in court that LLM recreating copyrighted code is a violation of copyright???

@jonny do LLMs trained on gpl code have to be gpl? I don't know whether code-as-data is equivalent to code as executable, but I had honestly never considered that issue before.
@srvanderplas
They sure don't! Or at least if they did the entire industry would collapse overnight.
@jonny @srvanderplas
well, IANAL, but:
1) I have published GPLed code, and AFAI Understand, if the produced code is *linked* to the GPLed code/requires the GPLed code to run, to redistribute the new code it MUST be GPLed.
2) last I checked, the US court system was of the opinion that work produced by AI was NOT COPYRIGHTABLE. AFAIK, that should include any produced code. Other jurisdictions may have differing laws.
@traecer @jonny @srvanderplas I'm not a lawyer and this isn't legal advice, but for AI output and copyright you might find this interesting: (watch and draw your own conclusions) https://hachyderm.io/@ell1e/116313321022811490
@traecer @jonny @srvanderplas There's also this: https://www.twobirds.com/en/insights/2025/landmark-ruling-of-the-munich-regional-court-(gema-v-openai)-on-copyright-and-ai-training It seems to be talking about fair use as it relates to AI training (I could be wrong though, read it for yourself).
Landmark ruling of the Munich Regional Court (GEMA v OpenAI) on copyright and AI training - Bird & Bird

@ell1e @traecer Strictly speaking, it’s not talking about that: “Fair use” is not a legal concept within that court’s jurisdiction.

EU law allows to ignore any and all copyright for “data mining”, and OpenAI tried to argue that since their business is data mining, they never have to care about copyright to begin with. This particular ruling simply says that if your product reproduces lyrics of an entire song, that isn’t just data mining, it is in fact copying.

So what the court says is: Under current EU law, you’re allowed to copy as much data as you want for *training* your LLM, but that doesn’t mean you’re also allowed to actually provide LLM as a service to the public. (IANAL)

Note that this particular ruling is not legal precedent and it’s already being appealed.

@ajnn @traecer You say IANAL, but the lawyer in the clip seems to be a lawyer. Beyond that, I don't have much to say.
@ell1e @traecer I mean, the article you cite doesn’t even mention the words “fair use”? Just because it’s what *you* are familiar with, doesn’t mean it’s a thing anywhere else.