Meta's latest legal wheeze is to insist that pirating books is fair use, actually. And it might be working.

In order to help train its AI models, Meta (and others) have been using pirated versions of copyrighted books, without the consent of authors or publishers. The company behind Facebook and Instagram faces an ongoing class-action lawsuit brought by authors including Richard Kadrey, Sarah Silverman, and Christopher Golden, and one in which it has already scored a major (and surprising) victory: The Californian court concluded last year that using pirated books to train its Llama LLM did qualify as fair use.

You’d think this case would be as open-and-shut as it gets, but never underestimate an army of high-priced lawyers. Meta has now come up with the striking defense that uploading pirated books to strangers via BitTorrent qualifies as fair use. It further goes on to claim that this is double good, because it has helped establish the United States’ leading position in the AI field.

Meta further argues that every author involved in the class-action has admitted they are unaware of any Llama LLM output that directly reproduces content from their books. It says if the authors cannot provide evidence of such infringing output or damage to sales, then this lawsuit is not about protecting their books but arguing against the training process itself (which the court has ruled is fair use).

Judge Vince Chhabria now has to decide whether to allow this defense, a decision that will have consequences for not only this but many other AI lawsuits involving things like shadow libraries. The BitTorrent uploading and distribution claims are the last element of this particular lawsuit, which has been rumbling on for three years now, to be settled.

Meta's latest legal wheeze is to insist that pirating books is fair use, actually

Rather than, y'know, outright theft.

PC Gamer
  • Shorter and more reasonable copyright lengths would make this a moot point because then there would sufficient literature in the public domain to pull from.

  • These kind of charges are what put the Pirate Bay admins in prison and caused Aaron Swartz to kill himself because of a threat of lifetime in prison. The claim that they did this either with the goal of profit or actually successful profit and that this was a serious crime. Neither TPB or Swartz at that point in time had ever moved as much data as Meta has for these claims, nor did they ever have the profit or possibility of profit Meta aims to make from their AI offerings.

  • Now Meta is claiming they’ve profited so hard you can’t possibly hold them accountable.

  • It will be the biggest “fuck you” in history to anyone ever hit with civil charges for piracy in the early 2000s, let alone the TPB admins and Swartz, if they let this go. Which means they probably will because in America, apparently if you crime hard enough and big enough they stop putting you in prison and start patting you on the back and calling it good business sense.

    It’s weird that your take away is “Meta needs to get it” and not “Clearly, these laws work for no one”. You don’t get better copyright laws by cheering for the copyright companies.

    Literally the first thing I said was in regards to more sensible copyright making this all a moot point but you do you.

    The only reason Meta needs to get it is because it’s entirely hypocritical to all the dirt poor people who couldn’t afford these kind of lawyers. It doesn’t make the current legal status right or correct. It’s just a slap in the face to someone like Swartz who died over far less.

    I would rather copyright be amended but sadly that’s less likely to happen here.

    I read this as setting precedent that others couldn’t. Court cases like this are one way to make it possible for everyone to break an absurd law.
    Precedent only applies equally if we are able to prove the same in court as Meta did. Are you going to need petabytes of pirated data to train your AI? Can you afford a team of top quality lawyers to fight your case and prove you were training a small locally-hosted AI at home? Do you think Meta, of all companies, really is fighting for you to be able to do the same as them? You will still get taken to court, you will still have to fight your case, “precedent” isn’t an automatic get out of jail free card. Do you have the money to fight massive copyright holders with endless money? Of course you don’t, none of us do.
    Precedent means we can cite it, so yes, this helps a bit. The rest you wrote is a fair bit of assumption or unnecessary: evidence to back your points would help. Otherwise, it just looks like inconclusive defeatism.