2 authors say OpenAI 'ingested' their books to train ChatGPT. Now they're suing, and a 'wave' of similar court cases may follow.
2 authors say OpenAI 'ingested' their books to train ChatGPT. Now they're suing, and a 'wave' of similar court cases may follow.
There’s an additional question: who holds the copyright on the output of an algorithm? I don’t think that is copyrightable at all. The bot doesn’t really add anything to the output, it’s just a fancy search engine. In the US, in particular, the agency in charge of Copyrights has been quite insistent that a copyright can only be given to the output if a human.
So when an AI incorporates parts of copyrighted works into its output, how can that not be infringement?
How can you write a blog post reviewing a book you read without copyright infringement? How can you post a plot summary to Wikipedia without copyright infringement?
I think these blanket conclusions about AI consuming content being automatically infringing are wrong. What is important is whether or not the output is infringing.
You can write that blog post because you are a human, and your summary qualifies for copyright protection, because it is the unique output of a human based on reading the copywrited material.
But the US authorities are quite clear that a work that is purely AI generated can never qualify for copyright protection. Yet since it is based on the synthesis of works under copyright, it can’t really be considered public domain either. Otherwise you could ask the AI “Write me a summary of this book that has exactly the same number of words”, and likely get a direct copy of the book which is clear of copyright.
I think that these AI companies are going to face a reckoning, when it is ruled that they misappropriated all this content that they didn’t explicitly license for use.
But the US authorities are quite clear that a work that is purely AI generated can never qualify for copyright protection.
Which law says this? The government is certainly discussing the problem, but I wasn’t aware of any legislation.
If there is such a law, it seems to overlook an important point: an algorithm - an AI - is itself an expression of human intelligence. Having a computer carry out an algorithm for summarizing content can be indistinguishable from a person having a pattern they follow for writing summaries.