OpenAI now tries to hide that ChatGPT was trained on copyrighted books, including J.K. Rowling's Harry Potter series
OpenAI now tries to hide that ChatGPT was trained on copyrighted books, including J.K. Rowling's Harry Potter series
They’re stealing a ridiculous amount of copyrighted works to use to train their model without the consent of the copyright holders.
This includes the single person operations creating art that’s being used to feed the models that will take their jobs.
OpenAI should not be allowed to train on copyrighted material without paying a licensing fee at minimum.
But they don’t purchase the data. That’s the whole problem.
And copyright is absolutely violated by training off it. It’s being used to make money and no longer falls under even the widest interpretation of free use.
Creating an AI model is a commercial work. They’re made to make money. Now these models are dependent on other artists data to train on. The models would be useless if they weren’t able to train on anything.
I hold the stance that using copyrighted data as part of a training set is a violation of copyright. That still hasn’t been fully challenged in court, so there’s no specific legal definition yet.
Due to the requirement of copywritten materials to make the model function I feel that they are using copyrighted works in order to build a commercial product.
Also AI doesn’t learn. LLMs build statistical models based on sentence structure of what they’ve seen before. There’s no level of understanding or inherent knowledge, and there’s nothing new being added.