Mastodawn

@yabellini@fosstodon has moved Jan 9, 2024

Did you realize that we live in a reality where SciHub is illegal, and OpenAI is not?

Duco Jan 10, 2024

@yabellini SciHub makes papers public that are behind paywalls. I agree that they shouldn't be behind paywalls, but that is completely different from what OpenAI does.
I think they mostly used sources that are public anyway, like Wikipedia. They also didn't publish those sources; they trained an AI on them, and it creates new texts. So in a way they did a remix. Remixes are handled differently in copyright law.
"The corpus [GPT-2] was trained on, […] 40 [GB] of text from URLs shared in Reddit" https://en.wikipedia.org/wiki/OpenAI



@yabellini@fosstodon has moved Jan 10, 2024

@duco

https://www.nytimes.com/2023/12/27/business/media/new-york-times-open-ai-microsoft-lawsuit.html#:~:text=1.3k-,The%20Times%20Sues%20OpenAI%20and%20Microsoft%20Over%20A.I.,with%20it%2C%20the%20lawsuit%20said.

I recommend reading the lawsuit. It was written by lawyers who know the law, and it is also very clear:

https://nytco-assets.nytimes.com/2023/12/NYT_Complaint_Dec2023.pdf

New York Times Sues OpenAI and Microsoft Over Use of Copyrighted Work

Millions of articles from The New York Times were used to train chatbots that now compete with it, the lawsuit said.

The New York Times


Duco Jan 10, 2024

@yabellini I cannot read the article because it's behind a paywall, and the other document is 69 pages long. I will not read that. If you want to make a point with it, say what you want to say. Depending on what you say, I will decide whether to check it against the provided sources.


Skylarking Mullet

@duco @yabellini We can bypass paywalls by prepending "archive .is/" to the URL.

https://archive.is/YOFMJ