So why aren't the big AI companies more transparent about what's in the data that they use to train their models?
One reason, experts say, is that they're afraid they'd get in trouble if people found out. https://www.washingtonpost.com/technology/interactive/2023/ai-chatbot-learning/
@willoremus Remember when Google scanned and digitized All the World’s Books (approx.) without asking authors for permission or offering any compensation? Many people thought that was just fine. They said it would only give authors more “visibility.” They said it was “fair use.”
I’m still bitter about it. This was one of Google’s principal motivations, and few understood that at the time.