Since you’re writing a book you may also want to consider the impact of opaque data-less “open source” models in terms of creating a “chain of custody” from your work back to the content oligarchs: Disney, NYT, etc. but with models Rajiva’s on Common Crawl, everyone on the internet.
That today’s #opensource proponents like the OSI are actively participating in what effectively amounts to contributory infringement of copyrights boggles the mind!
@sj @florenciocano because that's not how copyright works unless you'd also be perfectly fine with being a perpetual debt peon to the #Contentmafia...
I’m afraid @senficon is mistaken (as you can see from the many comments pointing this out) and his posts (eg https://opensource.org/blog/copyright-law-makes-a-case-for-requiring-data-information-rather-than-open-datasets-for-open-source-ai) are being used by the bandits at the OSI to make the case for #OSAID based on wishful thinking.
These models are behaving more like photocopiers, which clearly do not magically erase copyrights.
For example, consider the GPL-protected output of Linux as a Model (LaaM) https://github.com/alea-institute/laam
When we look at applying Open Source principles to the subject of AI, copyright law comes into play, especially for the topic of training data access. Open datasets have been a continuous discussion point in the collaborative process of writing the Open Source AI Definition. I would like to explain why the concept of data information is a viable alternative for the purposes of the OSAID.
@sj @florenciocano #NotLegalAdvice but I still think @senficon is correct in his analysis because #copyright is way more stringent than say #patents.
Don't get me wrong, I think wannabe "#AI" is still bad as in #WastefulComputing of the #Shitcoin kind (i.e. #Bitcoin) and that the very few useful applications are overshadowed by #grifters wanting to #scam #VC's off their money.

Not even sure where to start with this…
Here’s a song: https://youtu.be/5pidokakU4I
