Open source license that doesn’t allow your code to be used for AI data training?

https://feddit.online/c/opensource/p/1476571/open-source-license-that-doesnt-allow-your-code-to-be-used-for-ai-data-training

Open source license that doesn’t allow your code to be used for AI data training?

Does a license like this exist?

Literally any license that says any derivatives of your work must include attribution, bonus points if you use a license that says derivative works must be shared under the same license, eg GPL or MIT.

The AI Bros will take it anyway though and ignore your license, and the courts are very pro-business and AI is like half the US economy at this point, so it’s probably all pretty pointless.

They’re currently being sued for just that in Doe’s v. GitHub et al, which has already been going on for years. Currently its waiting to be scheduled in the 9th circuit, they’ve already been waiting 18 months.

MIT is a permissive license and it doesnt enforce anything.

if you want freedom, use strict licenses like GPL.

Gnu has a great guide on licenses

www.gnu.org/…/license-recommendations.html

www.gnu.org/licenses/license-list.html

How to Choose a License for Your Own Work - GNU Project - Free Software Foundation

MIT still requires the license and copyright notice to be maintained though, it is why even proprietary software includes an ‘open-source licenses’ listing somewhere under help / alongside the distribution. Arguably, AI models reproducing a bit of MIT licensed code would be just as much in violation as with any other license.

GPL still gives much better guarantees w.r.t. providing the source code and modifications made thereto, though.