If it were up to me, Apple would create the world’s first “Do Not Train” registry. Enter your domain or web page, get a TXT record or meta tag to prove ownership, prevent Apple from using your content to train any LLM.

Only a small percentage of sites would do it, I think, so the training impact would be low, but it’d be extremely meaningful for those people. And it’d be a valuable PR tool, brand-booster, and competitor-shamer. (Bonus: make it open and encourage competitors to follow it.)

@cabel surely every single enterprise and corporation would immediately register in it given the current temperature? They would, collectively, remove most of the internet from the training model. Every lawyer & biz dev would be making it happen overnight.

This feels like something that has to happen *after* (a) contracts for licensing ‘big’ content have been signed (b) business has collectively accepted that actually training LLMs on their dross isn’t actually impacting their business.