If it were up to me, Apple would create the world’s first “Do Not Train” registry. Enter your domain or web page, get a TXT record or meta tag to prove ownership, prevent Apple from using your content to train any LLM.
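A rough sketch of what the meta tag half of that ownership check might look like, using only Python's standard library. Everything specific here is an assumption for illustration: no such registry exists, and the tag name `do-not-train-verification`, the token format, and the function name are all hypothetical.

```python
from html.parser import HTMLParser

# Hypothetical tag name; no real registry defines this today.
META_NAME = "do-not-train-verification"


class _MetaTokenParser(HTMLParser):
    """Collects the content of every <meta name=META_NAME> tag on a page."""

    def __init__(self):
        super().__init__()
        self.tokens = []

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr = dict(attrs)
        # Attribute values can be None (e.g. <meta name>), so guard before lowering.
        name = (attr.get("name") or "").lower()
        content = attr.get("content")
        if name == META_NAME and content:
            self.tokens.append(content.strip())


def page_proves_ownership(html: str, issued_token: str) -> bool:
    """Return True if the page carries the token the registry issued for it."""
    parser = _MetaTokenParser()
    parser.feed(html)
    return issued_token in parser.tokens
```

In this sketch the registry would hand the site owner a random token, fetch the page later, and only honor the opt-out if `page_proves_ownership(fetched_html, token)` returns `True`. A TXT record variant would follow the same idea with a DNS lookup in place of the HTML fetch.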

Only a small percentage of sites would do it, I think, so the training impact would be low, but it’d be extremely meaningful for those people. And it’d be a valuable PR tool, brand-booster, and competitor-shamer. (Bonus: make it open and encourage competitors to follow it.)

@cabel It'd be at least the second; ServiceNow and Hugging Face have been doing it for at least a year: https://github.com/bigcode-project/opt-out-v2

https://spawning.io have also been advocating for an EU-based centralized opt-out registry (since EU law requires opt-outs starting in less than a year, but doesn't require a central registry), but I'm not sure how far along their proof-of-concept is.
