If you want to ensure your content does not get indexed by big tech LLMs, just keep it in your robots.txt file.
@codepo8 I wonder what would happen if one were to embed into the robots.txt file “X5O!P%@AP[4\PZX54(P^)7CC)7}$EICAR-STANDARD-ANTIVIRUS-TEST-FILE!$H+H*”
@auroran @codepo8 @ap only one way to find out