GitHub has a setting "Allow GitHub to use my data for AI model training" which defaults to Enabled. You might want to turn it off, though it's probably too late and likely won't stop other bots crawling your code.

https://github.com/settings/copilot/features

@dougbinks "code"

@dougbinks To make things complex, I don't have a fundamental problem with a Chinese AI lab that releases the resulting weights for me to use for free. It works, and it's one of the best models currently available.

But giving the data to Microslop, so they can keep them closed up tight, so they can sell them back to me? Yeah, no thanks.

@wolfpld Personally I'd rather neither trained on the code I write, but I am indeed more perturbed by those who want to sell access to it back.