Here's where you go to tell GitHub that it can’t train CoPilot on your code & interactions: https://github.com/settings/copilot/features
Just sayin’
Here's where you go to tell GitHub that it can’t train CoPilot on your code & interactions: https://github.com/settings/copilot/features
Just sayin’
@soc @timbray I'm ~90% certain that setting just disables training on your copilot sessions. They absolutely 100% have already trained on all of your public repos by declaring it fair use. I suspect most of the noise is people who don't use copilot and are upset about the latter.
GitHub is managed by Microsoft's CoreAI group. They couldn't be clearer about what they expect in return for free hosting at this point.
@slembcke @soc @timbray "The Pile" is a common training set that includes a snapshot of everything publicly visible on GitHub. "The Stack" is a less common training set that includes most permissively licensed things (non-GPL, e.g.) on GitHub, subject to opt-out.
One judge roughly said training is "fair use", but the next judge rejected that claim. It's quite uncertain right now, legally. (Both were U.S. judges; I'm OOtL for other countries.)
There are a lot of people that are going to scrape anything publicly available, and then come up with a ethical / legal justification later, and being off the biggest hosting sites might help avoid that.