ok genuine crisis of faith here. I've got two games I've been working on, both are intended to be fully open source. One is nearly ready for a public release, the other is probably realistically a year or two out. I feel it is important for the source to be available for the former, and it might not yet be part of any training sets because it only has 17 stars on github, but I'm assuming it probably is already.

The latter is largely not online at all, because I simply haven't built it yet...

I was ambivalent about my code going into training sets, but now for a variety of exciting reasons I am no longer ambivalent. So here's the conflict: the only way I can think of to realistically prevent it from ever going into the plagiarism machine is to simply never make the source available. There we go. That's the crisis of faith.
This is all possible because we had a healthy commons and so that's the reward for building a healthy commons: enclosure by billionaires who are promising the end of everyone's ability to make a living in a creative field. If that really will come to pass, we'd have been better off if we never shared anything on the internet and that really hurts.
right now my best answer is just... not open source anything. or at least, hang on to it for a few years, see what happens with this bubble. if it pops and takes down half the west coast tech industry, nothing was lost and source code can go online again. if it doesn't though, well, i guess go ask the god mommy machine to hallucinate an interactive video for you or something i'm sure it'll be just as good 🙄

@aeva Honestly, it's hard to really say anything at all. Corporations clearly don't give a flying fuck about licensing, but I suppose the best route forward is to favor GPL-style ones wherever possible (permissive licensing often becomes an okay for corporations to exploit your work).

Oh, and skip out on GitHub if you can. Use Codeberg or your own Forgejo instance if you have the resources.

While one can hope this bubble will pop and (ideally) take some of the tech industry with it, it's still a lose-lose situation.

@ColorfulCeleste I doubt using codeberg or any other public forgejo instance is effective at preventing source code from being scraped into training sets beyond the most opportunistic scraping, but github definitely is the fast path to the bad ending. I am also skeptical that github will continue to exist (without getting much much worse) after the bubble pop because the paid services are redundant to azure and the rest is a significant cost that wont be paying for itself anymore.
@ColorfulCeleste the number of people who replied to my thread to say there had been feeling the same thing was... well, it didn't make me feel better, ok let's put it this way. we are constantly bombarded by Problems, and the number of Actionable Problems is always lower than the number of Known Problems, much as the set of Real Problems and Perceived Problems don't fully overlap. Knowing which taxonomy is relevant is itself a form of agency even if the Problem is not Real and Actionable

@ColorfulCeleste if a particular Problem is Real, not directly Actionable, the most useful thing to do is then plan accordingly and put any work in you can do to minimize the damage.

so like Actionable problem is "bad thing will happen unless I do X" and not directly Actionable is "bad thing will happen, and I can survive it and/or help others survive it if I do X"

@ColorfulCeleste a significant amount of agreement and virtually non-existent dissent is not proof of Real or Actionable, but comparing notes with people you trust and respect can bring some clarity to what is probable or improbable
@aeva That makes a lot of sense. I doubt there's much else I can say that wouldn't just be regurgitating what has already been said in this thread, but I definitely see where you are coming from :(