@ebassi @nirbheek @pwithnall @hbons Whether something is a "clean room" implementation or not is also not that easy. Maybe the LLM saw other implementations of the same thing during training, but is that different from you having read some other implementation of something some time before in your life and then writing your own? In either case it's not like the LLM or you can recite* any of the originals but you have an abstract model of it and anything else you ever learned that you work from.
* Give it a try: let a recent LLM write you the FreeBSD implementation of /bin/yes that was surely in its training data and is small enough, and then compare to the original. Or maybe more interesting: let an LLM write Rust/GStreamer code, and you'll clearly see that it learned from code I have written. Just like most humans did if you look over github/etc. Unlike what some humans do, you don't see 1:1 copy&paste of whole little helper functions though.
Very different to that is the case when you (or an LLM) actively look, compare, copy another implementation during development. (Which is also probably more common than we'd like to pretend based on all the code I've seen over the years)
I'm sure we're going to have lots of interesting philosophical discussions between lawyers and courts in the future, ideally with outcomes that don't backfire at us.