I've been thinking about removing all open source code I have ever written from the Internet, and re-uploading it exclusively under licenses that prohibit mixing it with code generated by a statistical model.
If you have a fediverse account, you can quote this note from your own instance. Search https://mastodon.social/users/mcc/statuses/115872922320160715 on your instance and quote it. (Note that quoting is not supported in Mastodon.)
RE: https://mastodon.social/@mcc/115872922320160715
To explain my reasoning here—
- You cannot get LLM model creators to comply with a license or law, and they operate in secret, so you don't know they've stolen your code.
- However, people who *use* LLM code are not so lucky. They are non-anonymous. They must comply with the TOSes of their code hosts and the legal departments of their employers.
So attack the problem at the other end. People scrape and steal your code— you can't stop it. But once they do, they get *nothing else from you*.