Been thinking a lot about
@algernonI'm in my database, and I don't like it's recent post on FLOSS and LLM training. The frustration with AI companies is spot on, but I wonder if there's a different strategic path. Instead of withdrawal, what if this is our GPL moment for AI—a chance to evolve copyleft to cover training? Tried to work through the idea here: Histomat of F/OSS: We should reclaim LLMs, not reject them.
@hongminhee洪 民憙 (Hong Minhee)
@algernonI'm in my database, and I don't like it Personally, I don't mind AI scrapers. Developers of closed-source projects have always been using copylefted code, today it is just easier as they don't need to hide anymore.
However, I don't think the resistance is futile:
>OpenAI and Anthropic have already scraped what they need. GitHub already has everyone's code. The training data exists.
Because they will need more data, and easily accessible training data pools (e.g. Github) are slowly becoming poisoned with slop. Did they find a solution to this problem?
>GPLv4
I like this idea!
If you have a fediverse account, you can quote this note from your own instance. Search https://mitra.social/objects/019bc85b-d259-b536-1cbf-26d8cd9ad71f on your instance and quote it. (Note that quoting is not supported in Mastodon.)