A year and a half ago, i opened my nginx logs to discover that tens of thousands of individual IPs had started fetching random pages in my git forge. Today, i have mostly beaten these bots and confined them to a torture room where they are endlessly fed garbage, thanks to Iocaine.

This is a post about what worked, what did not, some numbers, and the cost (technical, financial, human) of giant tech companies scraping all of our small services for LLM training.

Guarding My Git Forge Against AI Scrapers vulpinecitrus.info/blog/guardi

0
0
0

If you have a fediverse account, you can quote this note from your own instance. Search https://eldritch.cafe/users/SharpLimefox/statuses/115650261456977101 on your instance and quote it. (Note that quoting is not supported in Mastodon.)