@fm2279@social.coop @jsbarretto@social.coop Just throwing in my 2 cents that I don't think data poisoning is as effective as forcing companies to pay an acquisition cost for good text. Doubtless they have a QA pipeline, one that includes low-paid workers overseas, that can filter out generated text pretty effectively and at low cost. They'll have a much harder time avoiding tarpits, which could really slow down scraping if deployed at a large enough scale. As @jsbarretto@social.coop suggested, having the tarpits be as fast as possible and as diverse as possible furthers that aim.

I was also thinking a model like the one signature-based malware or adware detectors use would be cool too. Whenever one tarpit discovers a new technique for snaring bots or keeping bots snared longer, it pushes that innovation to a database where other tarpits can download it and adjust their own strategy, allowing tarpits to collectively adapt faster than the bots can.
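Roughly what I mean, as a minimal sketch: each tarpit periodically pulls a shared feed of "snare technique" signatures and merges any it hasn't seen into its local strategy. The endpoint URL, field names, and file paths below are all made up for illustration, not an existing service.

```python
import json
import urllib.request

# Hypothetical community feed serving tarpit techniques as JSON
# (e.g. link-maze layouts, response-delay curves, bot fingerprints).
SIGNATURE_FEED = "https://example.org/tarpit-signatures.json"  # placeholder URL

def fetch_signatures(url: str = SIGNATURE_FEED) -> list[dict]:
    """Download the latest shared techniques from the community feed."""
    with urllib.request.urlopen(url, timeout=10) as resp:
        return json.load(resp)

def merge_into_local(signatures: list[dict], local_path: str = "techniques.json") -> None:
    """Add any techniques this tarpit hasn't seen yet to its local strategy file."""
    try:
        with open(local_path) as f:
            local = {t["id"]: t for t in json.load(f)}
    except FileNotFoundError:
        local = {}
    for sig in signatures:
        local.setdefault(sig["id"], sig)  # keep existing entries, only add new ones
    with open(local_path, "w") as f:
        json.dump(list(local.values()), f, indent=2)

if __name__ == "__main__":
    merge_into_local(fetch_signatures())
```

A pull model like this keeps each tarpit independent (no central coordinator that scrapers could target), and the same feed could accept uploads from tarpits that discover new tricks.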

Anyway thanks for the good work!
