Hackers' Pub

while considering whether i should be looking into replacing astral's ruff for projects i'm responsible for (tl;dr probably not) i realized that code formatting is an excellent place to use a language model. not for developing it, i mean; rather, make the entire formatter be a language model. verify that the AST is semantically equivalent in the end (which is a solved problem for the restricted case we're considering here) and we're golden
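
(a minimal sketch of that verification step, assuming the code being formatted is python and using the standard `ast` module; `same_ast` and `accept_formatting` are names made up for illustration, not part of any existing tool. the idea is just: parse both versions, compare the trees, and fall back to the original if the model changed anything)

```python
import ast


def same_ast(original: str, formatted: str) -> bool:
    """Return True if both sources parse to the same syntax tree."""
    # ast.dump() leaves out line/column attributes by default,
    # so pure layout differences do not show up in the comparison.
    return ast.dump(ast.parse(original)) == ast.dump(ast.parse(formatted))


def accept_formatting(original: str, candidate: str) -> str:
    """Keep the model's output only if it is provably the same program."""
    try:
        ok = same_ast(original, candidate)
    except SyntaxError:
        # Either side failed to parse: do not trust the candidate.
        ok = False
    return candidate if ok else original
```

one caveat with this particular sketch: the `ast` module throws away comments, so a model that drops or rewrites a comment would still pass. a real check would have to compare comments (or tokens) separately.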

while there are people who are fine with the draconian approach taken by tools like black or (to a lesser extent) rustfmt, i find these tools intolerable. they promise consistency but this consistency butchers so much code that i'd rather quarrel with contributors over formatting (and presumably lose some) than read the absolute trash these tools emit in many common cases, with the resolution for this problem being WONTFIX

anyway, semantic style transfer is one of the things LLMs are pretty good at. if i could say "PRs should be formatted 'more or less like this'" and as a result they are formatted 'more or less like this' (with the quality being somewhere in between "reformatting all of it by hand" and "let everyone pick whatever they want" but closer to the first option), with near-zero per-PR action required from all participants, that would be nice

(you should be able to train a model like this by generating random snippets of code formatted in particular ways)
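
(a toy sketch of what generating such data could look like, again assuming python as the target language. everything here is hypothetical: the `NAMES` vocabulary, the two renderers, and the "compact in, exploded out" pairing are stand-ins for whatever styles you actually care about. each pair is the same random snippet rendered two ways, and the generator asserts that both renderings parse to the same tree)

```python
import ast
import random

# Hypothetical identifier pool for the generated snippets.
NAMES = ["alpha", "beta", "gamma", "delta"]


def random_call(rng: random.Random) -> tuple[str, list[str]]:
    """Pick a function name and 1-4 keyword arguments with distinct names."""
    func = rng.choice(NAMES)
    keys = rng.sample(NAMES, rng.randint(1, 4))
    args = [f"{key}={rng.randint(0, 99)}" for key in keys]
    return func, args


def render_compact(func: str, args: list[str]) -> str:
    """Style A: everything on one line."""
    return f"result = {func}({', '.join(args)})\n"


def render_exploded(func: str, args: list[str]) -> str:
    """Style B: one argument per line, with a trailing comma."""
    body = "".join(f"    {arg},\n" for arg in args)
    return f"result = {func}(\n{body})\n"


def make_pair(rng: random.Random) -> tuple[str, str]:
    """Return (input, target): one snippet rendered in two styles."""
    func, args = random_call(rng)
    source = render_compact(func, args)
    target = render_exploded(func, args)
    # Both renderings must describe the same program.
    assert ast.dump(ast.parse(source)) == ast.dump(ast.parse(target))
    return source, target
```

calling `make_pair(random.Random(0))` in a loop gives you as many (input, target) pairs as you like; a less toy-like generator would cover far more of the grammar than a single call expression.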