to be clear i have no idea if the implementation is okay; 6% failing tests might as well be most important ones, or hiding fundamental correctness issues. i think this is a very cool experiment though, and also motivates adding tests for things that aren't reflected but are important

0

If you have a fediverse account, you can quote this note from your own instance. Search https://bsky.brid.gy/convert/ap/at://did:plc:fpruhuo22xkm5o7ttr2ktxdo/app.bsky.feed.post/3mfndpwfoec2r on your instance and quote it. (Note that quoting is not supported in Mastodon.)