JustHTML by @EmilStenstromEmil Stenström is a new Python library (no dependencies) that parses HTML according to the HTML5 specification and passes the 9,200 test html5lib-tests suite
It's 3,000 lines of code mostly written by coding agents over a couple of months https://simonwillison.net/2025/Dec/14/justhtml/
