I wonder if there's an easy way to make the Python lxml package use a modern Python HTTP/HTTPS client, like urllib or requests, when loading external entities and DTDs. If I just pass no_network=False to XMLParser, that seems to use a client built into libxml2, which doesn't support HTTPS.
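One route that might work is lxml's custom resolver hook: subclass etree.Resolver, fetch http(s) URLs in Python, and hand the bytes back to the parser. This is only a rough sketch. HTTPResolver, fetch_with_urllib, and make_parser are hypothetical names, and it assumes lxml consults registered resolvers before falling back to libxml2's own loader.

```python
import urllib.request

from lxml import etree


def fetch_with_urllib(url, timeout=10):
    """Download a URL with the standard library, which handles HTTPS."""
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return response.read()


class HTTPResolver(etree.Resolver):
    """Hypothetical resolver that routes http(s) lookups through Python."""

    def __init__(self, fetch=fetch_with_urllib):
        super().__init__()
        self.fetch = fetch  # any callable taking a URL and returning bytes

    def resolve(self, system_url, public_id, context):
        if system_url and system_url.startswith(("http://", "https://")):
            data = self.fetch(system_url)
            # Hand the downloaded bytes to the parser, keeping the original
            # URL as the base for any relative references inside the DTD.
            return self.resolve_string(data, context, base_url=system_url)
        return None  # anything else falls through to the default handling


def make_parser(fetch=fetch_with_urllib):
    # no_network stays True (assumption: the resolver above is asked first,
    # so libxml2's built-in HTTP client is never needed).
    parser = etree.XMLParser(load_dtd=True, resolve_entities=True,
                             no_network=True)
    parser.resolvers.add(HTTPResolver(fetch))
    return parser
```

Something like etree.parse(path, make_parser()) would then pull DTDs over HTTPS. Swapping fetch for a function built on requests, or one that caches to disk, should only need a different callable passed to make_parser.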

Or maybe I need to figure out how to put the DTDs I'm working with into a catalog.
