@xezpeletaXabi yes, using a local LLM works with datasette-extract already

I'd suggest using Ollama to run a model like Gemma 3 or Mistral Small 3.1 (though others could work well too). With the llm-ollama plugin installed, datasette-extract should work fine
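A rough sketch of that setup (model tags and plugin names as I understand them; check the Ollama library for the exact tag you want):

```shell
# Pull a local vision-capable model with Ollama
ollama pull gemma3

# Install the llm-ollama plugin so LLM can talk to Ollama models
llm install llm-ollama

# Install the datasette-extract plugin into Datasette
datasette install datasette-extract
```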

I don't know of any local models that handle PDFs in Ollama yet, but plenty of them can handle images

