I'm sure this is a solved problem somewhere, but a bit of searching didn't find it, so I'm writing my own local web app to periodically download my home timeline (and related accounts, statuses, etc) and stuff it into #duckdb to make a local #searchengine.
This is basically to help with the "I know I saw this on my timeline but it's now > [server cache size] statuses old and I want to find it" problem.
1) Anyone know of a baked solution for this?
2) Am I going to piss people off?
3) Should I respect "noindex" on Statuses even if this is a personal tool?