Best way to search files on remote server?
What's a bajillion? If the OCR output is under a few GB, which is a heck of a lot of text (something like a million pages), just grepping the files is not too bad: maybe a second or two. Beyond that you need search software. Solr (solr.apache.org) is what I'm used to, but there are tons of options.
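If you'd rather script the "just grep it" approach than run grep by hand, here is a minimal sketch in Python. It assumes the OCR output is a tree of plain-text `.txt` files; the function name `grep_tree` and that layout are my assumptions, not anything from the original question. It is roughly what `grep -rin PATTERN DIR` does, so expect it to be slower than real grep on large trees.

```python
import re
from pathlib import Path


def grep_tree(root, pattern, ignore_case=True):
    """Yield (path, line_number, line) for each matching line under root.

    Assumes the OCR output is stored as *.txt files (hypothetical layout).
    """
    flags = re.IGNORECASE if ignore_case else 0
    rx = re.compile(pattern, flags)
    # Walk the tree recursively; sorted() keeps the output order stable.
    for path in sorted(Path(root).rglob("*.txt")):
        if not path.is_file():
            continue
        # errors="replace" avoids crashing on stray bytes, which OCR
        # output sometimes contains.
        with open(path, encoding="utf-8", errors="replace") as fh:
            for lineno, line in enumerate(fh, start=1):
                if rx.search(line):
                    yield str(path), lineno, line.rstrip("\n")
```

Calling `grep_tree("/path/to/ocr", r"invoice\s+\d+")` (the path is a placeholder) gives you an iterator of hits you can print or filter. This reads every file on every query, which is exactly why it stops being practical once the corpus outgrows a few GB and an index like Solr starts to pay off.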