this post was submitted on 09 Nov 2023
3 points (100.0% liked)

Self-Hosted Main

589 readers
1 users here now

A place to share alternatives to popular online services that can be self-hosted without giving up privacy or locking you into a service you don't control.

For Example

We welcome posts that include suggestions for good self-hosted alternatives to popular online services, how they are better, or how they give back control of your data. Also include hints and tips for less technical readers.

Useful Lists

founded 2 years ago
MODERATORS
 

So I'm looking for a service I can run that can search the internal contents of multiple PDFs (multiple 1000+ page reference manuals) for a a phrase/word, similar to Adobe acrobats advance search function.

Bonus points of I can control the scope of which documents it searches through through some sort of interface.

top 4 comments
sorted by: hot top controversial new old
[–] morrowind@lemmy.ml 1 points 2 years ago

https://github.com/phiresky/ripgrep-all

Made by the same phiresky that's been contributing incredible improvements to lemmy

[–] Successful_Try543@feddit.de 1 points 2 years ago* (last edited 2 years ago)

For Linux command line, there is pdfgrep. It can be found e.g. in the official Debian repository.

[–] thekrautboy@alien.top 1 points 2 years ago

Look at the subreddit sidebar and find this: awesome-selfhosted, category document management, and more.

[–] StrykerSigma@alien.top 1 points 2 years ago

You should checkout a free program called Seekfast