this post was submitted on 16 Jul 2024

The original post: /r/datahoarder by /u/The_other_kiwix_guy on 2024-07-15 14:07:22.

This was announced last week at r/Kiwix and I should have crossposted here earlier, but here we go.

Zimit is a (near-)universal website scraper: insert a URL and voilà, a few hours later you can download a fully packaged, single ZIM file that you can store and browse offline using Kiwix.

You can already test it at zimit.kiwix.org (it will crawl up to 1,000 pages; we had to put an arbitrary limit somewhere) or compare this website with its Zimit copy to try and find any differences.
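For anyone wondering what a page cap like that means in practice, here is a minimal sketch of a breadth-first crawl that stops after a fixed number of pages. This is not Zimit's actual code: the function name `capped_crawl`, the `max_pages` parameter, and the toy `site` dictionary (standing in for real fetching and link extraction) are all made up for illustration.

```python
from collections import deque


def capped_crawl(start, links, max_pages=1000):
    """Walk a link graph breadth-first, stopping once max_pages pages are visited.

    `links` maps each page to the pages it links to (a hypothetical stand-in
    for fetching a page and extracting its links).
    """
    seen = {start}          # pages already queued, so nothing is visited twice
    queue = deque([start])  # pages waiting to be visited, oldest first
    visited = []            # pages actually crawled, in crawl order

    while queue and len(visited) < max_pages:
        page = queue.popleft()
        visited.append(page)
        for target in links.get(page, []):
            if target not in seen:
                seen.add(target)
                queue.append(target)
    return visited


# Toy site with five pages; the cap of 3 cuts the crawl short.
site = {
    "/": ["/a", "/b"],
    "/a": ["/c"],
    "/b": ["/c", "/d"],
    "/c": [],
    "/d": [],
}
print(capped_crawl("/", site, max_pages=3))  # ['/', '/a', '/b']
```

With a breadth-first order, the pages closest to the starting URL are the ones that make it in before the cap is reached, which is why a capped copy of a large site tends to keep the top-level pages and drop the deeper ones.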

The important point here is that this new architecture, while far from perfect, is a lot more powerful than what we had before, and also that it does not require Service Workers anymore (a source of constant befuddlement and annoyance, particularly for desktop and iOS users).

As usual, all code is available for free at github.com/openzim/zimit, and the Docker image is here. All existing recipes have already been updated and you can find them at library.kiwix.org (or grab the whole repo at download.kiwix.org/zim, which also contains instructions for mirroring).

If you are not the techie type but know of freely-licensed websites that we should add to our library, please open a zim-request and we will look into it.

Last but not least, remember that Kiwix is run by a non-profit that pushes no ads and collects no data, so please consider making a donation to help keep it running.
