this post was submitted on 12 Jun 2025
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/selfhosted by /u/biolds on 2025-06-11 12:03:19+00:00.


Hey everyone! We're excited to announce the release of Sosse 1.13, the newest version of our open-source search engine, web archiving, and crawling platform.

For those unfamiliar, Sosse (Selenium Open Source Search Engine) lets you:

🔍 Search the full content of web pages, including JavaScript-rendered content

🕵️ Crawl sites on a schedule and detect content changes

📥 Download files in bulk from web pages

📑 Archive web pages (with assets) for full offline access

🔔 Monitor websites and generate Atom feeds for updates

🔒 Authenticate to access protected or private content
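
For anyone who wants to consume those Atom feeds from a script, here is a minimal sketch using only the Python standard library. It assumes nothing about Sosse beyond the feed being standard Atom (RFC 4287). The `atom_entries` helper and the sample feed are made up for illustration and are not part of Sosse.

```python
import xml.etree.ElementTree as ET

# Standard Atom namespace (RFC 4287).
ATOM_NS = {"atom": "http://www.w3.org/2005/Atom"}


def atom_entries(xml_text):
    """Return a (title, updated, link) tuple for each entry in an Atom feed."""
    root = ET.fromstring(xml_text)
    entries = []
    for entry in root.findall("atom:entry", ATOM_NS):
        title = entry.findtext("atom:title", default="", namespaces=ATOM_NS)
        updated = entry.findtext("atom:updated", default="", namespaces=ATOM_NS)
        link_el = entry.find("atom:link", ATOM_NS)
        link = link_el.get("href", "") if link_el is not None else ""
        entries.append((title, updated, link))
    return entries


# Made-up sample feed, only here to show the shape of the input.
SAMPLE_FEED = """<feed xmlns="http://www.w3.org/2005/Atom">
  <title>Example monitor feed</title>
  <entry>
    <title>Example page changed</title>
    <updated>2025-06-11T12:00:00Z</updated>
    <link href="https://example.com/page"/>
  </entry>
</feed>"""
```

Calling `atom_entries(SAMPLE_FEED)` returns one tuple per `<entry>`. In practice you would fetch the feed text from whatever URL your own instance exposes and pass it in.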

🚀 What's new in 1.13?

This release includes powerful new features and improvements to make Sosse more useful and easier to integrate:

  • 🏷️ Support for Document Tagging – Categorize and filter your indexed data
  • 📡 Webhook Triggers During Crawling – Integrate crawling into workflows (AI, automation, notifications, and more)
  • 📤 CSV Export – Export crawl results in a standard format
  • 🐳 Simplified Setup with Docker Compose – Get started faster with pre-configured services
  • 🛠️ Metadata Extraction with Scripting – Use JavaScript or webhooks to scrape and index custom metadata
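
To give an idea of what the receiving end of a webhook trigger could look like, here is a rough sketch of a tiny receiver built on the Python standard library. It assumes the webhook arrives as an HTTP POST with a JSON object body, which is an assumption and not something stated above. It does not rely on any particular field names. `parse_webhook_body`, `WebhookHandler`, `serve`, and the port are all hypothetical names chosen for this example.

```python
import json
from http.server import BaseHTTPRequestHandler, HTTPServer


def parse_webhook_body(raw):
    """Decode a request body as a JSON object; return None if that fails."""
    try:
        data = json.loads(raw.decode("utf-8"))
    except ValueError:  # covers both invalid UTF-8 and invalid JSON
        return None
    return data if isinstance(data, dict) else None


class WebhookHandler(BaseHTTPRequestHandler):
    def do_POST(self):
        length = int(self.headers.get("Content-Length", 0))
        payload = parse_webhook_body(self.rfile.read(length))
        if payload is None:
            # Reject anything that is not a JSON object.
            self.send_response(400)
            self.end_headers()
            return
        # The payload schema is not assumed here, so only the keys are logged.
        print("webhook received, keys:", sorted(payload))
        self.send_response(204)
        self.end_headers()


def serve(host="127.0.0.1", port=8000):
    """Run the receiver until interrupted (host and port are placeholders)."""
    HTTPServer((host, port), WebhookHandler).serve_forever()
```

Run `serve()` and point the webhook at that address. From there the handler is the place to forward the payload to a notification service or another step in your workflow.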

Sosse 1.13 is more powerful, more flexible, and easier to integrate into your data pipelines and research workflows.

🙏 Thank You!

Huge thanks to everyone who provided feedback and suggestions after the 1.12 release. Your input directly shaped the improvements in this version.

We're looking forward to hearing what you think about 1.13! 🚀
