this post was submitted on 28 May 2024

The original post: /r/datahoarder by /u/UltraNigatelo1911 on 2024-05-28 17:33:53.

https://resubscene.vercel.app/

A subtitle database website built on the data that was dumped before Subscene's closure (only Arabic and English subtitles have been extracted so far).

website screenshot

The dump was massive: over 2 million extracted subtitle files (deduplicated, and counting only English and Arabic), more than 75 GB of extracted files, and 1.2 GB of metadata alone.
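
For anyone wondering what deduplicating a pile of subtitle files can look like, here is a minimal sketch in Python. It is not the project's actual code: it assumes duplicates are byte-identical files and keeps the first copy of each, and `dedupe_by_content` is a hypothetical helper name used only for illustration.

```python
import hashlib
from pathlib import Path


def dedupe_by_content(paths):
    """Return one path per distinct file content, keeping the first one seen.

    Assumption: two subtitle files count as duplicates only when their
    bytes are identical (same SHA-256 digest).
    """
    seen = set()
    unique = []
    for path in paths:
        # Hash the raw bytes so encoding differences are not silently merged.
        digest = hashlib.sha256(Path(path).read_bytes()).hexdigest()
        if digest not in seen:
            seen.add(digest)
            unique.append(path)
    return unique
```

Files that differ only in encoding or line endings would not be caught this way; handling those would need a normalization step before hashing.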

The whole goal of this project is to provide a website for accessing the vast number of subtitles accumulated over Subscene's years of operation.

It is also an opportunity to improve on the horrible user experience the original website suffered from: slow and inaccurate search, and no way to download individual .srt or .ass files directly.

I plan to add the missing languages and open-source the whole project alongside the processed data.

Huge thanks to the Subscene dump:

Subscene.com full Dump : r/DataHoarder (reddit.com)
