this post was submitted on 04 Feb 2025
22 points (92.3% liked)

Piracy: ꜱᴀɪʟ ᴛʜᴇ ʜɪɢʜ ꜱᴇᴀꜱ

63271 readers
138 users here now

⚓ Dedicated to the discussion of digital piracy, including ethical problems and legal advancements.

Rules • Full Version

1. Posts must be related to the discussion of digital piracy

2. Don't request invites, trade, sell, or self-promote

3. Don't request or link to specific pirated titles, including DMs

4. Don't submit low-quality posts, be entitled, or harass others



Loot, Pillage, & Plunder

📜 c/Piracy Wiki (Community Edition):

🏴‍☠️ Other communities

FUCK ADOBE!

Torrenting/P2P:

Gaming:


💰 Please help cover server costs.

Ko-Fi Liberapay
Ko-fi Liberapay

founded 2 years ago
MODERATORS
 

Given a hypothetical folder structure like this:

Star.Trek.Discovery.S04E06.German.DL.1080p.BluRay.x264-iNTENTiON/
├── star.trek.discovery.s04e06.german.dl.1080p.bluray.x264-intention.mkv
├── star.trek.discovery.s04e06.german.dl.1080p.bluray.x264-intention.nfo
└── Subs
    ├── star.trek.discovery.s04e06.german.dl.1080p.bluray.x264-intention-eng.idx
    ├── star.trek.discovery.s04e06.german.dl.1080p.bluray.x264-intention-eng.sub
    ├── star.trek.discovery.s04e06.german.dl.1080p.bluray.x264-intention.idx
    └── star.trek.discovery.s04e06.german.dl.1080p.bluray.x264-intention.sub
Star.Trek.Discovery.S04E07.German.DL.1080p.BluRay.x264-iNTENTiON/
├── star.trek.discovery.s04e07.german.dl.1080p.bluray.x264-intention.mkv
├── star.trek.discovery.s04e07.german.dl.1080p.bluray.x264-intention.nfo
└── Subs
    ├── star.trek.discovery.s04e07.german.dl.1080p.bluray.x264-intention-eng.idx
    ├── star.trek.discovery.s04e07.german.dl.1080p.bluray.x264-intention-eng.sub
    ├── star.trek.discovery.s04e07.german.dl.1080p.bluray.x264-intention.idx
    └── star.trek.discovery.s04e07.german.dl.1080p.bluray.x264-intention.sub

4 directories, 12 files

What's the best way to integrate all the subtitles into the corresponding MKV file?

you are viewing a single comment's thread
view the rest of the comments
[–] AnEilifintChorcra@sopuli.xyz 3 points 6 months ago* (last edited 6 months ago) (5 children)

EDIT: https://github.com/gwen-lg/subtile-ocr is better than https://github.com/ruediger/VobSub2SRT Subtitle-ocr had no issue with recognising "I" as "I" while vobsub2srt constantly sees "I" as "|" and blacklisting "|" causes "I" to be recognised as "]" and blacklisting "|" "[" and "]" just leaves a blank space so thanks for the recommendation! I'll be using that to convert VOBSUB to srt instead

for f in ./*.idx; do
    subtile-ocr -l ger -o "${f%.*}.srt" "$f"; done

And then https://mkvtoolnix.download/ to remove any unwanted subtitles from the mkv files

for f in ./*.mkv; do
    outdir="nosubs"
    mkvmerge -o "${outdir}/$f" --no-subtitles "$f"; done

Finally, I merge the srt files with the mkv files

for f in ./nosubs/*.mkv; do
    outdir=addedsubs
    g="${f##*/}"
    mkvmerge -o "${outdir}/$g" "$f" --language 0:ger --track-name 0:German "${g%.*}".srt; done

Once the files in the addedsubs directory look good, I delete all of the other files. You can add it all to a bash file or make an alias or both and run it wherever

[–] Lemmchen@feddit.org 2 points 6 months ago* (last edited 6 months ago) (4 children)

I'm pretty sure MKV can handle VOBSUB. Why do you convert them to .srt before merging them?

Edit:

I've also just found this: https://github.com/elizagamedev/vobsubocr

The most comparable tool to vobsubocr is VobSub2SRT, but vobsubocr has significantly better output, especially for non-English languages, mainly because VobSub2SRT does not do much preprocessing of the image at all before sending it to Tesseract. For example, Tesseract 4.0 expects black text on a white background, which VobSub2SRT does not guarantee, but vobsubocr does. Additionally, vobsubocr splits each line into separate images to take advantage of page segmentation method 7, which greatly improves accuracy of non-English languages in particular.

Edit 2:

And a fork of it, of course: https://github.com/gwen-lg/subtile-ocr

As you seems to not update this project anymore, I have done a fork to continue the project. With subtile-ocr I have use subtile subtile is a fork no longer maintained vobsub crate. With this I was able to :

  • modernise the code by :
    • update dependencies, especially nom who need a lot of code modification.
    • migrate to thiserror and anyhow for error management
  • do some small optim (by reducing a lot the memory allocation count) And it could be a better start to add functionality (like managing .sup: blue-ray subtitle format).
[–] AnEilifintChorcra@sopuli.xyz 1 points 6 months ago

Yeah, mkv can handle VOBSUB. I just prefer text based like srt or ass since you can edit the subtitles to get better timings or changing font/ colour or fix spellings really easily. I also find the VOBSUB a bit blurry around the edges of the text.

If you're happy with the VOBSUB, then the last bit of code above will merge them with the mkv file and they should be automatically on when using vlc.

Thanks for the links, I was thinking about how outdated vobsub2srt was and definitely want to try these instead!

load more comments (3 replies)
load more comments (3 replies)