this post was submitted on 21 Apr 2025
 
The original post: /r/datahoarder by /u/Claude_Jan on 2025-04-20 15:44:19.

Hey folks,

I’m looking for a reliable OCR solution that works well with French text—accents and all. The catch is: I’ve got several hundred photos of book pages to process.

What I’ve tried so far:

  • Tools whose output is too messy to be usable
  • Others that only let you process one image at a time, which isn’t feasible at this scale
  • ChatGPT’s OCR is surprisingly decent overall, but weaker on French: it struggles with accents
  • I also tried some Python libraries locally, but I’m probably missing something, because the results aren’t any better than ChatGPT’s, and the workflow is far less convenient

So if anyone has an up-to-date OCR setup in 2025 that works for bulk image processing in French, I’d love some pointers.
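
For context, here is a minimal sketch of the kind of batch run I'm after, not a setup I've validated. Assumptions: Tesseract with its French language data (`fra`) plus the `pytesseract` and Pillow packages are installed; the `ocr_folder` and `tesseract_french` helpers and the folder names are made up for illustration.

```python
from pathlib import Path
import unicodedata

# File types treated as page photos (an assumption; adjust as needed).
IMAGE_SUFFIXES = {".jpg", ".jpeg", ".png", ".tif", ".tiff"}


def tesseract_french(path):
    """OCR one image with Tesseract's French model.

    Assumes pytesseract, Pillow, and the 'fra' language data are installed.
    Imports are local so the batch logic can be used with another engine.
    """
    import pytesseract
    from PIL import Image

    with Image.open(path) as img:
        return pytesseract.image_to_string(img, lang="fra")


def ocr_folder(src, dst, ocr=tesseract_french):
    """Run `ocr` on every image in `src`, writing one UTF-8 .txt per page to `dst`."""
    src, dst = Path(src), Path(dst)
    dst.mkdir(parents=True, exist_ok=True)
    written = []
    for image in sorted(src.iterdir()):
        if image.suffix.lower() not in IMAGE_SUFFIXES:
            continue  # skip anything that is not a page photo
        # NFC merges "e" + combining accent into a single "é" character,
        # so accented words compare and search consistently later on.
        text = unicodedata.normalize("NFC", ocr(image))
        out = dst / (image.stem + ".txt")
        out.write_text(text, encoding="utf-8")
        written.append(out)
    return written
```

Usage would be something like `ocr_folder("pages", "text")`. The `ocr` argument is there so a different engine can be swapped in without touching the loop.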

Thanks in advance!
