460
DOGE employee (lemmy.world)
you are viewing a single comment's thread
view the rest of the comments
[-] MonkderVierte@lemmy.ml 5 points 2 weeks ago

Selecting text doesn't work in most multi-column pdfs and good OCR cost money. And if the original source is lost and you want an exact copy in word, the OCR tools need to be really good at guessing whitespace-to-line ratio, because pdf is only an output format and not a processing format.

For most other converting needs, there's pandoc, imagemagick and ffmpeg.

this post was submitted on 07 Feb 2025
460 points (98.1% liked)

Programmer Humor

32707 readers
130 users here now

Post funny things about programming here! (Or just rant about your favourite programming language.)

Rules:

founded 5 years ago
MODERATORS