How do you encode your paper scans?
(lemmy.ml)
PDF/A
And how do you encode the images of the scan contained in the PDF/A? That's the crux here.
I'm not sure I understand. I just scan everything and let my software spit out PDF/A.
PDF/A is not an image format. As a document, it may contain images.
My PDF/A documents contain all kinds of content, including text and images. To me, it doesn't matter what format the images are encoded in, as long as I can open them 20 years from now. Why would one care one way or the other?
I care that the text remains readable (both to me and also software) and that I don't balloon my storage out of control.
JPEG, even at higher quality settings, subjectively degrades text in particular, to a degree that makes me worry about readability. PNG makes me worry about storage.
My current plan is to go with PNG, since storage is a relatively cheap problem to fix while data loss is pretty much permanent.
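To get a rough feel for the storage side of that tradeoff, here is a minimal sketch using only Python's standard library. It is not a real scan pipeline: the page is synthetic (white background with regular dark bands standing in for text), the A4 at 300 DPI, 8-bit grayscale settings are assumptions, and plain zlib is used as a stand-in for PNG, which also uses DEFLATE but adds per-row filtering. A real scan with paper texture and sensor noise will compress far worse than this, so treat the ratio as an optimistic bound.

```python
import zlib

# Assumed scan settings: A4 at 300 DPI, 8-bit grayscale (one byte per pixel).
WIDTH, HEIGHT = 2480, 3508


def synthetic_page(width: int, height: int) -> bytes:
    """Build a fake page: white background with dark bands imitating text lines."""
    white_row = bytes([255]) * width
    # A 'text' row alternates short dark and light runs, loosely like glyph strokes.
    text_row = (bytes([0]) * 6 + bytes([255]) * 10) * (width // 16)
    text_row += bytes([255]) * (width - len(text_row))  # pad to full width
    rows = []
    for y in range(height):
        # 30 rows of 'text' followed by 30 rows of blank line spacing.
        rows.append(text_row if (y % 60) < 30 else white_row)
    return b"".join(rows)


raw = synthetic_page(WIDTH, HEIGHT)
packed = zlib.compress(raw, level=9)   # lossless DEFLATE, as used inside PNG
restored = zlib.decompress(packed)

print(f"raw size:        {len(raw):>10} bytes")
print(f"compressed size: {len(packed):>10} bytes")
print(f"lossless round trip: {restored == raw}")
```

The uncompressed page is about 8.7 MB (2480 × 3508 bytes). The round-trip check is the property that matters for archival: every pixel comes back exactly, which JPEG cannot promise at any quality setting.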