[-] kayohtie@pawb.social 21 points 2 months ago

You left out what I feel is the best part: even in the "uncompressed" mode, even when that was disabled, it was still happening sometimes.

[-] GamingChairModel@lemmy.world 12 points 2 months ago

To be precise, the "lossless" compression is still a compression algorithm. They just didn't implement the steps that actually make the compression algorithm lossless.

From the write up:

JBIG2, the image format used in the affected PDFs, usually has lossless and lossy operation modes. „Pattern Matching & Substitution“ (PM&S) is one of the standard operation modes for lossy JBIG2, and „Soft Pattern Matching“ (SPM) for lossless JBIG2 (Read here or read the paper by Paul Howard et al.). In the JBIG2 standard, the named techniques are called „Symbol Matching“.

PM&S is lossy, SPM is lossless. Both operation modes have the basics in common: Images are cut into small segments, which are grouped by similarity. For every group, only a representative segment is saved, and it gets reused in place of the other group members, which may cause character substitution. Unlike PM&S, SPM corrects such errors by additionally saving difference images containing the differences between the reused symbols and the original image. This correction step seems to have been left out by Xerox.
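
To make the difference concrete, here is a toy Python sketch of the two modes described in the quote. It is not the JBIG2 codec and not Xerox's code: the 3x5 glyph bitmaps, the Hamming-distance threshold, and the `encode`/`decode` helpers are all made up for illustration, and a plain XOR residual stands in for whatever refinement coding a real encoder would use.

```python
from typing import List, Optional, Tuple

# Hypothetical glyph format: one int per pixel row, used as a bitmask.
Glyph = Tuple[int, ...]
Ref = Tuple[int, Optional[Glyph]]  # (dictionary index, optional XOR residual)

SIX: Glyph = (0b111, 0b100, 0b111, 0b101, 0b111)
EIGHT: Glyph = (0b111, 0b101, 0b111, 0b101, 0b111)


def hamming(a: Glyph, b: Glyph) -> int:
    """Count the pixels that differ between two equally sized glyphs."""
    return sum(bin(x ^ y).count("1") for x, y in zip(a, b))


def encode(glyphs: List[Glyph], threshold: int, lossless: bool):
    """Group similar glyphs under one representative.

    lossless=False behaves like PM&S: only the representative is kept.
    lossless=True behaves like SPM: a difference image is stored as well.
    """
    dictionary: List[Glyph] = []
    refs: List[Ref] = []
    for g in glyphs:
        for idx, rep in enumerate(dictionary):
            if hamming(g, rep) <= threshold:
                break  # close enough: reuse this representative
        else:
            dictionary.append(g)  # no match: g becomes a new representative
            idx = len(dictionary) - 1
        rep = dictionary[idx]
        diff = tuple(x ^ y for x, y in zip(g, rep)) if lossless else None
        refs.append((idx, diff))
    return dictionary, refs


def decode(dictionary: List[Glyph], refs: List[Ref]) -> List[Glyph]:
    """Rebuild glyphs, applying the residual when one was stored."""
    out: List[Glyph] = []
    for idx, diff in refs:
        rep = dictionary[idx]
        out.append(rep if diff is None else tuple(x ^ y for x, y in zip(rep, diff)))
    return out


page = [SIX, EIGHT]  # the two glyphs differ by a single pixel
lossy = decode(*encode(page, threshold=1, lossless=False))
exact = decode(*encode(page, threshold=1, lossless=True))
print(lossy == [SIX, SIX])  # the 8 has silently turned into a 6
print(exact == page)        # the residual restores the original 8
```

With the residual dropped, the decoder has no way to know the second glyph was ever an 8, which is the character-substitution failure the write-up describes.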

[-] kayohtie@pawb.social 2 points 2 months ago

TIL! Thank you for the added detail. I hadn't read the full write-up, but I had watched his presentation in English, and it was wild to hear.

this post was submitted on 19 Aug 2025
461 points (98.3% liked)

Science Memes
