Password123! (media.piefed.social)
[-] edinbruh@feddit.it 6 points 17 hours ago

did they open access their model weights?

In that instance it wasn't really training, it was crowdsourcing the transcription. reCAPTCHA would pull out a word from their book archive that the OCR failed to recognise, and if many people identified it as the same word, that transcription would be archived. Now that reCAPTCHA has been purchased by Google, the archive and the transcriptions are available on Google Books.
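
To make the "many people identified it as the same word" part concrete, here is a rough sketch of that kind of agreement check. It is only an illustration: the function name `consensus_word` and the thresholds `min_votes` and `min_share` are made up for this example, not reCAPTCHA's actual logic or values.

```python
from collections import Counter


def consensus_word(answers, min_votes=3, min_share=0.75):
    """Return the agreed transcription, or None if there is no consensus yet.

    min_votes and min_share are hypothetical thresholds chosen for this sketch.
    """
    # Normalise so "Morning " and "morning" count as the same answer.
    cleaned = [a.strip().lower() for a in answers if a.strip()]
    if not cleaned:
        return None
    # Most frequent answer and how many people gave it.
    word, votes = Counter(cleaned).most_common(1)[0]
    if votes >= min_votes and votes / len(cleaned) >= min_share:
        return word
    return None


# Three of four people agree, so the word is accepted.
print(consensus_word(["morning", "Morning ", "morning", "moming"]))  # morning
```

A word without enough agreeing answers just returns `None` and would keep being shown to more people.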

They stopped doing this once AI became more effective than reCAPTCHA for book transcriptions.

Modern CAPTCHA actually is about training models. But old, classic reCAPTCHA was really just about book transcriptions, and those are available.

[-] ulterno@programming.dev 0 points 16 hours ago

Nice.
Looks like they did make good use of the opportunity.

this post was submitted on 11 Feb 2026
569 points (99.1% liked)

Programmer Humor
