95

cross-posted from: https://lemmy.ml/post/44059967

for those not familiar with Mark Pilgrim, he is/was a prolific author, blogger, and hacker who abruptly disappeared from the internet in 2011.

cross-posted from: https://lemmy.bestiver.se/post/968527

HN comments

you are viewing a single comment's thread
view the rest of the comments
[-] sem@piefed.blahaj.zone 2 points 20 hours ago

So no one is going to say what chardet is, huh.

[-] cypherpunks@lemmy.ml 4 points 19 hours ago* (last edited 19 hours ago)

It's a library for detecting which character encoding a string is encoded with.

Here are the docs for the vibe-coded rewrite, and here is the version before it.

The new vibe-coded version also adds language detection; it isn't clear to me why the current version of the readme shows it classifying the string "It’s a lovely day — let’s grab coffee." as Spanish with 99% confidence, without any comment in the docs about that being a misclassification, but I guess that if the LLM-authored program says it is then that must be one of those phrases that looks the same in Spanish as in English 👀

this post was submitted on 05 Mar 2026
95 points (99.0% liked)

Technology

42081 readers
159 users here now

This is the official technology community of Lemmy.ml for all news related to creation and use of technology, and to facilitate civil, meaningful discussion around it.


Ask in DM before posting product reviews or ads. All such posts otherwise are subject to removal.


Rules:

1: All Lemmy rules apply

2: Do not post low effort posts

3: NEVER post naziped*gore stuff

4: Always post article URLs or their archived version URLs as sources, NOT screenshots. Help the blind users.

5: personal rants of Big Tech CEOs like Elon Musk are unwelcome (does not include posts about their companies affecting wide range of people)

6: no advertisement posts unless verified as legitimate and non-exploitative/non-consumerist

7: crypto related posts, unless essential, are disallowed

founded 6 years ago
MODERATORS