95
you are viewing a single comment's thread
view the rest of the comments
[-] sparky@lemmy.federate.cc 80 points 1 month ago* (last edited 1 month ago)

This kind of seems like a non-article to me. LLMs are trained on the corpus of written text that exists out in the world, which are overwhelmingly standard English. American dialects effectively only exist while spoken, be it a regional or city dialect, the black or chicano dialect, etc. So how would LLMs learn them? Seems like not a bias by AI models themselves, rather a reflection of the source material.

[-] BlackEco@lemmy.blackeco.com 27 points 1 month ago* (last edited 1 month ago)

Seems like not a bias by Al models themselves, rather a reflection of the source material.

That's what is usually meant by AI bias: a bias in the material used to train the model that reflects in its behavior

[-] 30p87@feddit.org 19 points 1 month ago

But why is it even mentioned then? It's FUCKING OBVIOUS. It's like saying "AIs are biased towards english and neglect latin" or smth ffs

[-] n2burns@lemmy.ca 11 points 1 month ago

Great comparison, a dialect used by millions of people to a dead language. It really shows how much you care about the people who speak that dialect...

[-] 30p87@feddit.org 4 points 1 month ago

AIs are trained on what is written in the Internet. Latin is not spoken, it's written. But even then, it's rarely used. African american is a dialect, it's only present in speech.

[-] MostlyBlindGamer@rblind.com 9 points 1 month ago

You need to get out more. I totally get that you would think that’s the case, but only if you’re not exploring parts of the internet outside your bubble. It’s absolutely written.

[-] curbstickle@lemmy.dbzer0.com 4 points 1 month ago

There are actually quite a few books written in AAVE...the earliest I'm aware of is their eyes were watching god, from the 1930s. The Color Purple, Beloved, The Sellout, the books of Chester Himes...

load more comments (5 replies)
load more comments (5 replies)
load more comments (9 replies)
this post was submitted on 29 Aug 2024
95 points (100.0% liked)

Technology

37573 readers
361 users here now

A nice place to discuss rumors, happenings, innovations, and challenges in the technology sphere. We also welcome discussions on the intersections of technology and society. If it’s technological news or discussion of technology, it probably belongs here.

Remember the overriding ethos on Beehaw: Be(e) Nice. Each user you encounter here is a person, and should be treated with kindness (even if they’re wrong, or use a Linux distro you don’t like). Personal attacks will not be tolerated.

Subcommunities on Beehaw:


This community's icon was made by Aaron Schneider, under the CC-BY-NC-SA 4.0 license.

founded 2 years ago
MODERATORS